We have a time series data, in this video we will extend our discussion further and look at various functions from lubridate package that are used to extract the information from date-time objects. For e.g. if I want to extract only year from date object, or only month from date object..I can able to do this using various extract functions.
In this video, we will carry out exploratory data analysis of flights data from nycflights13 package. We will filter out Delta Airways and try to answer below questions:
On average, Which month has the largest arrival delay?
On average, Which day of a week has the largest arrival delay?
On average, Which hour of a day has the largest arrival delay?
We will start with month and drill down to day and hour. We will use various functions from lubridate package to extract out the month, day and hour information from time-series data.
The Rmarkdown file can be found at Git Repo:
https://github.com/siddharth-sahasrabudhe/time-series-data/blob/main/extract_information.Rmd
The dplyr package video series can be found at:
https://youtu.be/Kxt5H0LM-1A – How to use mutate function
https://youtu.be/JMHJReB00ZA – Generate data summary using group_by ( ) and summarise ( ) functions
ggplot2 package video series can be found at:
https://youtu.be/wUWwL5ksIXM – How to create a bar chart
Link to previous video on lubridate package is:
https://youtu.be/LBh1QS-4Og0- Parsig time-series data.
1