Manipulaton of Time Series Data

 


We have a time series data, in this video we will extend our discussion further and look at various functions from lubridate package that are used to extract the information from date-time objects. For e.g. if I want to extract only year from date object, or only month from date object..I can able to do this using various extract functions. 

In this video, we will carry out exploratory data analysis of flights data from nycflights13 package. We will filter out Delta Airways and try to answer below questions:
On average, Which month has the largest arrival delay?
On average, Which day of a week has the largest arrival delay?
On average, Which hour of a day has the largest arrival delay?

We will start with month and drill down to day and hour. We will use various functions from lubridate package to extract out the month, day and hour information from time-series data.

The Rmarkdown file can be found at Git Repo:
https://github.com/siddharth-sahasrabudhe/time-series-data/blob/main/extract_information.Rmd

The dplyr package video series can be found at:
https://youtu.be/Kxt5H0LM-1A - How to use mutate function
https://youtu.be/JMHJReB00ZA - Generate data summary using group_by ( ) and summarise ( ) functions

ggplot2 package video series can be found at:
https://youtu.be/wUWwL5ksIXM - How to create a bar chart

Link to previous video on lubridate package is:
https://youtu.be/LBh1QS-4Og0- Parsig time-series data.

We have a time series data, in this video we will extend our discussion further and look at various functions from lubridate package that are used to extract the information from date-time objects. For e.g. if I want to extract only year from date object, or only month from date object..I can able to do this using various extract functions.

In this video, we will carry out exploratory data analysis of flights data from nycflights13 package. We will filter out Delta Airways and try to answer below questions:
On average, Which month has the largest arrival delay?
On average, Which day of a week has the largest arrival delay?
On average, Which hour of a day has the largest arrival delay?

We will start with month and drill down to day and hour. We will use various functions from lubridate package to extract out the month, day and hour information from time-series data.

The Rmarkdown file can be found at Git Repo:
https://github.com/siddharth-sahasrabudhe/time-series-data/blob/main/extract_information.Rmd

The dplyr package video series can be found at:
https://youtu.be/Kxt5H0LM-1A – How to use mutate function
https://youtu.be/JMHJReB00ZA – Generate data summary using group_by ( ) and summarise ( ) functions

ggplot2 package video series can be found at:
https://youtu.be/wUWwL5ksIXM – How to create a bar chart

Link to previous video on lubridate package is:
https://youtu.be/LBh1QS-4Og0- Parsig time-series data.

1 0

YouTube Video UExCdnFrRHlDc2x4eEw2M2NMTHVUaFJEOVdUb2REaVRZYS4yODlGNEE0NkRGMEEzMEQy

lubridate Package| how to Extract date and time component from time series data?

Plotly Analytics – Giving Life to Data April 6, 2024 12:43 am


Scroll to Top