In this section, you will learn how dplyr
functions can be used to ask interesting questions of a more complex data set You’ll use a data set of flights that departed from New York city airports (including Newark, John F. Kennedy, and Laguardia airports) in 2013. This data set is also featured online in the Introduction to dplyr vignette, and is drawn from the Bureau of Transportation Statistics database.
This dataset will use over 300,000 observations to ask the following questions:
- Which airline has the highest number of delayed departures?
- On average, to which airport do flights arrive most early?
- In which month do flights tend to have the longest delays?
See all code in the analysis.R
file.