Predict Flight Delays
You are a data scientist at a company that provides lounges to airport travelers. They have realized that demand peaks whenever flights are canceled or delayed.
They have asked you whether you can develop a model to predict which flights will be delayed or canceled. You can access a large dataset on flights from 2022 to help train your model. They would be even happier if you could also predict the magnitude of delays.
Your report should include your conclusions, including whether and how stakeholders should use your model.
The original source of the data (prior to reduction for runtime purposes by DataCamp) can be found here, and the data dictionary can be found in the
data_dictionary.ipynb file in your file browser!
import pandas as pd df = pd.read_csv("flights.csv") df