Beta

This dataset captures the details of how CO2 emissions by a vehicle can vary with the different features. The dataset has been taken from Canada Government official open data website which contains data for over a period of 7 years.
**There are total 7385 rows and 12 columns.**
Listed below is few abbreviations to describe different data points.

**Model**

- 4WD/4X4 = Four-wheel drive
- AWD = All-wheel drive
- FFV = Flexible-fuel vehicle
- SWB = Short wheelbase
- LWB = Long wheelbase
- EWB = Extended wheelbase

**Transmission**

- A = Automatic
- AM = Automated manual
- AS = Automatic with select shift
- AV = Continuously variable
- M = Manual
- 3 - 10 = Number of gears

**Fuel type**

- X = Regular gasoline
- Z = Premium gasoline
- D = Diesel
- E = Ethanol (E85)
- N = Natural gas

**Fuel Consumption**
City and highway fuel consumption ratings are shown in litres per 100 kilometres (L/100 km) - the combined rating (55% city, 45% hwy) is shown in L/100 km and in miles per gallon (mpg)

**CO2 Emissions**
The tailpipe emissions of carbon dioxide (in grams per kilometre) for combined city and highway driving

## LOADING THE DATA

```
# Import the pandas and numpy packages
import pandas as pd
import numpy as np
# Load the data
cars = pd.read_csv('data/co2_emissions_canada.csv')
# create numpy arrays
cars_makes = cars['Make'].to_numpy()
cars_models = cars['Model'].to_numpy()
cars_classes = cars['Vehicle Class'].to_numpy()
cars_engine_sizes = cars['Engine Size(L)'].to_numpy()
cars_cylinders = cars['Cylinders'].to_numpy()
cars_transmissions = cars['Transmission'].to_numpy()
cars_fuel_types = cars['Fuel Type'].to_numpy()
cars_fuel_consumption = cars['Fuel Consumption Comb (L/100 km)'].to_numpy()
cars_co2_emissions = cars['CO2 Emissions(g/km)'].to_numpy()
# Preview the dataframe
cars.sample(20)
```

## Exploratory Data Analysis

```
cars.info()
cars.describe()
```

#### Look at the first ten items in the CO2 emissions array

```
cars_co2_emissions[:10]
```

### 1. There are total 42 types of car brand.

### 2. There are total 2053 unique car model.

### 3. There are total 16 types of vehicle class basis on their gross vehicle weight rating (GVWR) and volume index.

#### 1. What is the median engine size in liters?

```
cars['Engine Size(L)'].median()
```

#### 2. What is the average fuel consumption for regular gasoline (Fuel Type = X), premium gasoline (Z), ethanol (E), and diesel (D)?

```
cars.groupby(['Fuel Type'])['Fuel Consumption Comb (L/100 km)'].mean()
```

#### 3. What is the correlation between fuel consumption and CO2 emissions?