Beta
Introduction to Statistics in R
Run the hidden code cell below to import the data used in this course.
Take Notes
Add notes about the concepts you've learned and code cells with code you want to keep.
Difference between descriptive and inferential statistics
Median is always better than mean in skewed data
Most data on left of histogram is left-skewed and vice-versa
You can find the sd by taking sqrt of var -> sqrt(var(variable name))
Code for looking at deciles
# Calculate the deciles of co2_emission
quantile(food_consumption$co2_emission, probs = seq(0,1,0.1))
producing more numbers makes the data harder to interpret on the fly
Interquartile range, or IQR, is another way of measuring spread that's less influenced by outliers. IQR is also often used to find outliers. If a value is less than
or greater than
, it's considered an outlier. In fact, this is how the lengths of the whiskers in a ggplot2 box plot are calculated.
refresh mutate function to create new dataset columns
Add your notes here
# Add your code snippets here