Introduction to Statistics in R
  • AI Chat
  • Code
  • Report
  • Beta
    Spinner

    Introduction to Statistics in R

    Run the hidden code cell below to import the data used in this course.

    Intro to Statistics ->

    • Descriptive statistics focuses on describing and summarizing the data at hand. After asking four friends how they get to work, we can see that 50% of them drive to work, 25% ride the bus, and 25% bike. These are examples of descriptive statistics.
    • Inferential statistics uses the data at hand, which is called sample data, to make inferences about a larger population. We could use inferential statistics to figure out what percent of people drive to work based on our sample data.

    Types of Data ->

    There are two main types of data.

    • Numeric (quantitative data) is made up of numeric values.
    • Categorical (qualitative data) is made up of values that belong to distinct groups.

    msleep

    WAYS TO MEASURE CENTER - MEAN, MEDIAN, MODE

    calculate mean

    mean(msleep$sleep_total) 

    calculate median

    • The median is the value where 50% of the data is lower than it, and 50% of the data is higher. We can calculate this by sorting all the data points and taking the middle one
    sort(msleep$sleep_total) #way 1
    sort(msleep$sleep_total)[42]
    median(msleep$sleep_total) #way 2

    calculate mode

    • most frequent value in the data
    library(dplyr)
    msleep %>% count(sleep_total, sort = TRUE)

    There are 4 mammals that sleep for 12.5 hours, so this is the mode.