Skip to content

Introduction to Statistics in R

Run the hidden code cell below to import the data used in this course.


1 hidden cell

Take Notes

Add notes about the concepts you've learned and code cells with code you want to keep.

Difference between descriptive and inferential statistics
Median is always better than mean in skewed data
Most data on left of histogram is left-skewed and vice-versa
You can find the sd by taking sqrt of var -> sqrt(var(variable name))
Code for looking at deciles 
# Calculate the deciles of co2_emission
quantile(food_consumption$co2_emission, probs = seq(0,1,0.1))
producing more numbers makes the data harder to interpret on the fly

Interquartile range, or IQR, is another way of measuring spread that's less influenced by outliers. IQR is also often used to find outliers. If a value is less than 
 or greater than 
, it's considered an outlier. In fact, this is how the lengths of the whiskers in a ggplot2 box plot are calculated.
refresh mutate function to create new dataset columns

Add your notes here

# Add your code snippets here