Loved by learners at thousands of companies
Surveys are often used to study health behavior and determine the risks of disease. Meanwhile, seemingly every day, news outlets publish a different "research says" article about how to lose weight (fast! with no effort at all!). In this project, you will use survey data of ~20k people sampled from the United States to explore health behaviors associated with lower Body Mass Index (BMI), a standardized measure of healthy weight and obesity. Surveys with complex designs use special statistical methods to incorporate sampling weights and design factors into the estimation and inference. Incorporating survey design methods, you will use multiple regression to handle confounders when testing whether physical activity is associated with lower BMI. This project will use [National Health and Nutrition Examination Survey (NHANES)](https://www.cdc.gov/nchs/nhanes/index.htm) data from ~20,000 participants surveyed in years 2009-2012 found in the [NHANES R package](https://www.rdocumentation.org/packages/NHANES/versions/2.1.0/topics/NHANES).
- 1Survey of BMI and physical activity
- 2Visualize survey weight and strata variables
- 3Specify the survey design
- 4Subset the data
- 5Visualizing BMI
- 6Is BMI lower in physically active people?
- 7Could there be confounding by smoking? (part 1)
- 8Could there be confounding by smoking? (part 2)
- 9Add smoking in the mix
- 10Incorporate possible confounding in the model
- 11What does it all mean?
Assistant Professor of Biostatistics at Oregon Health & Science University
Jessica is an Assistant Professor of Biostatistics in the OHSU-PSU School of Public Health at Oregon Health & Science University. Her statistical research interests include risk prediction with high dimensional data sets and the analysis of genetic and other omics data. She is passionate about teaching R and programming, reproducible research, and open science.
What do other learners have to say?
I've used other sites—Coursera, Udacity, things like that—but DataCamp's been the one that I've stuck with.
Devon Edwards Joseph
Lloyds Banking Group
DataCamp is the top resource I recommend for learning data science.
Harvard Business School
DataCamp is by far my favorite website to learn from.
Decision Science Analytics, USAA