Skip to main content
HomeR

Course

Modeling with Data in the Tidyverse

IntermediateSkill Level
4.8+
215 reviews
Updated 09/2022
Discover different types in data modeling, including for prediction, and learn how to conduct linear regression and model assessment measures in the Tidyverse.
Start Course for Free
RProbability & Statistics4 hr17 videos49 Exercises3,900 XP27,103Statement of Accomplishment

Create Your Free Account

or

By continuing, you accept our Terms of Use, our Privacy Policy and that your data is stored in the USA.

Loved by learners at thousands of companies

Group

Training 2 or more people?

Try DataCamp for Business

Course Description

In this course, you will learn to model with data. Models attempt to capture the relationship between an outcome variable of interest and a series of explanatory/predictor variables. Such models can be used for both explanatory purposes, e.g. "Does knowing professors' ages help explain their teaching evaluation scores?", and predictive purposes, e.g., "How well can we predict a house's price based on its size and condition?" You will leverage your tidyverse skills to construct and interpret such models. This course centers around the use of linear regression, one of the most commonly-used and easy to understand approaches to modeling. Such modeling and thinking is used in a wide variety of fields, including statistics, causal inference, machine learning, and artificial intelligence.

Prerequisites

Data Manipulation with dplyr
1

Introduction to Modeling

This chapter will introduce you to some background theory and terminology for modeling, in particular, the general modeling framework, the difference between modeling for explanation and modeling for prediction, and the modeling problem. Furthermore, you'll start performing your first exploratory data analysis, a crucial first step before any formal modeling.
Start Chapter
2

Modeling with Basic Regression

Equipped with your understanding of the general modeling framework, in this chapter, we'll cover basic linear regression where you'll keep things simple and model the outcome variable y as a function of a single explanatory/ predictor variable x. We'll use both numerical and categorical x variables. The outcome variable of interest in this chapter will be teaching evaluation scores of instructors at the University of Texas, Austin.
Start Chapter
3

Modeling with Multiple Regression

In the previous chapter, you learned about basic regression using either a single numerical or a categorical predictor. But why limit ourselves to using only one variable to inform your explanations/predictions? You will now extend basic regression to multiple regression, which allows for incorporation of more than one explanatory or one predictor variable in your models. You'll be modeling house prices using a dataset of houses in the Seattle, WA metropolitan area.
Start Chapter
4

Model Assessment and Selection

In the previous chapters, you fit various models to explain or predict an outcome variable of interest. However, how do we know which models to choose? Model assessment measures allow you to assess how well an explanatory model "fits" a set of data or how accurate a predictive model is. Based on these measures, you'll learn about criteria for determining which models are "best".
Start Chapter
Modeling with Data in the Tidyverse
Course
Complete

Earn Statement of Accomplishment

Add this credential to your LinkedIn profile, resume, or CV
Share it on social media and in your performance review
Enroll Now

Don’t just take our word for it

*4.8
from 215 reviews
90%
9%
0%
0%
0%
  • Anna
    6 hours ago

  • julio
    3 days ago

  • ‪IBRAHIM
    3 days ago

  • Carlos
    4 days ago

    BOM!

  • Brianne
    6 days ago

  • Methila
    last week

    Amazing teacher! Best explanations.

Anna

julio

"BOM!"

Carlos

FAQs

What type of modeling does this course focus on?

The course centers on linear regression, one of the most commonly used modeling approaches, covering both basic single-variable and multiple regression with the tidyverse.

What datasets are used for hands-on practice?

You will model teaching evaluation scores for University of Texas instructors and house prices from the Seattle metropolitan area housing market.

Does the course explain the difference between explanatory and predictive modeling?

Yes. Chapter 1 introduces both approaches, explaining when you want to understand why something happens versus when you want to predict future outcomes.

What R prerequisites are needed?

You need Data Manipulation with dplyr and Introduction to the Tidyverse. These provide the core data wrangling and visualization skills used throughout.

How will I learn to choose between different models?

Chapter 4 covers model assessment measures that evaluate how well models fit data or predict outcomes, giving you criteria to determine which model is best.

Join over 19 million learners and start Modeling with Data in the Tidyverse today!

Create Your Free Account

or

By continuing, you accept our Terms of Use, our Privacy Policy and that your data is stored in the USA.

Grow your data skills with DataCamp for Mobile

Make progress on the go with our mobile courses and daily 5-minute coding challenges.