Kevin Jolly has completed

Introduction to Statistical Modeling in R

4 hr

3,250 XP

Loved by learners at thousands of companies

Course Description

Introduction Statistical Modeling in R is a multi-part course designed to get you up to speed with the most important and powerful methodologies in statistical modeling in R.

In In this introduction, we’ll take a look at what statistical modeling is and what it’s used for, R tools for model building, using models for prediction (and using prediction to test models), and how to account for the combined influences of multiple variables.This course has been written from scratch, specifically for DataCamp users. As you’ll see, by using computing and concepts from machine learning, we’ll be able to leapfrog many of the marginal and esoteric topics encountered in traditional ’regression’ courses.

The intermediate course will get you up to speed with the most important and powerful methodologies in statistics.

For Business

Training 2 or more people?

Get your team access to the full DataCamp platform, including all the features.

1
What is statistical modeling?
Free
This chapter explores what a statistical model is, R objects which build models, and the basic R notation, called formulas used for models.
Play Chapter Now
Welcome to statistical modeling!
50 xp
A mathematical model
50 xp
Running experiments on the toy model
50 xp
From experimental results to a prediction
100 xp
R objects for statistical modeling
50 xp
Accessing data
100 xp
Starting with formulas
100 xp
Graphics with formulas
100 xp
2
Designing, training, and evaluating models
In this chapter, you'll start building models: specifying what variables models should relate to one another and training models on the available data. You'll also provide new inputs to models to generate the corresponding outputs.
Play Chapter Now
Designing and training models
50 xp
Modeling running times
100 xp
Using the recursive partitioning model architecture
100 xp
Will they run again?
100 xp
Evaluating models
50 xp
From inputs to outputs
100 xp
Extrapolation
100 xp
Typical values of data
100 xp
3
Assessing prediction performance
This chapter is about techniques for deciding whether an explanatory variable improves the prediction performance of a model. You'll use cross validation to compare different models.
Play Chapter Now
Choosing explanatory variables
50 xp
Conceptual warm-up
50 xp
Running experience
100 xp
Prediction performance
100 xp
Where's the statistics?
100 xp
Cross validation
50 xp
Tidying up
50 xp
Testing and training datasets
100 xp
Repeating random trials
50 xp
To add or not to add (an explanatory variable)?
100 xp
4
Exploring data with models
This chapter is about constructing models to explore masses of data, for instance to generate hypotheses about what factors are important in how a system works. You'll see how the recursive partitioning model architecture, which has an internal logic for selecting explanatory variables, can be used to explore potentially complex relationships among variables. The chapter also covers the evaluation of prediction performance in models where the response variable is categorical, that is, models used for classification.
Play Chapter Now
Prediction error for categorical response variables
50 xp
The maximum error rate
100 xp
A non-null model
100 xp
A better model?
100 xp
Exploring data for relationships
50 xp
Evaluating a recursive partitioning model
50 xp
Exploring birth-weight data
50 xp
Exploring more broadly
50 xp
5
Covariates and effect size
Real-world systems are complicated. To faithfully reflect that complexity, models can incorporate multiple explanatory variables. This chapter introduces the notion of covariates and how they allow you to model the effect of an explanatory variable while taking into account the effects of other variables.
Play Chapter Now
Covariates
50 xp
House prices
100 xp
Crime and poverty
100 xp
Equal pay?
100 xp
Effect size
50 xp
Sex and death
50 xp
Comparing effect sizes
50 xp
How do GPAs compare?
100 xp
Housing units
50 xp

For Business

Training 2 or more people?

Get your team access to the full DataCamp platform, including all the features.

datasets

Ran twice 100 Runners

collaborators

Nick Carchedi

Tom Jeon

prerequisites

Introduction to R Introduction to the Tidyverse

Daniel Kaplan

DeWitt Wallace Professor at Macalester College

Join over 18 million learners and start Introduction to Statistical Modeling in R today!

Create Your Free Account

By continuing, you accept our Terms of Use, our Privacy Policy and that your data is stored in the USA.

Introduction to Statistical Modeling in R

Loved by learners at thousands of companies

Course Description

.css-10r9e5n{-webkit-margin-end:8px;margin-inline-end:8px;}.css-1309hh9{-webkit-flex-shrink:0;-ms-flex-negative:0;flex-shrink:0;-webkit-margin-end:8px;margin-inline-end:8px;}Training 2 or more people?

What is statistical modeling?

Designing, training, and evaluating models

Assessing prediction performance

Exploring data with models

Covariates and effect size

Training 2 or more people?

Join over .css-ou6dz6{color:#03ef62;}18 million learners and start Introduction to Statistical Modeling in R today!

Create Your Free Account

Training 2 or more people?

Join over 18 million learners and start Introduction to Statistical Modeling in R today!