Interactive Course

Machine Learning Toolbox

This course teaches the big ideas in machine learning like how to build and evaluate predictive models.

  • 4 hours
  • 24 Videos
  • 88 Exercises
  • 34,413 Participants
  • 6,200 XP

Loved by learners at thousands of top companies:

ea-grey.svg
paypal-grey.svg
uber-grey.svg
lego-grey.svg
mls-grey.svg
3m-grey.svg

Course Description

Machine learning is the study and application of algorithms that learn from and make predictions on data. From search results to self-driving cars, it has manifested itself in all areas of our lives and is one of the most exciting and fast growing fields of research in the world of data science. This course teaches the big ideas in machine learning: how to build and evaluate predictive models, how to tune them for optimal performance, how to preprocess data for better results, and much more. The popular caret R package, which provides a consistent interface to all of R's most powerful machine learning facilities, is used throughout the course.

  1. 1

    Regression models: fitting them and evaluating their performance

    Free

    In the first chapter of this course, you'll fit regression models with train() and evaluate their out-of-sample performance using cross-validation and root-mean-square error (RMSE).

  2. Tuning model parameters to improve performance

    In this chapter, you will use the train() function to tweak model parameters through cross-validation and grid search.

  3. Selecting models: a case study in churn prediction

    In the final chapter of this course, you'll learn how to use resamples() to compare multiple models and select (or ensemble) the best one(s).

  4. Classification models: fitting them and evaluating their performance

    In this chapter, you'll fit classification models with train() and evaluate their out-of-sample performance using cross-validation and area under the curve (AUC).

  5. Preprocessing your data

    In this chapter, you will practice using train() to preprocess data before fitting models, improving your ability to making accurate predictions.

  1. 1

    Regression models: fitting them and evaluating their performance

    Free

    In the first chapter of this course, you'll fit regression models with train() and evaluate their out-of-sample performance using cross-validation and root-mean-square error (RMSE).

  2. Classification models: fitting them and evaluating their performance

    In this chapter, you'll fit classification models with train() and evaluate their out-of-sample performance using cross-validation and area under the curve (AUC).

  3. Tuning model parameters to improve performance

    In this chapter, you will use the train() function to tweak model parameters through cross-validation and grid search.

  4. Preprocessing your data

    In this chapter, you will practice using train() to preprocess data before fitting models, improving your ability to making accurate predictions.

  5. Selecting models: a case study in churn prediction

    In the final chapter of this course, you'll learn how to use resamples() to compare multiple models and select (or ensemble) the best one(s).

What do other learners have to say?

Devon

“I've used other sites, but DataCamp's been the one that I've stuck with.”

Devon Edwards Joseph

Lloyd's Banking Group

Louis

“DataCamp is the top resource I recommend for learning data science.”

Louis Maiden

Harvard Business School

Ronbowers

“DataCamp is by far my favorite website to learn from.”

Ronald Bowers

Decision Science Analytics @ USAA

Zachary Deane-Mayer
Zachary Deane-Mayer

Automation First Data Scientist at DataRobot

Zach is a Data Scientist at DataRobot and co-author of the caret R package. He's fascinated by predicting the future and spends his free time competing in predictive modeling competitions. He's currently one of top 500 data scientists on Kaggle and took 9th place in the Heritage Health Prize as part of the Analytics Inside team.

See More
Max Kuhn
Max Kuhn

Software Engineer at RStudio and creator of caret

Dr. Max Kuhn is a Software Engineer at RStudio. He is the author or maintainer of several R packages for predictive modeling including caret, AppliedPredictiveModeling, Cubist, C50 and SparseLDA. He routinely teaches classes in predictive modeling at Predictive Analytics World and UseR! and his publications include work on neuroscience biomarkers, drug discovery, molecular diagnostics and response surface methodology.

See More
Icon Icon Icon professional info
Do you have 5 minutes to help us improve our navigation?
I'll do it No thanks