Skip to main content
HomePython

Course

Supervised Learning with scikit-learn

IntermediateSkill Level
4.8+
7,846 reviews
Updated 12/2025
Grow your machine learning skills with scikit-learn in Python. Use real-world datasets in this interactive course and learn how to make powerful predictions!
Start Course for Free
PythonMachine Learning
4 hr
15 videos
49 Exercises
4,050 XP
270K+
Statement of Accomplishment

Create Your Free Account

or

By continuing, you accept our Terms of Use, our Privacy Policy and that your data is stored in the USA.

Loved by learners at thousands of companies

Group

Training a Team?

Try for Business

Course Description

Grow your machine learning skills with scikit-learn and discover how to use this popular Python library to train models using labeled data. In this course, you'll learn how to make powerful predictions, such as whether a customer is will churn from your business, whether an individual has diabetes, and even how to tell classify the genre of a song. Using real-world datasets, you'll find out how to build predictive models, tune their parameters, and determine how well they will perform with unseen data.The videos contain live transcripts you can reveal by clicking "Show transcript" at the bottom left of the videos. The course glossary can be found on the right in the resources section.To obtain CPE credits you need to complete the course and reach a score of 70% on the qualified assessment. You can navigate to the assessment by clicking on the CPE credits callout on the right.

What you'll learn

  • Assess model generalization using train-test splits, k-fold cross-validation, and hyperparameter tuning with GridSearchCV or RandomizedSearchCV
  • Differentiate key evaluation metrics for supervised models, including accuracy, precision, recall, F1, ROC-AUC, R-squared, MSE, and RMSE
  • Evaluate model complexity and its impact on overfitting or underfitting by adjusting parameters such as k in KNN and alpha in regularized regression.
  • Identify supervised learning problem types and select appropriate scikit-learn algorithms for classification and regression
  • Recognize essential preprocessing techniques—dummy encoding, imputation, scaling, and pipeline construction—required for scikit-learn workflows

Feels like what you want to learn?

Start Course for Free

Prerequisites

Introduction to Statistics in Python
1

Classification

In this chapter, you'll be introduced to classification problems and learn how to solve them using supervised learning techniques. You'll learn how to split data into training and test sets, fit a model, make predictions, and evaluate accuracy. You’ll discover the relationship between model complexity and performance, applying what you learn to a churn dataset, where you will classify the churn status of a telecom company's customers.
Start Chapter
2

Regression

In this chapter, you will be introduced to regression, and build models to predict sales values using a dataset on advertising expenditure. You will learn about the mechanics of linear regression and common performance metrics such as R-squared and root mean squared error. You will perform k-fold cross-validation, and apply regularization to regression models to reduce the risk of overfitting.
Start Chapter
3

Fine-Tuning Your Model

Having trained models, now you will learn how to evaluate them. In this chapter, you will be introduced to several metrics along with a visualization technique for analyzing classification model performance using scikit-learn. You will also learn how to optimize classification and regression models through the use of hyperparameter tuning.
Start Chapter
Supervised Learning with scikit-learn
Course
Complete

Earn Statement of Accomplishment

Add this credential to your LinkedIn profile, resume, or CV
Share it on social media and in your performance review
Enroll Now

Don’t just take our word for it

*4.8
from 7,846 reviews
82%
16%
1%
0%
0%
  • Stanley
    2 hours ago

  • Mubarak
    4 hours ago

  • Erisa
    5 hours ago

    some parts did not explain very well. i had to use AI to understand it a bit better. some exercise hints are not helpful.

  • MOST
    6 hours ago

  • JUAN RAMON
    7 hours ago

    Buen curso

  • Luis
    14 hours ago

Mubarak

MOST

"Buen curso"

JUAN RAMON

FAQs

Who will benefit from this course?

This course is beneficial for anyone interested in data analysis, machine learning, and related fields. People working in finance, analytics, data science, economics, software engineering, and other related fields would find this course useful.

Will I receive a certificate at the end of the course?

Yes, upon completion of this course you will receive a DataCamp certificate.

What topics does this course cover?

This course covers supervised learning methods, regression, data pre-processing, building pipelines, fine-tuning models, and more. It will also demonstrate how to use the scikit-learn library to solve classification and regression problems.

What is classification?

Classification is a supervised machine learning technique used for predicting discrete values for a given set of inputs.

What is regression?

Regression is a supervised machine learning technique used for predicting continuous values for a given set of inputs.

Join over 19 million learners and start Supervised Learning with scikit-learn today!

Create Your Free Account

or

By continuing, you accept our Terms of Use, our Privacy Policy and that your data is stored in the USA.

Grow your data skills with DataCamp for Mobile

Make progress on the go with our mobile courses and daily 5-minute coding challenges.