Course

Dealing with Missing Data in Python

Skill Level: Intermediate
Rating: 4.8 (172 reviews)
Updated 08/2023
Learn how to identify, analyze, remove and impute missing data in Python.
Python | Data Manipulation | 4 hr | 14 videos | 46 Exercises | 3,800 XP | 25,825 | Statement of Accomplishment

Course Description

Tired of working with messy data? Data scientists spend a large share of their time finding, cleaning, and reorganizing data, and this course shows you how to clean yours more systematically. In Dealing with Missing Data in Python, you'll learn to address missing values in numerical, categorical, and time-series data, and to recognize the patterns that missing data exhibits. Working with air quality and diabetes data, you'll also learn to analyze and impute missing values and to evaluate the effects of imputation.

Prerequisites

Introduction to Data Visualization with Matplotlib
Supervised Learning with scikit-learn
1

The Problem With Missing Data

Get familiar with missing data and how it affects your analysis. Learn about the different null value operations in your dataset, how to find missing data, and how to summarize missingness in your data.
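
As a rough illustration of finding and summarizing missing values, here is a minimal sketch that assumes pandas and NumPy. The DataFrame, its column names, and its values are made up for this example and are not the course's datasets.

```python
import numpy as np
import pandas as pd

# Hypothetical air-quality-style readings (illustrative values only).
df = pd.DataFrame({
    "ozone": [41.0, np.nan, 12.0, 18.0, np.nan],
    "temp": [67.0, 72.0, np.nan, 62.0, 56.0],
    "wind": [7.4, 8.0, 12.6, 11.5, 14.3],
})

# Null value operations: NaN propagates through arithmetic
# and never compares equal to itself.
nan_plus_one = np.nan + 1
nan_equals_nan = np.nan == np.nan

# Count of missing values per column.
missing_counts = df.isna().sum()

# Percentage of missing values per column.
missing_pct = df.isna().mean() * 100

# Rows that contain at least one missing value.
rows_with_missing = df[df.isna().any(axis=1)]

print(missing_counts)
print(missing_pct)
print(rows_with_missing)
```

Because `isna()` returns a boolean frame, `sum()` gives counts and `mean()` gives proportions, which is why the same call serves both summaries.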
2

Does Missingness Have A Pattern?

Analyzing the type of missingness in your dataset is a very important step towards treating missing values. In this chapter, you'll learn in detail how to establish patterns in your missing and non-missing data, and how to appropriately treat the missingness using simple techniques such as listwise deletion.
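
Listwise deletion, mentioned above, can be sketched as follows, assuming pandas and NumPy. The diabetes-style column names and values are hypothetical, and the group comparison is only one simple way to look for a pattern in missingness, not necessarily the approach the course takes.

```python
import numpy as np
import pandas as pd

# Hypothetical diabetes-style records (illustrative values only).
df = pd.DataFrame({
    "glucose": [148.0, 85.0, np.nan, 89.0, 137.0, np.nan],
    "bmi": [33.6, 26.6, 23.3, np.nan, 43.1, 25.6],
    "age": [50, 31, 32, 21, 33, 30],
})

# A simple pattern check: does age differ between rows where
# glucose is missing and rows where it is observed?
glucose_missing = df["glucose"].isna()
mean_age_when_missing = df.loc[glucose_missing, "age"].mean()
mean_age_when_observed = df.loc[~glucose_missing, "age"].mean()

# Listwise deletion: drop every row that has any missing value.
complete_cases = df.dropna(how="any")

print(mean_age_when_missing, mean_age_when_observed)
print(complete_cases)
```

Listwise deletion is simple, but it discards every partially observed row, so it is usually safest when the missing values do not depend on other variables in the data.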
3

Imputation Techniques

4

Advanced Imputation Techniques

Finally, go beyond simple imputation and make the most of your dataset with advanced imputation techniques that rely on machine learning models. You will use methods such as KNN and MICE to impute missing values more accurately, and you will evaluate the resulting imputations.
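
To give a feel for model-based imputation, here is a minimal sketch that assumes scikit-learn, which is a choice made for this example and may not be the library the course uses. `KNNImputer` fills each gap from the nearest rows, and `IterativeImputer` is scikit-learn's experimental, MICE-inspired imputer. The array values are made up.

```python
import numpy as np
# IterativeImputer is experimental and must be enabled explicitly.
from sklearn.experimental import enable_iterative_imputer  # noqa: F401
from sklearn.impute import IterativeImputer, KNNImputer

# Hypothetical numeric data with two missing entries.
X = np.array([
    [1.0, 2.0, 3.0],
    [2.0, np.nan, 5.0],
    [3.0, 6.0, 7.0],
    [np.nan, 8.0, 9.0],
    [5.0, 10.0, 11.0],
])

# KNN imputation: each missing value is replaced by the mean of that
# feature across the 2 nearest rows (distance ignores missing entries).
knn_imputed = KNNImputer(n_neighbors=2).fit_transform(X)

# MICE-style imputation: each feature with missing values is modeled
# from the other features, iterating in round-robin fashion.
mice_imputed = IterativeImputer(max_iter=10, random_state=0).fit_transform(X)

print(knn_imputed)
print(mice_imputed)
```

Both imputers leave observed values untouched and only fill the gaps, so comparing their outputs on the originally missing cells is a quick way to see how much the choice of method matters.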

Earn Statement of Accomplishment

Add this credential to your LinkedIn profile, resume, or CV
Share it on social media and in your performance review

Don’t just take our word for it

4.8 from 172 reviews
Rating breakdown: 83%, 16%, 2%, 0%, 0%
  • Hannah
    3 days ago

  • Pearce
    2 weeks ago

  • Ciara
    2 weeks ago

    fsf

  • Tiras Murage
    3 weeks ago

  • Ninghao
    4 weeks ago

    I used Chrome as the browser, but somehow several videos were not loaded initially. After refreshing the page, the video was loaded, but the figure/figures in the video weren't displayed.

  • Chuan
    4 weeks ago

FAQs

Is this course suitable for beginners?

Yes, this course is suitable for beginners who are comfortable with the listed prerequisites. It provides an overview of common methods for dealing with missing data, including both simple and advanced imputation techniques.

Will I receive a certificate at the end of the course?

Yes, upon completing the course you will receive a DataCamp Statement of Accomplishment.

What topics are covered in this course?

This course covers topics such as null value operations, establishing patterns in missing and non-missing data, basic imputation techniques, advanced imputation techniques, and evaluating missing data.

What types of data does the course cover?

This course covers numerical, categorical, and time-series data.

Who will benefit from this course?

This course is ideal for data scientists and analysts who need to clean data more efficiently and accurately. It can also benefit software engineers, database administrators, and other professionals who work with data day to day.

What data is used in this course?

This course uses air quality and diabetes datasets to demonstrate how to use the various methods presented.

Join over 19 million learners and start Dealing with Missing Data in Python today!
