Skip to main content
HomePython

Course

Dimensionality Reduction in Python

IntermediateSkill Level
4.8+
791 reviews
Updated 01/2023
Understand the concept of reducing dimensionality in your data, and master the techniques to do so in Python.
Start Course for Free
PythonMachine Learning4 hr16 videos58 Exercises4,700 XP36,076Statement of Accomplishment

Create Your Free Account

or

By continuing, you accept our Terms of Use, our Privacy Policy and that your data is stored in the USA.

Loved by learners at thousands of companies

Group

Training 2 or more people?

Try DataCamp for Business

Course Description

High-dimensional datasets can be overwhelming and leave you not knowing where to start. Typically, you’d visually explore a new dataset first, but when you have too many dimensions the classical approaches will seem insufficient. Fortunately, there are visualization techniques designed specifically for high dimensional data and you’ll be introduced to these in this course. After exploring the data, you’ll often find that many features hold little information because they don’t show any variance or because they are duplicates of other features. You’ll learn how to detect these features and drop them from the dataset so that you can focus on the informative ones. In a next step, you might want to build a model on these features, and it may turn out that some don’t have any effect on the thing you’re trying to predict. You’ll learn how to detect and drop these irrelevant features too, in order to reduce dimensionality and thus complexity. Finally, you’ll learn how feature extraction techniques can reduce dimensionality for you through the calculation of uncorrelated principal components.

Prerequisites

Supervised Learning with scikit-learn
1

Exploring High Dimensional Data

You'll be introduced to the concept of dimensionality reduction and will learn when an why this is important. You'll learn the difference between feature selection and feature extraction and will apply both techniques for data exploration. The chapter ends with a lesson on t-SNE, a powerful feature extraction technique that will allow you to visualize a high-dimensional dataset.
Start Chapter
2

Feature Selection I - Selecting for Feature Information

In this first out of two chapters on feature selection, you'll learn about the curse of dimensionality and how dimensionality reduction can help you overcome it. You'll be introduced to a number of techniques to detect and remove features that bring little added value to the dataset. Either because they have little variance, too many missing values, or because they are strongly correlated to other features.
Start Chapter
3

Feature Selection II - Selecting for Model Accuracy

4

Feature Extraction

Dimensionality Reduction in Python
Course
Complete

Earn Statement of Accomplishment

Add this credential to your LinkedIn profile, resume, or CV
Share it on social media and in your performance review
Enroll Now

Don’t just take our word for it

*4.8
from 791 reviews
85%
13%
1%
0%
0%
  • Nicholas
    11 hours ago

  • Gyula
    yesterday

  • Sachal
    2 days ago

  • MTE
    3 days ago

  • SAIDUR AHRAR
    4 days ago

  • Huyen
    6 days ago

Nicholas

Gyula

Sachal

FAQs

Is this course suitable for beginners?

Yes, this course is suitable for beginners as it covers the basics of dimensionality reduction from the ground up.

Will I receive a certificate at the end of the course?

Yes, upon completion of the course you will receive a certificate of completion.

What jobs would benefit from this course?

Professionals from many different roles and fields, such as data scientists, data analysts, machine learning engineers, statisticians and other data scientists, would benefit from this course.

What skills will I gain after completing this course?

You will gain the ability to detect and drop features with little added value, to identify and drop irrelevant features, and to use feature extraction techniques to reduce dimensionality.

What techniques will be covered in the course?

You'll be introduced to techniques such as t-SNE, feature selection, feature extraction and Principal Component Analysis (PCA), allowing you to effectively explore and reduce the dimensionality of high-dimensional datasets.

Does this course cover specific visualization techniques?

Yes, the course covers visualization techniques designed specifically for high dimensional data, such as t-SNE, allowing you to effectively explore and reduce the dimensionality of the dataset.

Join over 19 million learners and start Dimensionality Reduction in Python today!

Create Your Free Account

or

By continuing, you accept our Terms of Use, our Privacy Policy and that your data is stored in the USA.

Grow your data skills with DataCamp for Mobile

Make progress on the go with our mobile courses and daily 5-minute coding challenges.