Course
Dimensionality Reduction in R
Create Your Free Account
or
By continuing, you accept our Terms of Use, our Privacy Policy and that your data is stored in the USA.Loved by learners at thousands of companies
Training 2 or more people?
Try DataCamp for BusinessCourse Description
Why learn dimensionality reduction?
We live in the information age—an era of information overload. The art of extracting essential information from data is a marketable skill. Models train faster on reduced data. In production, smaller models mean faster response time. Perhaps most important, smaller data and models are often easier to understand. Dimensionality reduction is your Occam’s razor in data science.
What will you learn in this course?
The difference between feature selection and feature extraction! Using R, you will learn how to identify and remove features with low or redundant information, keeping the features with the most information. That’s feature selection. You will also learn how to extract combinations of features as condensed components that contain maximal information. That’s feature extraction!
But most importantly, using R’s new tidymodel package, you will use real-world data to build models with fewer features without sacrificing significant performance.
Prerequisites
Modeling with tidymodels in RFoundations of Dimensionality Reduction
Feature Selection for Feature Importance
Feature Selection for Model Performance
Feature Extraction and Model Performance
Complete
Earn Statement of Accomplishment
Add this credential to your LinkedIn profile, resume, or CVShare it on social media and in your performance reviewEnroll Now
FAQs
Is this course appropriate for someone new to machine learning in R?
Yes. It is listed as beginner level and assumes familiarity with tidyverse basics and tidymodels rather than advanced machine learning experience.
What dimensionality reduction techniques are covered?
You learn feature selection methods like variance and correlation filtering, lasso and random forest selection, and feature extraction techniques including PCA, t-SNE, and UMAP.
Does the course use the tidymodels framework?
Yes. You build tidymodels recipes and workflows throughout the course to select and extract features as part of a reproducible model-building pipeline.
What is the difference between feature selection and feature extraction?
Feature selection removes low-information variables, while feature extraction creates new condensed components like principal components that combine information from multiple original features.
Will reducing features hurt my model performance?
The course teaches you how to simplify data while maintaining strong predictive performance, showing that removing redundant features often improves model speed and interpretability.
Join over 19 million learners and start Dimensionality Reduction in R today!
Create Your Free Account
or
By continuing, you accept our Terms of Use, our Privacy Policy and that your data is stored in the USA.Grow your data skills with DataCamp for Mobile
Make progress on the go with our mobile courses and daily 5-minute coding challenges.