Unsupervised Learning in R

Rating: 4.6 (20 reviews)

Intermediate

This course provides an intro to clustering and dimensionality reduction in R from a machine learning perspective.

4 Hours, 16 Videos, 49 Exercises

49,087 Learners

Statement of Accomplishment

Course Description

In machine learning, the goal is often to find patterns in data without trying to make predictions. This is called unsupervised learning. One common use case is grouping consumers based on demographics and purchasing history to deploy targeted marketing campaigns. Another is describing the unmeasured factors that most influence differences in crime between cities. This course provides a basic introduction to clustering and dimensionality reduction in R from a machine learning perspective, so that you can get from data to insights as quickly as possible.

In the following Tracks

SQL Fundamentals (Certification Available)

Associate Data Scientist in R (Certification Available)

Machine Learning Fundamentals in R

1. Unsupervised learning in R (Free)
The k-means algorithm is one common approach to clustering. Learn how the algorithm works under the hood, implement k-means clustering in R, visualize and interpret the results, and select the number of clusters when it's not known ahead of time. By the end of the chapter, you'll have applied k-means clustering to a fun "real-world" dataset!
Welcome to the course! (50 xp)
Identify clustering problems (50 xp)
Introduction to k-means clustering (50 xp)
k-means clustering (100 xp)
Results of kmeans() (100 xp)
Visualizing and interpreting results of kmeans() (100 xp)
How k-means works and practical matters (50 xp)
Handling random algorithms (100 xp)
Selecting number of clusters (100 xp)
Introduction to the Pokemon data (50 xp)
Practical matters: working with real data (100 xp)
Review of k-means clustering (50 xp)
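
The workflow this chapter describes (fit k-means, handle its random starts, and pick the number of clusters) can be sketched roughly as follows. This is an illustrative sketch, not the course's own exercise code: the simulated matrix `x` stands in for real data, and the choices of `nstart = 20`, `iter.max = 50`, and the range `1:10` are assumptions.

```r
# Simulated data with three loose groups (a stand-in for real data; an assumption)
set.seed(1)
x <- rbind(
  matrix(rnorm(100, mean = 0), ncol = 2),
  matrix(rnorm(100, mean = 4), ncol = 2),
  matrix(rnorm(100, mean = 8), ncol = 2)
)

# k-means starts from random centers, so nstart repeats the fit
# and keeps the run with the lowest total within-cluster sum of squares
km <- kmeans(x, centers = 3, nstart = 20, iter.max = 50)

# Inspect cluster assignments and the fitted centers
head(km$cluster)
km$centers

# Visualize the points colored by assigned cluster
plot(x, col = km$cluster, xlab = "", ylab = "", main = "k-means with 3 clusters")

# Scree ("elbow") plot: total within-cluster sum of squares for k = 1..10
wss <- sapply(1:10, function(k) {
  kmeans(x, centers = k, nstart = 20, iter.max = 50)$tot.withinss
})
plot(1:10, wss, type = "b",
     xlab = "Number of clusters", ylab = "Total within-cluster sum of squares")
```

When the number of clusters is not known ahead of time, a common heuristic is to look for the "elbow" in the second plot, where adding clusters stops reducing the within-cluster sum of squares by much.
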
2. Hierarchical clustering
Hierarchical clustering is another popular method for clustering. The goal of this chapter is to go over how it works, how to use it, and how it compares to k-means clustering.
Introduction to hierarchical clustering (50 xp)
Hierarchical clustering with results (100 xp)
Selecting number of clusters (50 xp)
Interpreting dendrogram (50 xp)
Cutting the tree (100 xp)
Clustering linkage and practical matters (50 xp)
Linkage methods (100 xp)
Comparing linkage methods (50 xp)
Practical matters: scaling (100 xp)
Comparing kmeans() and hclust() (100 xp)
Review of hierarchical clustering (50 xp)
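
The topics in this chapter (scaling, linkage methods, cutting the tree, and comparing against k-means) fit together roughly as in the sketch below. The simulated matrix `x`, the choice of two clusters, and the object names are assumptions made for illustration only.

```r
# Simulated data with two groups (a stand-in for real data; an assumption)
set.seed(2)
x <- rbind(
  matrix(rnorm(60, mean = 0), ncol = 2),
  matrix(rnorm(60, mean = 5), ncol = 2)
)

# Scale the features so no single column dominates the distances,
# then compute pairwise Euclidean distances
x_scaled <- scale(x)
d <- dist(x_scaled)

# Fit hierarchical clustering with two different linkage methods
hc_complete <- hclust(d, method = "complete")
hc_average  <- hclust(d, method = "average")

# Dendrograms for visual comparison of the linkage methods
plot(hc_complete, main = "Complete linkage")
plot(hc_average, main = "Average linkage")

# Cut the tree into a fixed number of clusters
clusters_hc <- cutree(hc_complete, k = 2)

# Compare the assignments with k-means on the same scaled data
km <- kmeans(x_scaled, centers = 2, nstart = 20)
table(hclust = clusters_hc, kmeans = km$cluster)
```

Cluster labels are arbitrary, so the cross-tabulation is read by looking at how the counts concentrate in cells rather than by expecting matching label numbers.
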
3. Dimensionality reduction with PCA
Principal component analysis, or PCA, is a common approach to dimensionality reduction. Learn exactly what PCA does, visualize the results of PCA with biplots and scree plots, and deal with practical issues such as centering and scaling the data before performing PCA.
Introduction to PCA (50 xp)
PCA using prcomp() (100 xp)
Results of PCA (50 xp)
Additional results of PCA (50 xp)
Visualizing and interpreting PCA results (50 xp)
Interpreting biplots (1) (50 xp)
Interpreting biplots (2) (50 xp)
Variance explained (100 xp)
Visualize variance explained (100 xp)
Practical issues with PCA (50 xp)
Practical issues: scaling (100 xp)
Additional uses of PCA and wrap-up (50 xp)
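
A minimal sketch of the PCA steps named in this chapter, using `prcomp()` with centering and scaling, a biplot, and scree plots of variance explained. R's built-in `USArrests` dataset is used here purely as a convenient stand-in; it is an assumption and not necessarily the data used in the course.

```r
# PCA on a built-in dataset (a stand-in; an assumption for illustration)
# center and scale. standardize each column before the rotation is computed
pr <- prcomp(USArrests, center = TRUE, scale. = TRUE)

# Standard deviations and proportion of variance for each component
summary(pr)

# Loadings (rotation) and the first rows of the scores (x)
pr$rotation
head(pr$x)

# Biplot: observations and original variables in the space of PC1 and PC2
biplot(pr)

# Proportion of variance explained by each principal component
pve <- pr$sdev^2 / sum(pr$sdev^2)

# Scree plot and cumulative variance explained
plot(pve, type = "b", ylim = c(0, 1),
     xlab = "Principal component", ylab = "Proportion of variance explained")
plot(cumsum(pve), type = "b", ylim = c(0, 1),
     xlab = "Principal component", ylab = "Cumulative proportion of variance explained")
```

Scaling matters here because the columns are measured in different units; without `scale. = TRUE`, the variable with the largest variance would dominate the first component.
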
4. Putting it all together with a case study
The goal of this chapter is to guide you through a complete analysis using the unsupervised learning techniques covered in the first three chapters. You'll extend what you've learned by using PCA as a preprocessing step for clustering, working with data that consist of measurements of cell nuclei from human breast masses.
Introduction to the case study (50 xp)
Preparing the data (100 xp)
Exploratory data analysis (50 xp)
Performing PCA (100 xp)
Interpreting PCA results (100 xp)
Variance explained (100 xp)
PCA review and next steps (50 xp)
Communicating PCA results (50 xp)
Hierarchical clustering of case data (100 xp)
Results of hierarchical clustering (50 xp)
Selecting number of clusters (100 xp)
k-means clustering and comparing results (100 xp)
Clustering on PCA results (100 xp)
Wrap-up and review (50 xp)
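
The idea of using PCA as a preprocessing step for clustering can be sketched as below. The breast cancer data from the case study is not bundled with R, so this sketch substitutes the built-in `iris` measurements as a stand-in; the 90% variance threshold, the complete linkage, and the choice of three clusters are likewise assumptions for illustration.

```r
# Stand-in data: numeric measurements plus a known label for comparison only
# (iris replaces the case-study data here; an assumption)
features <- as.matrix(iris[, 1:4])
labels <- iris$Species

# Step 1: PCA on centered and scaled features
pr <- prcomp(features, center = TRUE, scale. = TRUE)
pve <- pr$sdev^2 / sum(pr$sdev^2)

# Step 2: keep the smallest number of components reaching 90% cumulative variance
n_pc <- which(cumsum(pve) >= 0.90)[1]
scores <- pr$x[, 1:n_pc, drop = FALSE]

# Step 3: hierarchical clustering on the retained component scores
hc <- hclust(dist(scores), method = "complete")
clusters_hc <- cutree(hc, k = 3)

# Step 4: k-means on the same scores for comparison
set.seed(3)
km <- kmeans(scores, centers = 3, nstart = 20)

# Compare each clustering with the known labels, and with each other
table(clusters_hc, labels)
table(km$cluster, labels)
table(clusters_hc, km$cluster)
```

The labels are never used to fit the models; they only serve as a check on how well the unsupervised groupings line up with a known category.
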

In other tracks

Machine Learning Scientist with R

Datasets

Pokemon data

Wisconsin breast cancer data

Collaborators

Nick Carchedi

Tom Jeon

Prerequisites

Introduction to R

Hank Roark

Senior Data Scientist, Boeing

Hank is a Senior Data Scientist at Boeing and a long-time user of the R language. Prior to his current role, he led the Customer Data Science team at H2O.ai, a leading provider of machine learning and predictive analytics services.

Don’t just take our word for it

Walter F.

9 months

The pokemon data set provided was not the same one used in the class, but should be. On the `Selecting number of clusters` the part on comparing to diagnosis could be better explained. Similarly, on the k-means clusters and comparing results needed more explanation on interpreting results.

Denis Y.

about 1 year

Great course with a lot of information in simple and clear form.

Marcel O.

about 1 year

Quite good introduction to dims reduction and most popular clustering techniques.

Edwin A.

over 1 year

This is a recommended course to learn unsupervised learning in R.