Course
Cluster Analysis in R
Course Description
Learn How to Perform Cluster Analysis
Cluster analysis is a powerful toolkit in the data science workbench. It is used to find groups of observations (clusters) that share similar characteristics. These similarities can inform all kinds of business decisions; for example, in marketing, it is used to identify distinct groups of customers for which advertisements can be tailored.
Explore Hierarchical and K-Means Clustering Techniques
In this course, you will learn about two commonly used clustering methods: hierarchical clustering and k-means clustering. You won't just learn how to use these methods; you'll build a strong intuition for how they work and how to interpret their results. You'll develop this intuition by exploring three different datasets: soccer player positions, wholesale customer spending data, and longitudinal occupational wage data.
Hone Your Skills with a Hands-On Case Study
You’ll finish the course by applying your new skills to a case study based around average salaries and how they have changed over time. This will combine hierarchical clustering techniques such as occupation trees, preparing for exploration, and plotting occupational clusters, with k-means techniques including elbow analysis and average silhouette widths.
DataCamp courses consist of a mixture of videos, articles, and practice exercises, giving you the chance to test and cement your new-found skills so that you feel confident applying them outside a course setting.
Prerequisites
Intermediate R
Calculating Distance Between Observations
Hierarchical Clustering
K-means Clustering
Case Study: National Occupational Mean Wage
Complete
Earn Statement of Accomplishment
Add this credential to your LinkedIn profile, resume, or CV. Share it on social media and in your performance review.
FAQs
What is cluster analysis?
Cluster analysis is an important technique in data science, where you organize items into groups (clusters) based on shared characteristics. It’s an unsupervised machine learning technique, meaning that the data carry no predefined group labels; the groups are inferred from the observations themselves. A common use for cluster analysis is segmenting the objects in a dataset; for example, in market research, you might group customers by characteristics like age, income, and type of residence.
Is R good for cluster analysis?
R is an excellent programming language for cluster analysis tasks. It has a number of functions that help you to prepare the data (for example, `scale()` and `dist()`), group it (via `hclust()` for hierarchical clustering or `kmeans()` for k-means clustering), and plot cluster solutions.
What is hierarchical clustering?
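Hierarchical clustering is an algorithm that groups similar objects into a nested hierarchy of clusters. In the common agglomerative form, each object starts in its own cluster and the two most similar clusters are merged repeatedly until only one remains. The result is a tree (dendrogram) that can be cut at a chosen height to obtain a specific number of clusters.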
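As a minimal sketch of hierarchical clustering in base R, the example below uses a small made-up dataset (the `pts` values are an assumption for illustration, not data from the course). It computes pairwise distances with `dist()`, builds the tree with `hclust()`, and cuts it into two clusters with `cutree()`. The choice of Euclidean distance and complete linkage is also an assumption; other choices may suit other data.

```r
# Toy data (hypothetical values): six points forming two well-separated groups.
pts <- data.frame(
  x = c(1.0, 1.2, 0.8, 8.0, 8.3, 7.9),
  y = c(1.1, 0.9, 1.0, 8.1, 7.8, 8.2)
)

# Pairwise Euclidean distances between observations.
d <- dist(pts, method = "euclidean")

# Agglomerative hierarchical clustering with complete linkage.
tree <- hclust(d, method = "complete")

# Cut the tree so that exactly two clusters remain.
clusters <- cutree(tree, k = 2)
print(clusters)

# plot(tree) would draw the dendrogram for visual inspection.
```

Cutting by `k` fixes the number of clusters; `cutree(tree, h = ...)` cuts at a given height instead.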
What is K-Means clustering?
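K-Means clustering is an unsupervised machine learning algorithm often used in statistics and data mining. It partitions data points into a chosen number of clusters, k, by assigning each point to the cluster whose centroid (mean) is nearest and then updating the centroids until the assignments stabilize.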
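A minimal sketch of k-means in base R follows, again on made-up toy data (the `pts` values, the seed, and `nstart = 10` are assumptions for illustration). It fits a two-cluster model with `kmeans()` and then records the total within-cluster sum of squares for several values of k, which is the quantity an elbow analysis inspects.

```r
# Toy data (hypothetical values): six points forming two well-separated groups.
pts <- data.frame(
  x = c(1.0, 1.2, 0.8, 8.0, 8.3, 7.9),
  y = c(1.1, 0.9, 1.0, 8.1, 7.8, 8.2)
)

# k-means starts from random centers, so fix the seed for reproducibility.
set.seed(42)

# Fit k-means with k = 2; nstart = 10 tries ten random starts and keeps the best.
model <- kmeans(pts, centers = 2, nstart = 10)
print(model$cluster)  # cluster assignment for each observation
print(model$centers)  # centroid coordinates for each cluster

# Elbow analysis: total within-cluster sum of squares for k = 1, 2, 3.
wss <- sapply(1:3, function(k) {
  kmeans(pts, centers = k, nstart = 10)$tot.withinss
})
print(wss)
```

In an elbow plot of `wss` against k, the value of k where the curve stops dropping sharply is a reasonable candidate for the number of clusters.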
Join over 19 million learners and start Cluster Analysis in R today!