Course
Cluster Analysis in R
Intermediate skill level
Updated November 2024
R | Machine Learning | 4 hours | 16 videos | 52 exercises | 3,800 XP | 43,964 Statements of Accomplishment
Course Description
Learn How to Perform Cluster Analysis
Cluster analysis is a powerful toolkit in the data science workbench. It is used to find groups of observations (clusters) that share similar characteristics. These similarities can inform all kinds of business decisions; for example, in marketing, it is used to identify distinct groups of customers for which advertisements can be tailored.
Explore Hierarchical and K-Means Clustering Techniques
In this course, you will learn about two commonly used clustering methods: hierarchical clustering and k-means clustering. You won't just learn how to use these methods; you'll build a strong intuition for how they work and how to interpret their results. You'll develop this intuition by exploring three different datasets: soccer player positions, wholesale customer spending data, and longitudinal occupational wage data.
Hone Your Skills with a Hands-On Case Study
You’ll finish the course by applying your new skills to a case study based around average salaries and how they have changed over time. This will combine hierarchical clustering techniques such as occupation trees, preparing for exploration, and plotting occupational clusters, with k-means techniques including elbow analysis and average silhouette widths.
DataCamp courses combine videos, articles, and practice exercises so that you can test and cement your new skills and feel confident applying them outside a course setting.
Prerequisites
Intermediate R
1
Calculating Distance Between Observations
Cluster analysis seeks groups of observations that are similar to one another, while the groups themselves differ from each other. This similarity or difference is captured by a metric called distance. In this chapter, you will learn how to calculate the distance between observations for both continuous and categorical features. You will also develop an intuition for how the scales of your features can affect distance.
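As a rough illustration of the ideas in this chapter, the sketch below computes distances with base R's `dist()`. The data frames `players` and `survey` and their columns are hypothetical toy data, not the course datasets, and using the `"binary"` method for dummy-coded categorical features is an assumption about one reasonable approach.

```r
# Hypothetical toy data: two continuous features on very different scales.
players <- data.frame(
  height_m  = c(1.80, 1.75, 1.90),
  weight_kg = c(80, 95, 82)
)

# Euclidean distance on the raw features: weight_kg dominates because
# its numeric range is far larger than that of height_m.
d_raw <- dist(players, method = "euclidean")

# Standardize each column (mean 0, sd 1) so both features contribute
# on a comparable scale, then recompute the distances.
d_scaled <- dist(scale(players), method = "euclidean")

# Hypothetical dummy-coded categorical features (1 = yes, 0 = no).
survey <- data.frame(
  likes_wine = c(1, 0, 1),
  likes_beer = c(1, 1, 0)
)

# "binary" is the asymmetric binary (Jaccard-style) distance in base R.
d_bin <- dist(survey, method = "binary")

print(d_raw)
print(d_scaled)
print(d_bin)
```

Comparing `d_raw` with `d_scaled` shows how much of the raw distance comes from the feature with the larger scale.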
2
Hierarchical Clustering
This chapter will help you answer the last question from Chapter 1: how do you find groups of similar observations (clusters) in your data using the distances that you have calculated? You will learn about the fundamental principles of hierarchical clustering, namely the linkage criteria and the dendrogram plot, and how both are used to build clusters. You will also explore data from a wholesale distributor in order to perform market segmentation of clients using their spending habits.
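A minimal sketch of that workflow, using base R's `hclust()` and `cutree()`, might look like the following. The `spend` data frame is hypothetical and much smaller than the wholesale data used in the course; complete linkage and `k = 2` are assumptions chosen only for illustration.

```r
# Hypothetical spending data for six clients forming two obvious groups.
spend <- data.frame(
  milk    = c(10, 12, 11, 50, 52, 49),
  grocery = c(20, 22, 19, 80, 78, 83)
)

# Pairwise Euclidean distances between clients.
d <- dist(spend)

# Build the hierarchy; "complete" linkage merges clusters based on the
# largest distance between any two of their members.
hc <- hclust(d, method = "complete")

# Cut the tree so that exactly two clusters remain.
clusters <- cutree(hc, k = 2)

# plot(hc) would draw the dendrogram for visual inspection.
print(clusters)
```

Changing `method` to `"single"` or `"average"` selects a different linkage criterion, which can produce a different tree for less clearly separated data.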
3
K-means Clustering
In this chapter, you will build an understanding of the principles behind the k-means algorithm, learn how to select the right k when it isn't previously known, and revisit the wholesale data from a different perspective.
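One common way to choose k is an elbow analysis of the total within-cluster sum of squares. The sketch below uses base R's `kmeans()` on simulated data; the matrix `x`, the seed, and the range of k values are all assumptions made for illustration.

```r
set.seed(42)

# Hypothetical data: two well-separated groups of 20 points each.
x <- rbind(
  matrix(rnorm(40, mean = 0, sd = 0.5), ncol = 2),
  matrix(rnorm(40, mean = 5, sd = 0.5), ncol = 2)
)

# Fit k-means for k = 1..5 and record the total within-cluster
# sum of squares; nstart = 10 reruns each fit from 10 random starts.
tot_withinss <- sapply(1:5, function(k) {
  kmeans(x, centers = k, nstart = 10)$tot.withinss
})

# The "elbow" is the k after which further increases in k give only
# small reductions in tot_withinss.
print(round(tot_withinss, 2))
```

Plotting `tot_withinss` against k would show the sharp bend that gives the elbow method its name.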
4
Case Study: National Occupational Mean Wage
In this chapter, you will apply the skills you have learned to explore how the average salary across professions has changed over time.