Course
Cluster Analysis in Python
Create Your Free Account
or
By continuing, you accept our Terms of Use, our Privacy Policy and that your data is stored in the USA.Loved by learners at thousands of companies
Training 2 or more people?
Try DataCamp for BusinessCourse Description
Prerequisites
Intermediate PythonIntroduction to Clustering
Hierarchical Clustering
K-Means Clustering
Clustering in Real World
Complete
Earn Statement of Accomplishment
Add this credential to your LinkedIn profile, resume, or CVShare it on social media and in your performance reviewEnroll Now
FAQs
Which Python library is used for clustering in this course?
The course primarily uses the SciPy library to implement both hierarchical and k-means clustering algorithms, along with standard tools for data visualization.
What dataset will I use to practice clustering?
You will explore player statistics from the FIFA 18 video game, applying clustering techniques to group players based on their performance attributes.
How will I determine the right number of clusters for my data?
For hierarchical clustering you will use dendrograms, and for k-means you will learn a separate method to evaluate the optimal number of clusters before running the algorithm.
Does the course cover real-world clustering applications beyond sports data?
Yes. The final chapter applies clustering to find dominant colors in images and to group news articles by topic, demonstrating practical uses in different domains.
What preprocessing steps are taught before clustering?
You will learn essential preprocessing steps like feature scaling and data normalization that are necessary before applying distance-based clustering algorithms effectively.
Join over 19 million learners and start Cluster Analysis in Python today!
Create Your Free Account
or
By continuing, you accept our Terms of Use, our Privacy Policy and that your data is stored in the USA.Grow your data skills with DataCamp for Mobile
Make progress on the go with our mobile courses and daily 5-minute coding challenges.