Course
Unsupervised Learning in Python
Create Your Free Account
or
By continuing, you accept our Terms of Use, our Privacy Policy and that your data is stored in the USA.Loved by learners at thousands of companies
Training a Team?
Try for BusinessCourse Description
What you'll learn
- Assess intrinsic dimensionality by interpreting PCA explained-variance ratios and selecting optimal n_components for compression
- Distinguish between k-means, agglomerative hierarchical clustering, and t-SNE based on their algorithms, input requirements, and visualization outputs
- Evaluate cluster quality using inertia plots, dendrogram linkage distances, and cross-tabulations against known categories
- Identify appropriate preprocessing, clustering, and dimension-reduction tools in scikit-learn for specific unsupervised learning tasks
- Recognize significant latent features produced by NMF and apply cosine similarity to recommend documents or images with related topics or patterns
Feels like what you want to learn?
Start Course for FreePrerequisites
Supervised Learning with scikit-learnClustering for Dataset Exploration
Visualization with Hierarchical Clustering and t-SNE
Decorrelating Your Data and Dimension Reduction
Discovering Interpretable Features
Complete
Earn Statement of Accomplishment
Add this credential to your LinkedIn profile, resume, or CVShare it on social media and in your performance reviewEnroll Now
FAQs
Is this course suitable for beginners?
You should be comfortable with basic and intermediate Python before starting. No prior knowledge of machine learning or unsupervised learning is required.
What unsupervised learning techniques does this course cover?
The course covers k-means clustering, hierarchical clustering, principal component analysis, and non-negative matrix factorization, all implemented using scikit-learn.
What is the difference between clustering and dimensionality reduction, and does the course explain both?
Yes. Clustering groups similar items together, while dimensionality reduction re-expresses data along its most important axes to reduce size and noise. The course covers both and explains when each approach is useful.
Which Python library is used throughout the course?
All clustering and dimensionality reduction algorithms are implemented using scikit-learn, the standard Python library for machine learning.
Join over 19 million learners and start Unsupervised Learning in Python today!
Create Your Free Account
or
By continuing, you accept our Terms of Use, our Privacy Policy and that your data is stored in the USA.Grow your data skills with DataCamp for Mobile
Make progress on the go with our mobile courses and daily 5-minute coding challenges.