Cluster Analysis in Python

In this course, you will be introduced to unsupervised learning through techniques such as hierarchical and k-means clustering using the SciPy library.

4 hours, 14 videos, 46 exercises, 55,962 learners, Statement of Accomplishment

Course Description

You have probably come across Google News, which automatically groups similar news articles under a topic. Have you ever wondered what process runs in the background to arrive at these groups? In this course, you will be introduced to unsupervised learning through clustering using the SciPy library in Python. This course covers pre-processing of data and application of hierarchical and k-means clustering. Through the course, you will explore player statistics from a popular football video game, FIFA 18. After completing the course, you will be able to quickly apply various clustering algorithms on data, visualize the clusters formed and analyze results.

In the following tracks

Machine Learning Scientist with Python
  1. Introduction to Clustering

    Free

    Before you are ready to group news articles, you need to be introduced to the basics of clustering. This chapter familiarizes you with a class of machine learning algorithms called unsupervised learning and then introduces you to clustering, one of the most popular unsupervised learning techniques. You will learn about two popular clustering techniques - hierarchical clustering and k-means clustering. The chapter concludes with the basic pre-processing steps to carry out before you start clustering data.

    Unsupervised learning: basics (50 xp)
    Unsupervised learning in real world (50 xp)
    Pokémon sightings (100 xp)
    Basics of cluster analysis (50 xp)
    Pokémon sightings: hierarchical clustering (100 xp)
    Pokémon sightings: k-means clustering (100 xp)
    Data preparation for cluster analysis (50 xp)
    Normalize basic list data (100 xp)
    Visualize normalized data (100 xp)
    Normalization of small numbers (100 xp)
    FIFA 18: Normalize data (100 xp)
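
The normalization lessons above can be illustrated with a minimal sketch. It assumes the pre-processing step is done with `scipy.cluster.vq.whiten`, which rescales each feature by its standard deviation; the sample values are hypothetical and are not taken from the FIFA 18 dataset.

```python
import numpy as np
from scipy.cluster.vq import whiten

# Hypothetical observations: two features on very different scales
# (rows are observations, columns are features).
data = np.array([
    [5.0, 20000.0],
    [3.0, 35000.0],
    [8.0, 15000.0],
    [6.0, 50000.0],
])

# whiten divides every column by its standard deviation, so each
# feature ends up with unit variance and neither dominates the
# distance computations used by clustering algorithms.
scaled = whiten(data)

# The same result computed by hand, for comparison.
manual = data / data.std(axis=0)

print(scaled)
print(scaled.std(axis=0))
```

Note that `whiten` only rescales the data; it does not subtract the mean, so the values stay non-negative when the inputs are non-negative.
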
  2. Hierarchical Clustering

    This chapter focuses on a popular clustering algorithm - hierarchical clustering - and its implementation in SciPy. Beyond the procedure for performing hierarchical clustering, it helps you answer an important question - how many clusters are present in your data? The chapter concludes with the limitations of hierarchical clustering and the points to consider when using it.
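
As a rough illustration of the procedure, the sketch below uses `linkage` and `fcluster` from `scipy.cluster.hierarchy` on hypothetical 2-D points. The choice of Ward linkage and the "largest gap in merge distances" heuristic are assumptions made for this example; the heuristic is only a numeric stand-in for reading a dendrogram, not necessarily the approach taught in the chapter.

```python
import numpy as np
from scipy.cluster.hierarchy import linkage, fcluster

# Hypothetical data: two visibly separated groups of three points.
points = np.array([
    [1.0, 1.0], [1.2, 0.9], [0.8, 1.1],
    [5.0, 5.0], [5.1, 4.8], [4.9, 5.2],
])

# Build the merge tree. Each row of Z records one merge:
# the two clusters joined, the merge distance, and the new cluster size.
Z = linkage(points, method="ward", metric="euclidean")

# Heuristic (an assumption for this sketch): cut the tree just before
# the merge with the largest jump in distance.
merge_distances = Z[:, 2]
gaps = np.diff(merge_distances)
suggested_k = len(points) - (int(np.argmax(gaps)) + 1)

# Assign every point to one of suggested_k flat clusters.
labels = fcluster(Z, t=suggested_k, criterion="maxclust")

print(suggested_k)
print(labels)
```

`scipy.cluster.hierarchy.dendrogram(Z)` draws the same merge tree when a plotting backend such as matplotlib is available.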

  3. K-Means Clustering

    This chapter introduces a different clustering algorithm - k-means clustering - and its implementation in SciPy. K-means clustering overcomes the biggest drawback of hierarchical clustering that was discussed in the last chapter. As dendrograms are specific to hierarchical clustering, this chapter discusses one method to find the number of clusters before running k-means clustering. The chapter concludes with the limitations of k-means clustering and the points to consider when using this algorithm.
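
A minimal sketch of k-means in SciPy follows, using `kmeans` and `vq` from `scipy.cluster.vq` on synthetic data. The chapter does not name its method for choosing the number of clusters here, so comparing distortions across several values of k (the idea behind an elbow plot) is an assumption of this example. The `seed` keyword is used only to make the run repeatable and requires a reasonably recent SciPy.

```python
import numpy as np
from scipy.cluster.vq import kmeans, vq, whiten

# Synthetic data (hypothetical): two well-separated 2-D blobs.
rng = np.random.default_rng(0)
blob_a = rng.normal(loc=[0.0, 0.0], scale=0.3, size=(50, 2))
blob_b = rng.normal(loc=[5.0, 5.0], scale=0.3, size=(50, 2))
data = whiten(np.vstack([blob_a, blob_b]))

# Run k-means for several k and record the distortion, i.e. the mean
# distance between each observation and its nearest centroid.
distortions = {}
for k in range(1, 5):
    _, distortion = kmeans(data, k, seed=0)
    distortions[k] = distortion

# Fit the final model with the chosen k and assign labels.
centroids, _ = kmeans(data, 2, seed=0)
labels, _ = vq(data, centroids)

print(distortions)
print(centroids)
```

Plotting `distortions` against k would typically show a sharp drop from k=1 to k=2 for this data and only small improvements afterwards.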

  4. Clustering in Real World

    Now that you are familiar with two of the most popular clustering techniques, this chapter helps you apply this knowledge to real-world problems. The chapter first discusses the process of finding dominant colors in an image, before moving on to the problem discussed in the introduction - clustering of news articles. The chapter concludes with a discussion on clustering with multiple variables, which makes it difficult to visualize all the data.
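
The dominant-colors idea can be sketched by treating every pixel as an RGB observation and clustering the pixels with k-means. The example below is a sketch under stated assumptions: it builds hypothetical pixel data instead of loading a real image, fixes the number of colors at two, and skips normalization because all three channels already share the same 0-255 scale.

```python
import numpy as np
from scipy.cluster.vq import kmeans, vq

# Hypothetical "image": 60 reddish and 40 bluish pixels with slight noise,
# flattened to an array of shape (n_pixels, 3).
rng = np.random.default_rng(1)
reddish = rng.normal(loc=[200.0, 30.0, 30.0], scale=5.0, size=(60, 3))
bluish = rng.normal(loc=[20.0, 40.0, 210.0], scale=5.0, size=(40, 3))
pixels = np.clip(np.vstack([reddish, bluish]), 0, 255)

# Cluster the pixels; each centroid is a candidate dominant color.
centroids, _ = kmeans(pixels, 2, seed=1)
labels, _ = vq(pixels, centroids)

# Order the colors by how many pixels belong to each cluster.
counts = np.bincount(labels, minlength=len(centroids))
order = np.argsort(counts)[::-1]
dominant_colors = centroids[order].round().astype(int)

print(dominant_colors)
print(counts[order])
```

For a real image, the pixel array would come from an image-reading library and be reshaped to `(height * width, 3)` before clustering.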

Datasets

FIFA sample, FIFA, Movies

Collaborators

Hillary Green-Lerman
Sara Billen

Prerequisites

Intermediate Python
Shaumik Daityari

Business Analyst at American Express
