Skip to main content
Jordi Fajula avatar

Jordi Fajula has completed

Exploratory Data Analysis in Python

Start course For Free
4 hr
4,150 XP
Statement of Accomplishment Badge

Loved by learners at thousands of companies


Course Description

So you’ve got some interesting data - where do you begin your analysis? This course will cover the process of exploring and analyzing data, from understanding what’s included in a dataset to incorporating exploration findings into a data science workflow.

Using data on unemployment figures and plane ticket prices, you’ll leverage Python to summarize and validate data, calculate, identify and replace missing values, and clean both numerical and categorical values. Throughout the course, you’ll create beautiful Seaborn visualizations to understand variables and their relationships.

For example, you’ll examine how alcohol use and student performance are related. Finally, the course will show how exploratory findings feed into data science workflows by creating new features, balancing categorical features, and generating hypotheses from findings.

By the end of this course, you’ll have the confidence to perform your own exploratory data analysis (EDA) in Python.You’ll be able to explain your findings visually to others and suggest the next steps for gathering insights from your data!
For Business

Training 2 or more people?

Get your team access to the full DataCamp platform, including all the features.
DataCamp for BusinessFor a bespoke solution book a demo.
  1. 1

    Getting to Know a Dataset

    Free

    What's the best way to approach a new dataset? Learn to validate and summarize categorical and numerical data and create Seaborn visualizations to communicate your findings.

    Play Chapter Now
    Initial exploration
    50 xp
    Functions for initial exploration
    100 xp
    Counting categorical values
    100 xp
    Global unemployment in 2021
    100 xp
    Data validation
    50 xp
    Detecting data types
    100 xp
    Validating continents
    100 xp
    Validating range
    100 xp
    Data summarization
    50 xp
    Summaries with .groupby() and .agg()
    100 xp
    Named aggregations
    100 xp
    Visualizing categorical summaries
    100 xp
  2. 3

    Relationships in Data

    Variables in datasets don't exist in a vacuum; they have relationships with each other. In this chapter, you'll look at relationships across numerical, categorical, and even DateTime data, exploring the direction and strength of these relationships as well as ways to visualize them.

    Play Chapter Now
  3. 4

    Turning Exploratory Analysis into Action

    Exploratory data analysis is a crucial step in the data science workflow, but it isn't the end! Now it's time to learn techniques and considerations you can use to successfully move forward with your projects after you've finished exploring!

    Play Chapter Now
For Business

Training 2 or more people?

Get your team access to the full DataCamp platform, including all the features.

datasets

unemployment.csvdata_science_salaries.csvbooks.csvdivorce.csvplanes.csv

collaborators

Collaborator's avatar
Amy Peterson
Collaborator's avatar
Maham Khan
George Boorman HeadshotGeorge Boorman

Senior Curriculum Manager, DataCamp

George is a Curriculum Manager at DataCamp. He holds a PGDip in Exercise for Health and BSc (Hons) in Sports Science and has experience in project management across public health, applied research, and not-for-profit sectors. George is passionate about sports, tech for good, and all things data science.
See More
Izzy Weber HeadshotIzzy Weber

Data Coach at iO-Sphere

Izzy is a Head Coach at iO-Sphere. She discovered a love for data during her seven years as an accounting professor at the University of Washington. She holds a masters degree in Taxation and is a Certified Public Accountant. Her passion is making learning technical topics fun.
See More

Join over 18 million learners and start Exploratory Data Analysis in Python today!

Create Your Free Account

or

By continuing, you accept our Terms of Use, our Privacy Policy and that your data is stored in the USA.