Exploratory Data Analysis in Python

Rating: 4.76 (29 reviews)

Intermediate

Learn how to explore, visualize, and extract insights from data using exploratory data analysis (EDA) in Python.

4 Hours, 14 Videos, 49 Exercises

34,812 Learners, Statement of Accomplishment

Course Description

So you’ve got some interesting data - where do you begin your analysis? This course will cover the process of exploring and analyzing data, from understanding what’s included in a dataset to incorporating exploration findings into a data science workflow.

Using data on unemployment figures and plane ticket prices, you’ll leverage Python to summarize and validate data, calculate, identify and replace missing values, and clean both numerical and categorical values. Throughout the course, you’ll create beautiful Seaborn visualizations to understand variables and their relationships.

For example, you’ll examine how alcohol use and student performance are related. Finally, the course will show how exploratory findings feed into data science workflows by creating new features, balancing categorical features, and generating hypotheses from findings.

By the end of this course, you’ll have the confidence to perform your own exploratory data analysis (EDA) in Python. You’ll be able to explain your findings visually to others and suggest the next steps for gathering insights from your data!

In the following Tracks

Data Analyst with Python (Certification Available)

Associate Data Scientist in Python (Certification Available)

1
Getting to Know a Dataset
Free
What's the best way to approach a new dataset? Learn to validate and summarize categorical and numerical data and create Seaborn visualizations to communicate your findings.
Initial exploration
50 xp
Functions for initial exploration
100 xp
Counting categorical values
100 xp
Global unemployment in 2021
100 xp
Data validation
50 xp
Detecting data types
100 xp
Validating continents
100 xp
Validating range
100 xp
Data summarization
50 xp
Summaries with .groupby() and .agg()
100 xp
Named aggregations
100 xp
Visualizing categorical summaries
100 xp
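To give a flavor of the validation and summarization steps in this chapter, here is a minimal sketch in pandas. The DataFrame is a made-up stand-in for unemployment.csv, and the column names (country, continent, rate_2021) and the allowed-continent list are assumptions for illustration, not the course's actual data or solution.

```python
import pandas as pd

# Hypothetical stand-in for unemployment.csv; all names and values are made up.
unemployment = pd.DataFrame({
    "country": ["A", "B", "C", "D", "E"],
    "continent": ["Asia", "Asia", "Europe", "Europe", "Oceania"],
    "rate_2021": [5.0, 7.0, 3.0, 9.0, 4.0],
})

# Initial exploration: column data types and counts per category.
dtypes = unemployment.dtypes
continent_counts = unemployment["continent"].value_counts()

# Validation: find rows whose continent is outside an allowed set (assumed list).
allowed = ["Asia", "Europe"]
not_allowed = unemployment[~unemployment["continent"].isin(allowed)]

# Validation: check that every rate falls inside a plausible percentage range.
in_range = unemployment["rate_2021"].between(0, 100).all()

# Summarization: named aggregations give the output columns readable names.
summary = unemployment.groupby("continent").agg(
    mean_rate=("rate_2021", "mean"),
    max_rate=("rate_2021", "max"),
)
print(summary)
```

Named aggregations take the form new_name=(column, function), which keeps the summary table self-describing when you later plot or share it.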
2
Data Cleaning and Imputation
Exploring and analyzing data often means dealing with missing values, incorrect data types, and outliers. In this chapter, you’ll learn techniques to handle these issues and streamline your EDA processes!
Addressing missing data
50 xp
Dealing with missing data
100 xp
Strategies for remaining missing data
100 xp
Imputing missing plane prices
100 xp
Converting and analyzing categorical data
50 xp
Finding the number of unique values
100 xp
Flight duration categories
100 xp
Adding duration categories
100 xp
Working with numeric data
50 xp
Flight duration
100 xp
Adding descriptive statistics
100 xp
Handling outliers
50 xp
What to do with outliers
100 xp
Identifying outliers
100 xp
Removing outliers
100 xp
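The imputation and outlier topics above can be sketched roughly as follows. This is one common approach (group-wise median imputation and the 1.5 x IQR rule), not necessarily the exact method used in the exercises. The DataFrame stands in for planes.csv, and its columns (airline, price) and values are invented for illustration.

```python
import pandas as pd

# Hypothetical stand-in for planes.csv; all names and values are made up.
planes = pd.DataFrame({
    "airline": ["X", "X", "X", "Y", "Y", "Y"],
    "price": [100.0, None, 120.0, 200.0, 220.0, 1000.0],
})

# Count missing values per column before doing anything else.
missing = planes.isna().sum()

# Impute each missing price with the median price of the same airline.
median_by_airline = planes.groupby("airline")["price"].transform("median")
planes["price"] = planes["price"].fillna(median_by_airline)

# Identify outliers with the 1.5 * IQR rule.
q1 = planes["price"].quantile(0.25)
q3 = planes["price"].quantile(0.75)
iqr = q3 - q1
lower = q1 - 1.5 * iqr
upper = q3 + 1.5 * iqr

# Keep only the rows whose price lies within the two thresholds.
no_outliers = planes[planes["price"].between(lower, upper)]
print(no_outliers)
```

Whether to drop, cap, or keep an outlier depends on why it exists, so treat the final filter as one option rather than a default.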
3
Relationships in Data
Variables in datasets don't exist in a vacuum; they have relationships with each other. In this chapter, you'll look at relationships across numerical, categorical, and even DateTime data, exploring the direction and strength of these relationships as well as ways to visualize them.
Patterns over time
50 xp
Importing DateTime data
100 xp
Updating data type to DateTime
100 xp
Visualizing relationships over time
100 xp
Correlation
50 xp
Interpreting a heatmap
50 xp
Visualizing variable relationships
100 xp
Visualizing multiple variable relationships
100 xp
Factor relationships and distributions
50 xp
Categorical data in scatter plots
100 xp
Exploring with KDE plots
100 xp
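As a rough illustration of the DateTime and correlation topics in this chapter, the sketch below converts a string column to DateTime, extracts the year, and computes a correlation matrix. The DataFrame is a made-up stand-in for divorce.csv, and the column names (marriage_date, marriage_duration, num_kids) are assumptions. In the course you would typically pass a matrix like this to a Seaborn heatmap; plotting is left out here to keep the example dependency-light.

```python
import pandas as pd

# Hypothetical stand-in for divorce.csv; all names and values are made up.
divorce = pd.DataFrame({
    "marriage_date": ["2000-06-18", "2001-09-01", "2003-02-14", "2005-07-30"],
    "marriage_duration": [5.0, 7.0, 9.0, 11.0],
    "num_kids": [1.0, 2.0, 3.0, 4.0],
})

# Update the data type from string to DateTime.
divorce["marriage_date"] = pd.to_datetime(divorce["marriage_date"])

# Extract a numeric feature from the DateTime column.
divorce["marriage_year"] = divorce["marriage_date"].dt.year

# Pearson correlation (the pandas default) between the numeric columns.
numeric_cols = ["marriage_duration", "num_kids", "marriage_year"]
corr = divorce[numeric_cols].corr()
print(corr)
```

Correlation only measures the strength of a linear relationship, so it is worth pairing the matrix with a scatter plot before drawing conclusions.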
4
Turning Exploratory Analysis into Action
Exploratory data analysis is a crucial step in the data science workflow, but it isn't the end! Now it's time to learn techniques and considerations you can use to successfully move forward with your projects after you've finished exploring!
Considerations for categorical data
50 xp
Checking for class imbalance
100 xp
Cross-tabulation
100 xp
Generating new features
50 xp
Extracting features for correlation
100 xp
Calculating salary percentiles
100 xp
Categorizing salaries
100 xp
Generating hypotheses
50 xp
Comparing salaries
100 xp
Choosing a hypothesis
100 xp
Congratulations
50 xp
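The class-imbalance, cross-tabulation, and salary-categorizing topics in this chapter could look something like the sketch below. The DataFrame is a made-up stand-in for data_science_salaries.csv. The column names (company_size, experience, salary_usd) and the bin labels are assumptions chosen for illustration, not the course's own.

```python
import pandas as pd

# Hypothetical stand-in for data_science_salaries.csv; all values are made up.
salaries = pd.DataFrame({
    "company_size": ["S", "M", "M", "L", "L", "L", "M", "L"],
    "experience": ["Entry", "Entry", "Senior", "Senior",
                   "Entry", "Senior", "Senior", "Senior"],
    "salary_usd": [40000, 50000, 90000, 120000, 60000, 110000, 80000, 130000],
})

# Class imbalance: relative frequency of each category.
class_share = salaries["experience"].value_counts(normalize=True)

# Cross-tabulation: counts for each combination of two categorical columns.
table = pd.crosstab(salaries["company_size"], salaries["experience"])

# New feature: use salary percentiles as bin edges, then label each bin.
q25 = salaries["salary_usd"].quantile(0.25)
q50 = salaries["salary_usd"].median()
q75 = salaries["salary_usd"].quantile(0.75)
salaries["salary_level"] = pd.cut(
    salaries["salary_usd"],
    bins=[0, q25, q50, q75, salaries["salary_usd"].max()],
    labels=["low", "medium", "high", "very high"],  # assumed label names
)
print(table)
print(salaries[["salary_usd", "salary_level"]])
```

pd.cut treats each bin as closed on the right by default, so the maximum salary lands in the top bin when it is used as the last edge.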

Datasets

unemployment.csv
data_science_salaries.csv
books.csv
divorce.csv
planes.csv

Collaborators

Amy Peterson

Maham Khan

Prerequisites

Introduction to Statistics in Python
Introduction to Data Visualization with Seaborn

George Boorman

Curriculum Manager, DataCamp

George is a Curriculum Manager at DataCamp. He holds a PGDip in Exercise for Health and a BSc (Hons) in Sports Science and has experience in project management across the public health, applied research, and not-for-profit sectors. George is passionate about sports, tech for good, and all things data science.

Izzy Weber

Data Coach at iO-Sphere

Izzy is a Data Coach at iO-Sphere. She discovered a love for data during her seven years as an accounting professor at the University of Washington. She holds a master's degree in Taxation and is a Certified Public Accountant. Her passion is making learning technical topics fun.

Don’t just take our word for it

Baba I.

about 2 months

The course was very informative, comprehensive and easy to understand. The course instructors did a great job with the delivery.

olumide o.

about 2 months

Very detailed step by step explanation

Diego B.

6 months

Great!

Josue U.

8 months

It was very useful.

abdul w.

10 months

Best
