Napoleon-Christos Oikonomou has completed

Manipulating DataFrames with pandas

4 hours

6,300 XP

Loved by learners at thousands of companies

Course Description

In this course, you'll learn how to leverage pandas' extremely powerful data manipulation engine to get the most out of your data. You’ll learn how to drill into the data that really matters by extracting, filtering, and transforming data from DataFrames. The pandas library has many techniques that make this process efficient and intuitive. You will learn how to tidy, rearrange, and restructure your data by pivoting or melting and stacking or unstacking DataFrames. These are all fundamental next steps on the road to becoming a well-rounded data scientist, and you will have the chance to apply all the concepts you learn to real-world datasets.

For Business

Training 2 or more people?

Get your team access to the full DataCamp library, with centralized reporting, assignments, projects and more

1
Extracting and transforming data
Free
In this chapter, you will learn how to index, slice, filter, and transform DataFrames using a variety of datasets, ranging from 2012 US election data for the state of Pennsylvania to Pittsburgh weather data.
Play Chapter Now
Indexing DataFrames
50 xp
Index ordering
50 xp
Positional and labeled indexing
100 xp
Indexing and column rearrangement
100 xp
Slicing DataFrames
50 xp
Slicing rows
100 xp
Slicing columns
100 xp
Subselecting DataFrames with lists
100 xp
Filtering DataFrames
50 xp
Thresholding data
100 xp
Filtering columns using other columns
100 xp
Filtering using NaNs
100 xp
Transforming DataFrames
50 xp
Using apply() to transform a column
100 xp
Using .map() with a dictionary
100 xp
Using vectorized functions
100 xp
2
Advanced indexing
Having learned the fundamentals of working with DataFrames, you will now move on to more advanced indexing techniques. You will learn about MultiIndexes, or hierarchical indexes, and learn how to interact with and extract data from them.
Play Chapter Now
Index objects and labeled data
50 xp
Index values and names
50 xp
Changing index of a DataFrame
100 xp
Changing index name labels
100 xp
Building an index, then a DataFrame
100 xp
Hierarchical Indexing
50 xp
Extracting data with a MultiIndex
100 xp
Setting & sorting a MultiIndex
100 xp
Using .loc[] with nonunique indexes
100 xp
Indexing multiple levels of a MultiIndex
100 xp
3
Rearranging and reshaping data
Here, you will learn how to reshape your DataFrames using techniques such as pivoting, melting, stacking, and unstacking. These are powerful techniques that allow you to tidy and rearrange your data into the optimal format for data analysis.
Play Chapter Now
Pivoting DataFrames
50 xp
Pivoting and the index
50 xp
Pivoting a single variable
100 xp
Pivoting all variables
100 xp
Stacking & unstacking DataFrames
50 xp
Stacking & unstacking I
100 xp
Stacking & unstacking II
100 xp
Restoring the index order
100 xp
Melting DataFrames
50 xp
Adding names for readability
100 xp
Going from wide to long
100 xp
Obtaining key-value pairs with melt()
100 xp
Pivot tables
50 xp
Setting up a pivot table
100 xp
Using other aggregations in pivot tables
100 xp
Using margins in pivot tables
100 xp
4
Grouping data
In this chapter, you'll learn how to identify and split DataFrames by groups or categories for further aggregation or analysis. You'll also learn how to transform and filter your data, and how to detect outliers and impute missing values. Knowing how to effectively group data in pandas can be a seriously powerful addition to your data science toolbox.
Play Chapter Now
Categoricals and groupby
50 xp
Advantages of categorical data types
50 xp
Grouping by multiple columns
100 xp
Grouping by another series
100 xp
Groupby and aggregation
50 xp
Computing multiple aggregates of multiple columns
100 xp
Aggregating on index levels/fields
100 xp
Grouping on a function of the index
100 xp
Groupby and transformation
50 xp
Detecting outliers with Z-Scores
100 xp
Filling missing data (imputation) by group
100 xp
Other transformations with .apply
100 xp
Groupby and filtering
50 xp
Grouping and filtering with .apply()
100 xp
Grouping and filtering with .filter()
100 xp
Filtering and grouping with .map()
100 xp
5
Bringing it all together
We’ll bring together everything you have learned in this course while working with data recorded from the Summer Olympic games that goes as far back as 1896! This is a rich dataset that will allow you to fully apply the data manipulation techniques you have learned. You will pivot, unstack, group, slice, and reshape your data as you explore this dataset and uncover some truly fascinating insights.
Play Chapter Now
Case study: Olympic medals
50 xp
Grouping and aggregating
50 xp
Using .value_counts() for ranking
100 xp
Using .pivot_table() to count medals by type
100 xp
Understanding the column labels
50 xp
Applying .drop_duplicates()
100 xp
Finding possible errors with .groupby()
100 xp
Locating suspicious data
100 xp
Constructing alternative country rankings
50 xp
Using .nunique() to rank by distinct sports
100 xp
Counting USA vs. USSR Cold War Olympic Sports
100 xp
Counting USA vs. USSR Cold War Olympic Medals
100 xp
Reshaping DataFrames for visualization
50 xp
Visualizing USA Medal Counts by Edition: Line Plot
100 xp
Visualizing USA Medal Counts by Edition: Area Plot
100 xp
Visualizing USA Medal Counts by Edition: Area Plot with Ordered Medals
100 xp
Congratulations!
50 xp

For Business

Training 2 or more people?

Get your team access to the full DataCamp library, with centralized reporting, assignments, projects and more

Datasets

Olympic medals Gapminder 2012 US election results (Pennsylvania)Pittsburgh weather data Sales Titanic Users

Prerequisites

Introduction to Python Intermediate Python pandas Foundations

Team Anaconda

Data Science Training

Join over 13 million learners and start Manipulating DataFrames with pandas today!

Create Your Free Account

Google LinkedIn Facebook

By continuing, you accept our Terms of Use, our Privacy Policy and that your data is stored in the USA.

Manipulating DataFrames with pandas

Loved by learners at thousands of companies

Course Description

.css-1goj2uy{margin-right:8px;}Group.css-gnv7tt{font-size:20px;font-weight:700;white-space:nowrap;}.css-12nwtlk{box-sizing:border-box;margin:0;min-width:0;color:#05192D;font-size:16px;line-height:1.5;font-size:20px;font-weight:700;white-space:nowrap;}Training 2 or more people?

Extracting and transforming data

Advanced indexing

Rearranging and reshaping data

Grouping data

Bringing it all together

GroupTraining 2 or more people?

Join over .css-ou6dz6{color:#03ef62;}13 million learners and start Manipulating DataFrames with pandas today!

Create Your Free Account

Training 2 or more people?

Training 2 or more people?

Join over 13 million learners and start Manipulating DataFrames with pandas today!