Skip to main content

Data Manipulation with pandas

Use the world’s most popular Python data science package to manipulate data and calculate summary statistics.

Start Course for Free
4 Hours15 Videos56 Exercises194,550 Learners4850 XPData Analyst TrackData Manipulation TrackData Scientist TrackPython Programmer Track

Create Your Free Account

GoogleLinkedInFacebook

or

By continuing, you accept our Terms of Use, our Privacy Policy and that your data is stored in the USA. You confirm you are at least 16 years old (13 if you are an authorized Classrooms user).

Loved by learners at thousands of companies


Course Description

pandas is the world's most popular Python library, used for everything from data manipulation to data analysis. In this course, you'll learn how to manipulate DataFrames, as you extract, filter, and transform real-world datasets for analysis. Using pandas you’ll explore all the core data science concepts. Using real-world data, including Walmart sales figures and global temperature time series, you’ll learn how to import, clean, calculate statistics, and create visualizations—using pandas to add to the power of Python!

  1. 1

    Transforming DataFrames

    Free

    Let’s master the pandas basics. Learn how to inspect DataFrames and perform fundamental manipulations, including sorting rows, subsetting, and adding new columns.

    Play Chapter Now
    Introducing DataFrames
    50 xp
    Inspecting a DataFrame
    100 xp
    Parts of a DataFrame
    100 xp
    Sorting and subsetting
    50 xp
    Sorting rows
    100 xp
    Subsetting columns
    100 xp
    Subsetting rows
    100 xp
    Subsetting rows by categorical variables
    100 xp
    New columns
    50 xp
    Adding new columns
    100 xp
    Combo-attack!
    100 xp

In the following tracks

Data Analyst Data Manipulation Data Scientist Python Programmer

Collaborators

alexandrayaroshAlex YaroshAAN94Adel Nehmeamy-4121b590-cc52-442a-9779-03eb58089e08Amy Petersonjustin-saddlemyerJustin Saddlemyer

Prerequisites

Intermediate Python
Richie Cotton Headshot

Richie Cotton

Curriculum Architect at DataCamp

Richie is a Learning Solutions Architect at DataCamp. He has been using R since 2004, in the fields of proteomics, debt collection, and chemical health and safety. He has released almost 30 R packages on CRAN and Bioconductor – most famously the assertive suite of packages – as well as creating and contributing to many others. He also has written two books on R programming, Learning R and Testing R Code.
See More
Maggie Matsui Headshot

Maggie Matsui

Curriculum Manager at DataCamp

Maggie is a Curriculum Manager at DataCamp. She holds a Bachelor's degree in Statistics and Computer Science from Brown University, where she spent lots of time teaching math, programming, and statistics as a tutor and teaching assistant. She's passionate about teaching all things data-related and making programming accessible to everyone.
See More

What do other learners have to say?

I've used other sites—Coursera, Udacity, things like that—but DataCamp's been the one that I've stuck with.

Devon Edwards Joseph
Lloyds Banking Group

DataCamp is the top resource I recommend for learning data science.

Louis Maiden
Harvard Business School

DataCamp is by far my favorite website to learn from.

Ronald Bowers
Decision Science Analytics, USAA