Skip to main content

Joining Data with pandas

Learn to combine data from multiple tables by joining data together using pandas.

Start Course for Free
4 Hours15 Videos52 Exercises48,047 Learners
4150 XP

Create Your Free Account

GoogleLinkedInFacebook

or

By continuing, you accept our Terms of Use, our Privacy Policy and that your data is stored in the USA. You confirm you are at least 16 years old (13 if you are an authorized Classrooms user).

Loved by learners at thousands of companies


Course Description

Being able to combine and work with multiple datasets is an essential skill for any aspiring Data Scientist. Pandas is a crucial cornerstone of the Python data science ecosystem, with Stack Overflow recording 5 million views for pandas questions. Learn to handle multiple DataFrames by combining, organizing, joining, and reshaping them using pandas. You'll work with datasets from the World Bank and the City Of Chicago. You will finish the course with a solid skillset for data-joining in pandas.

  1. 1

    Data Merging Basics

    Free

    Learn how you can merge disparate data using inner joins. By combining information from multiple sources you’ll uncover compelling insights that may have previously been hidden. You’ll also learn how the relationship between those sources, such as one-to-one or one-to-many, can affect your result.

    Play Chapter Now
    Inner join
    50 xp
    What column to merge on?
    50 xp
    Your first inner join
    100 xp
    Inner joins and number of rows returned
    100 xp
    One-to-many relationships
    50 xp
    One-to-many classification
    100 xp
    One-to-many merge
    100 xp
    Merging multiple DataFrames
    50 xp
    Total riders in a month
    100 xp
    Three table merge
    100 xp
    One-to-many merge with multiple tables
    100 xp
  2. 2

    Merging Tables With Different Join Types

    Take your knowledge of joins to the next level. In this chapter, you’ll work with TMDb movie data as you learn about left, right, and outer joins. You’ll also discover how to merge a table to itself and merge on a DataFrame index.

    Play Chapter Now
  3. 3

    Advanced Merging and Concatenating

    In this chapter, you’ll leverage powerful filtering techniques, including semi-joins and anti-joins. You’ll also learn how to glue DataFrames by vertically combining and using the pandas.concat function to create new datasets. Finally, because data is rarely clean, you’ll also learn how to validate your newly combined data structures.

    Play Chapter Now

In the following tracks

Data Analyst Data Manipulation Data Scientist

Collaborators

Maggie MatsuiAmy Peterson
Aaren Stubberfield Headshot

Aaren Stubberfield

Manager, Supply Chain Analytics @ Ingredion Incorporated

Manager of Supply Chain Analytics, with over 7 years of experience analyzing data to find insight for business related questions. I am responsible Supply Chain related Analytics for the NA business for $5.8 billion ingredient solutions provider to the food, beverage, brewing and pharmaceutical sectors. I graduated from DePaul University with distinction and received a MS in Predictive Analytics. I am passionate about Data Science / Machine Learning and I continue to work on my craft by learning new concepts through online classes.
See More

What do other learners have to say?

I've used other sites—Coursera, Udacity, things like that—but DataCamp's been the one that I've stuck with.

Devon Edwards Joseph
Lloyds Banking Group

DataCamp is the top resource I recommend for learning data science.

Louis Maiden
Harvard Business School

DataCamp is by far my favorite website to learn from.

Ronald Bowers
Decision Science Analytics, USAA