Skip to main content
HomePython

Joining Data with pandas

4.3+
150 reviews
Intermediate

Learn to combine data from multiple tables by joining data together using pandas.

Start Course for Free
4 hours15 videos51 exercises165,321 learnersTrophyStatement of Accomplishment

Create Your Free Account

GoogleLinkedInFacebook

or

By continuing, you accept our Terms of Use, our Privacy Policy and that your data is stored in the USA.
Group

Training 2 or more people?

Try DataCamp for Business

Loved by learners at thousands of companies


Course Description

Being able to combine and work with multiple datasets is an essential skill for any aspiring Data Scientist. pandas is a crucial cornerstone of the Python data science ecosystem, with Stack Overflow recording 5 million views for pandas questions. Learn to handle multiple DataFrames by combining, organizing, joining, and reshaping them using pandas. You'll work with datasets from the World Bank and the City Of Chicago. You will finish the course with a solid skillset for data-joining in pandas.
For Business

Training 2 or more people?

Get your team access to the full DataCamp platform, including all the features.
DataCamp for BusinessFor a bespoke solution book a demo.

In the following Tracks

Certification Available

Data Analyst in Python

Go To Track
Certification Available

Associate Data Scientist in Python

Go To Track

Data Manipulation in Python

Go To Track
  1. 1

    Data Merging Basics

    Free

    Learn how you can merge disparate data using inner joins. By combining information from multiple sources you’ll uncover compelling insights that may have previously been hidden. You’ll also learn how the relationship between those sources, such as one-to-one or one-to-many, can affect your result.

    Play Chapter Now
    Inner join
    50 xp
    What column to merge on?
    50 xp
    Your first inner join
    100 xp
    Inner joins and number of rows returned
    100 xp
    One-to-many relationships
    50 xp
    One-to-many classification
    100 xp
    One-to-many merge
    100 xp
    Merging multiple DataFrames
    50 xp
    Total riders in a month
    100 xp
    Three table merge
    100 xp
    One-to-many merge with multiple tables
    100 xp
  2. 2

    Merging Tables With Different Join Types

    Take your knowledge of joins to the next level. In this chapter, you’ll work with TMDb movie data as you learn about left, right, and outer joins. You’ll also discover how to merge a table to itself and merge on a DataFrame index.

    Play Chapter Now
  3. 3

    Advanced Merging and Concatenating

    In this chapter, you’ll leverage powerful filtering techniques, including semi-joins and anti-joins. You’ll also learn how to glue DataFrames by vertically combining and using the pandas.concat function to create new datasets. Finally, because data is rarely clean, you’ll also learn how to validate your newly combined data structures.

    Play Chapter Now
For Business

Training 2 or more people?

Get your team access to the full DataCamp platform, including all the features.

In the following Tracks

Certification Available

Data Analyst in Python

Go To Track
Certification Available

Associate Data Scientist in Python

Go To Track

Data Manipulation in Python

Go To Track

In other tracks

Python Data Fundamentals

datasets

Chicago WardsChicago Business LicensesChicago CensusChicago Demographics by Zip CodeChicago Business OwnersChicago Land UseChicago Taxi VehiclesChicago Taxi OwnersCTA RidershipCTA CalendarCTA StationsMoviesMovie ActorsMovie RatingsMovie CastsMovie CrewsMovie GenresMovie SequelsMovie Financial DataMovie Tag LinesS&P 500World Bank GDPWorld Bank Population

collaborators

Collaborator's avatar
Amy Peterson
Collaborator's avatar
Maggie Matsui
Aaren Stubberfield HeadshotAaren Stubberfield

Senior Data Scientist @ Microsoft

I am a Senior Data Scientist with expertise in Machine Learning, AI, and data governance. Currently, I work for Microsoft's Digital Advertising, which has revenues of more than $10 billion in the fiscal year 2023. However, my experience is not limited to just the advertising industry. I have worked in the Supply Chain and Data Governance industries. With my vast experience, I have led numerous teams of data scientists and have been instrumental in the successful completion of many projects. My technical skills include the use of AI, like LLMs, Python, and other various tools necessary for the execution of data science projects. My passion lies in using data to gain insights and making data-driven decisions. I constantly strive to improve my skills and knowledge and am always open to learning new techniques and tools.
See More

Don’t just take our word for it

*4.3
from 150 reviews
59%
22%
14%
3%
2%
  • Thomas K.
    about 18 hours

    Pandas in Python is likely one of the first packages you encounter in the Python universe, and one of the most powerful. Similar in functionality with dplyr in R, it can do pretty much any data wrangling task you throw at it. By taking this course at DataCamp, you build up your data manipulation skills step by step, gaining data analysis confidence along the way.

  • Johan K.
    8 days

    DataCamp's Python for Data Analysis courses are great to learn both Python and Data Analysis. The code is illustrated with the theory and the other way around. Read the excersize carefully to minimize errors in your code!

  • Carlos E.
    10 days

    .

  • P. T.
    22 days

    Great learning..Easy to understand

  • Luiz F.
    about 1 month

    Extremely usefull

"DataCamp's Python for Data Analysis courses are great to learn both Python and Data Analysis. The code is illustrated with the theory and the other way around. Read the excersize carefully to minimize errors in your code!"

Johan K.

"."

Carlos E.

"Great learning..Easy to understand"

P. T.

FAQs

Join over 15 million learners and start Joining Data with pandas today!

Create Your Free Account

GoogleLinkedInFacebook

or

By continuing, you accept our Terms of Use, our Privacy Policy and that your data is stored in the USA.