Interactive Course

Joining Data with dplyr in R

Learn to combine data across multiple tables to answer more complex questions with dplyr.

  • 4 hours
  • 13 Videos
  • 49 Exercises
  • 3,500 Participants
  • 4,200 XP

Loved by learners at thousands of top companies:

deloitte-grey.svg
intel-grey.svg
forrester-grey.svg
axa-grey.svg
credit-suisse-grey.svg
t-mobile-grey.svg

Course Description

Often in data science, you'll encounter fascinating data that is spread across multiple tables. This course will teach you the skills you'll need to join multiple tables together to analyze them in combination. You'll practice your skills using a fun dataset about LEGOs from the Rebrickable website. The dataset contains information about the sets, parts, themes, and colors of LEGOs, but is spread across many tables. You'll work with the data throughout the course as you learn a total of six different joins! You'll learn four mutating joins: inner join, left join, right join, and full join, and two filtering joins: semi join and anti join. In the final chapter, you'll apply your new skills to Stack Overflow data, containing each of the almost 300,000 Stack Oveflow questions that are tagged with R, including information about their answers, the date they were asked, and their score. Get ready to take your dplyr skills to the next level!

  1. Left and Right Joins

    Learn two more mutating joins, the left and right join, which are mirror images of each other! You'll learn use cases for each type of join as you explore parts and colors of LEGO themes. Then, you'll explore how to join tables to themselves to understand the hierarchy of LEGO themes in the data.

  2. Case Study: Joins on Stack Overflow Data

    Put together all the types of join you learned in this course to analyze a new dataset: Stack Overflow questions, answers, and tags. This includes calculating and visualizing trends for some notable tags like dplyr and ggplot2. You'll also master one more method for combining tables, the bind_rows verb, which stacks tables on top of each other.

  1. 1

    Joining Tables

    Free

    Get started with your first joining verb: inner-join! You'll learn to join tables together to answer questions about the LEGO dataset, which contains information across many tables about the sets, parts, themes, and colors of LEGOs over time.

  2. Left and Right Joins

    Learn two more mutating joins, the left and right join, which are mirror images of each other! You'll learn use cases for each type of join as you explore parts and colors of LEGO themes. Then, you'll explore how to join tables to themselves to understand the hierarchy of LEGO themes in the data.

  3. Full, Semi, and Anti Joins

    In this chapter, you'll cover three more joining verbs: full-join, semi-join, and anti-join. You'll then use these verbs to answer questions about the similarities and differences between a variety of LEGO sets.

  4. Case Study: Joins on Stack Overflow Data

    Put together all the types of join you learned in this course to analyze a new dataset: Stack Overflow questions, answers, and tags. This includes calculating and visualizing trends for some notable tags like dplyr and ggplot2. You'll also master one more method for combining tables, the bind_rows verb, which stacks tables on top of each other.

What do other learners have to say?

Devon

“I've used other sites, but DataCamp's been the one that I've stuck with.”

Devon Edwards Joseph

Lloyd's Banking Group

Louis

“DataCamp is the top resource I recommend for learning data science.”

Louis Maiden

Harvard Business School

Ronbowers

“DataCamp is by far my favorite website to learn from.”

Ronald Bowers

Decision Science Analytics @ USAA

Chris Cardillo
Chris Cardillo

Data Scientist at DataCamp

Chris is a Generalist, and actually learned programming for data science on DataCamp prior to joining the company. He is extremely passionate about helping others find the joy of coding to alleviate repetitive and time-consuming tasks. Previously, Chris was the Associate Director of Strategy at M&C Saatchi Mobile, a Data Scientist at DataCamp, and graduated with a B.S./M.B.A. from Drexel University in Philadelphia.

See More
Icon Icon Icon professional info