Skip to main content

Data Manipulation with dplyr

Learn to transform and manipulate your data using dplyr.

Start Course for Free
4 Hours13 Videos46 Exercises73,643 Learners
3850 XP

Create Your Free Account



By continuing, you accept our Terms of Use, our Privacy Policy and that your data is stored in the USA. You confirm you are at least 16 years old (13 if you are an authorized Classrooms user).

Loved by learners at thousands of companies

Course Description

Say you've found a great dataset and would like to learn more about it. How can you start to answer the questions you have about the data? You can use dplyr to answer those questions—it can also help with basic transformations of your data. You'll also learn to aggregate your data and add, remove, or change the variables. Along the way, you'll explore a dataset containing information about counties in the United States. You'll finish the course by applying these tools to the babynames dataset to explore trends of baby names in the United States.

  1. 1

    Transforming Data with dplyr


    Learn verbs you can use to transform your data, including select, filter, arrange, and mutate. You'll use these functions to modify the counties dataset to view particular observations and answer questions about the data.

    Play Chapter Now
    The counties dataset
    50 xp
    Understanding your data
    50 xp
    Selecting columns
    100 xp
    The filter and arrange verbs
    50 xp
    Arranging observations
    100 xp
    Filtering for conditions
    100 xp
    Filtering and arranging
    100 xp
    50 xp
    Calculating the number of government employees
    100 xp
    Calculating the percentage of women in a county
    100 xp
    Select, mutate, filter, and arrange
    100 xp
  2. 2

    Aggregating Data

    Now that you know how to transform your data, you'll want to know more about how to aggregate your data to make it more interpretable. You'll learn a number of functions you can use to take many observations in your data and summarize them, including count, group_by, summarize, ungroup, and top_n.

    Play Chapter Now
  3. 3

    Selecting and Transforming Data

    Learn advanced methods to select and transform columns. Also learn about select helpers, which are functions that specify criteria for columns you want to choose, as well as the rename and transmute verbs.

    Play Chapter Now
  4. 4

    Case Study: The babynames Dataset

    Work with a new dataset that represents the names of babies born in the United States each year. Learn how to use grouped mutates and window functions to ask and answer more complex questions about your data. And use a combination of dplyr and ggplot2 to make interesting graphs to further explore your data.

    Play Chapter Now

In the following tracks

Data Analyst Data Manipulation Data ScientistR Programmer


Amy Peterson
DataCamp Content Creator Headshot

DataCamp Content Creator

Course Instructor

DataCamp offers interactive R, Python, Spreadsheets, SQL and shell courses. All on topics in data science, statistics, and machine learning. Learn from a team of expert teachers in the comfort of your browser with video lessons and fun coding challenges and projects.
See More

What do other learners have to say?

I've used other sites—Coursera, Udacity, things like that—but DataCamp's been the one that I've stuck with.

Devon Edwards Joseph
Lloyds Banking Group

DataCamp is the top resource I recommend for learning data science.

Louis Maiden
Harvard Business School

DataCamp is by far my favorite website to learn from.

Ronald Bowers
Decision Science Analytics, USAA