Data Manipulation in R with dplyr

Master techniques for data manipulation using the select, mutate, filter, arrange, and summarise functions in dplyr.

Start Course for Free
4 Hours11 Videos42 Exercises104,544 Learners
3400 XP

Create Your Free Account

GoogleLinkedInFacebook

or

By continuing, you accept our Terms of Use, our Privacy Policy and that your data is stored in the USA. You confirm you are at least 16 years old (13 if you are an authorized Classrooms user).

Loved by learners at thousands of companies


Course Description

In this interactive tutorial, you will learn how to perform sophisticated dplyr techniques to carry out your data manipulation with R. First you will master the five verbs of R data manipulation with dplyr: select, mutate, filter, arrange and summarise. Next, you will learn how you can chain your dplyr operations using the pipe operator of the magrittr package. In the final section, the focus is on practicing how to subset your data using the group_by function, and how you can access data stored outside of R in a database. All said and done, you will be familiar with data manipulation tools and techniques that will allow you to efficiently manipulate data.

  1. 1

    Introduction to dplyr and tbls

    Free

    Introduction to the dplyr package and the tbl class. Learn the philosophy that guides dplyr, discover some useful applications of the dplyr package, and meet the data structures that dplyr uses behind the scenes.

    Play Chapter Now
    Section 1 - Introduction to dplyr
    50 xp
    Load the dplyr and hflights package
    100 xp
    Explore the data set
    50 xp
    Section 2 - tbl, a special type of data.frame
    50 xp
    Convert data.frame to tibble
    100 xp
    Changing labels of hflights, part 1 of 2
    100 xp
    Changing labels of hflights, part 2 of 2
    100 xp
  2. 4

    Summarize and the pipe operator

    Master the data manipulation verb summarize, and practice combining the five verbs to solve advanced data manipulation tasks. Learn to chain the operators together with the piping operator.

    Play Chapter Now
  3. 5

    Group_by and working with databases

    Complete your mastery of data manipulation with group-wise operations and databases. Learn to use group_by to group your data into subsets of observations, and use dplyr to access data stored outside of R in a database.

    Play Chapter Now

Collaborators

Filip Schouwenaars

Prerequisites

Introduction to RIntermediate R
Team RStudio Headshot

Team RStudio

See More

What do other learners have to say?

I've used other sites—Coursera, Udacity, things like that—but DataCamp's been the one that I've stuck with.

Devon Edwards Joseph
Lloyds Banking Group

DataCamp is the top resource I recommend for learning data science.

Louis Maiden
Harvard Business School

DataCamp is by far my favorite website to learn from.

Ronald Bowers
Decision Science Analytics, USAA