Skip to main content
Alex Beernink avatar

Alex Beernink has completed

Cleaning Data in R

Start course For Free
4 hr
4,700 XP
Statement of Accomplishment Badge

Loved by learners at thousands of companies


Course Description

It's commonly said that data scientists spend 80% of their time cleaning and manipulating data and only 20% of their time actually analyzing it. For this reason, it is critical to become familiar with the data cleaning process and all of the tools available to you along the way. This course provides a very basic introduction to cleaning data in R using the tidyr, dplyr, and stringr packages. After taking the course you'll be able to go from raw data to awesome insights as quickly and painlessly as possible!
For Business

Training 2 or more people?

Get your team access to the full DataCamp platform, including all the features.
DataCamp for BusinessFor a bespoke solution book a demo.
  1. 1

    Introduction and exploring raw data

    Free

    This chapter will give you an overview of the process of data cleaning with R, then walk you through the basics of exploring raw data.

    Play Chapter Now
    Introduction to Cleaning Data in R
    50 xp
    The data cleaning process
    50 xp
    Here's what messy data look like
    100 xp
    Here's what clean data look like
    100 xp
    Exploring raw data
    50 xp
    Getting a feel for your data
    100 xp
    Viewing the structure of your data
    100 xp
    Exploring raw data (part 2)
    50 xp
    Looking at your data
    100 xp
    Visualizing your data
    100 xp
For Business

Training 2 or more people?

Get your team access to the full DataCamp platform, including all the features.

datasets

Messy weather dataBMI dataCensus dataStudent data (with dates)

prerequisites

Introduction to R
Nick Carchedi HeadshotNick Carchedi

Product Manager at DataCamp

See More

Join over 18 million learners and start Cleaning Data in R today!

Create Your Free Account

or

By continuing, you accept our Terms of Use, our Privacy Policy and that your data is stored in the USA.