Skip to main content

String Manipulation with stringr in R

Learn how to pull character strings apart, put them back together and use the stringr package.

Start Course for Free
4 Hours17 Videos60 Exercises24,408 Learners
5150 XP

Create Your Free Account

GoogleLinkedInFacebook

or

By continuing, you accept our Terms of Use, our Privacy Policy and that your data is stored in the USA. You confirm you are at least 16 years old (13 if you are an authorized Classrooms user).

Loved by learners at thousands of companies


Course Description

Character strings can turn up in all stages of a data science project. You might have to clean messy string input before analysis, extract data that is embedded in text or automatically turn numeric results into a sentence to include in a report. Perhaps the strings themselves are the data of interest, and you need to detect and match patterns within them. This course will help you master these tasks by teaching you how to pull strings apart, put them back together and use stringr to detect, extract, match and split strings using regular expressions, a powerful way to express patterns.

  1. 1

    String basics

    Free

    You'll start with some basics: how to enter strings in R, how to control how numbers are transformed to strings, and finally how to combine strings together to produce output that combines text and nicely formatted numbers.

    Play Chapter Now
    Welcome!
    50 xp
    Quotes
    100 xp
    What you see isn't always what you have
    100 xp
    Escape sequences
    100 xp
    Turning numbers into strings
    50 xp
    Using format() with numbers
    100 xp
    Controlling other aspects of the string
    100 xp
    formatC()
    100 xp
    Putting strings together
    50 xp
    Annotation of numbers
    100 xp
    A very simple table
    100 xp
    Let's order pizza!
    100 xp
  2. 2

    Introduction to stringr

    Time to meet stringr! You'll start by learning about some stringr functions that are very similar to some base R functions, then how to detect specific patterns in strings, how to split strings apart and how to find and replace parts of strings.

    Play Chapter Now
  3. 3

    Pattern matching with regular expressions

    In this chapter you'll learn about regular expressions, a language for describing patterns in strings. By combining regular expressions with the stringr functions you'll greatly increase your power to manipulate strings.

    Play Chapter Now
  4. 4

    More advanced matching and manipulation

    Now for two advanced ways to use regular expressions along with stringr: selecting parts of a match (a.k.a capturing) and referring back to parts of a match (a.k.a back-referencing). You'll also learn to deal with and strings or patterns that contain Unicode characters (e.g. é).

    Play Chapter Now
  5. 5

    Case studies

    Practice your string manipulation skills on a couple of case studies. You'll also learn a few new skills, reading strings into R and handling problems of case (e.g. A versus a).

    Play Chapter Now

In the following tracks

R ProgrammerText Mining

Collaborators

Tom JeonRichie Cotton

Prerequisites

Intermediate R
Charlotte Wickham Headshot

Charlotte Wickham

Assistant Professor at Oregon State University

Charlotte is an Assistant Professor in the Department of Statistics at Oregon State University and an avid R programmer with a passion for teaching. Her interests lie in spatiotemporal data, statistical graphics and computing, and environmental statistics.
See More

What do other learners have to say?

I've used other sites—Coursera, Udacity, things like that—but DataCamp's been the one that I've stuck with.

Devon Edwards Joseph
Lloyds Banking Group

DataCamp is the top resource I recommend for learning data science.

Louis Maiden
Harvard Business School

DataCamp is by far my favorite website to learn from.

Ronald Bowers
Decision Science Analytics, USAA