Skip to main content

Introduction to Text Analysis in R

Analyze text data in R using the tidy framework.

Start Course for Free
4 Hours15 Videos46 Exercises16,224 Learners3850 XPMarketing Analytics TrackText Mining Track

Create Your Free Account



By continuing, you accept our Terms of Use, our Privacy Policy and that your data is stored in the USA. You confirm you are at least 16 years old (13 if you are an authorized Classrooms user).

Loved by learners at thousands of companies

Course Description

From social media to product reviews, text is an increasingly important type of data across applications, including marketing analytics. In many instances, text is replacing other forms of unstructured data due to how inexpensive and current it is. However, to take advantage of everything that text has to offer, you need to know how to think about, clean, summarize, and model text. In this course, you will use the latest tidy tools to quickly and easily get started with text. You will learn how to wrangle and visualize text, perform sentiment analysis, and run and interpret topic models.

  1. 1

    Wrangling Text


    Since text is unstructured data, a certain amount of wrangling is required to get it into a form where you can analyze it. In this chapter, you will learn how to add structure to text by tokenizing, cleaning, and treating text as categorical data.

    Play Chapter Now
    Text as data
    50 xp
    Airline tweets data
    100 xp
    Grouped summaries
    100 xp
    Counting categorical data
    50 xp
    Counting user types
    100 xp
    Summarizing user types
    100 xp
    Tokenizing and cleaning
    50 xp
    Tokenizing and counting
    100 xp
    Cleaning and counting
    100 xp
  2. 3

    Sentiment Analysis

    While word counts and visualizations suggest something about the content, we can do more. In this chapter, we move beyond word counts alone to analyze the sentiment or emotional valence of text.

    Play Chapter Now

In the following tracks

Marketing AnalyticsText Mining


sumedhpanchadharSumedh PanchadharchesterChester Ismay
Marc Dotson Headshot

Marc Dotson

Assistant Professor of Marketing, BYU Marriott School of Business

Marc's research is focused on applications of Bayesian inference in marketing, including choice modeling and text analysis. He teaches courses in survey research, conjoint analysis, marketing analytics, and statistical modeling.
See More

What do other learners have to say?

I've used other sites—Coursera, Udacity, things like that—but DataCamp's been the one that I've stuck with.

Devon Edwards Joseph
Lloyds Banking Group

DataCamp is the top resource I recommend for learning data science.

Louis Maiden
Harvard Business School

DataCamp is by far my favorite website to learn from.

Ronald Bowers
Decision Science Analytics, USAA