Skip to main content
Jeong Mo Yang avatar

Jeong Mo Yang has completed

Regular Expressions in Python

Start course For Free
4 hr
4,650 XP
Statement of Accomplishment Badge

Loved by learners at thousands of companies


Course Description

As a data scientist, you will encounter many situations where you will need to extract key information from huge corpora of text, clean messy data containing strings, or detect and match patterns to find useful words. All of these situations are part of text mining and are an important step before applying machine learning algorithms. This course will take you through understanding compelling concepts about string manipulation and regular expressions. You will learn how to split strings, join them back together, interpolate them, as well as detect, extract, replace, and match strings using regular expressions. On the journey to master these skills, you will work with datasets containing movie reviews or streamed tweets that can be used to determine opinion, as well as with raw text scraped from the web.
For Business

Training 2 or more people?

Get your team access to the full DataCamp platform, including all the features.
DataCamp for BusinessFor a bespoke solution book a demo.
  1. 1

    Basic Concepts of String Manipulation

    Free

    Start your journey into the regular expression world! From slicing and concatenating, adjusting the case, removing spaces, to finding and replacing strings. You will learn how to master basic operation for string manipulation using a movie review dataset.

    Play Chapter Now
    Introduction to string manipulation
    50 xp
    First day!
    100 xp
    Artificial reviews
    100 xp
    Palindromes
    100 xp
    String operations
    50 xp
    Normalizing reviews
    100 xp
    Time to join!
    100 xp
    Split lines or split the line?
    100 xp
    Finding and replacing
    50 xp
    Finding a substring
    100 xp
    Where's the word?
    100 xp
    Replacing negations
    100 xp
  2. 2

    Formatting Strings

    Following your journey, you will learn the main approaches that can be used to format or interpolate strings in python using a dataset containing information scraped from the web. You will explore the advantages and disadvantages of using positional formatting, embedding expressing inside string constants, and using the Template class.

    Play Chapter Now
  3. 3

    Regular Expressions for Pattern Matching

    Time to discover the fundamental concepts of regular expressions! In this key chapter, you will learn to understand the basic concepts of regular expression syntax. Using a real dataset with tweets meant for sentiment analysis, you will learn how to apply pattern matching using normal and special characters, and greedy and lazy quantifiers.

    Play Chapter Now
  4. 4

    Advanced Regular Expression Concepts

    In the last step of your journey, you will learn more complex methods of pattern matching using parentheses to group strings together or to match the same text as matched previously. Also, you will get an idea of how you can look around expressions.

    Play Chapter Now
For Business

Training 2 or more people?

Get your team access to the full DataCamp platform, including all the features.

datasets

Movie ReviewsWikipedia Web PageSentiment140

collaborators

Collaborator's avatar
Sara Billen
Collaborator's avatar
Hillary Green-Lerman

prerequisites

Intermediate Python
Maria Eugenia Inzaugarat HeadshotMaria Eugenia Inzaugarat

Data Scientist and Artificial Intelligence Consultant

Eugenia is a passionate, dedicated, and proactive data scientist and Artificial Intelligence Consultant that enjoys not only doing machine learning projects but also telling stories with data. She obtained a Ph.D. from the University of Buenos Aires. She has taught university courses in mathematics and biology as well as online courses on Data Science. Having transitioned from an academic background into data science, Eugenia loves teaching concepts related to python programming, data science, and machine learning to help others also gain knowledge about these fields.
See More

Join over 18 million learners and start Regular Expressions in Python today!

Create Your Free Account

or

By continuing, you accept our Terms of Use, our Privacy Policy and that your data is stored in the USA.