Pular para o conteúdo principal
Página inicialRIntermediate Regular Expressions in R

# Intermediate Regular Expressions in R

Manipulate text data, analyze it and more by mastering regular expressions and string distances in R.

Comece O Curso Gratuitamente
4 Horas14 Videos48 Exercicios
3.988 AprendizesDeclaração de Realização

## Crie sua conta gratuita

ou

Ao continuar, você aceita nossos Termos de Uso, nossa Política de Privacidade e que seus dados são armazenados nos EUA.
Treinar 2 ou mais pessoas?Experimente o DataCamp For Business

## Descrição do Curso

Analyzing data that comes in tables is fun. But what if the things that we find most interesting are not available as a neatly organized dataset but in plain text? Do not despair: In this course, you'll learn everything you need to know to create powerful regular expressions that will help you find all the information you need for your analyses from just a blob of text. But not only that. Using the concept of string distances, you will learn to work even with text that contains typos or scanning errors, as you will be able to match them to their correct counterparts from other data sources (record linkage). As a learning material, we will analyze real documents about box office figures in Swiss cinemas.
Para Empresas

### .css-1goj2uy{margin-right:8px;}Group.css-gnv7tt{font-size:20px;font-weight:700;white-space:nowrap;}.css-12nwtlk{box-sizing:border-box;margin:0;min-width:0;color:#05192D;font-size:16px;line-height:1.5;font-size:20px;font-weight:700;white-space:nowrap;}Treinar 2 ou mais pessoas?

Obtenha acesso à biblioteca completa do DataCamp, com relatórios, atribuições, projetos e muito mais centralizados
Experimente O DataCamp for BusinessPara uma solução sob medida , agende uma demonstração.
1. 1

### Regular Expressions: Writing Custom Patterns

Grátis

Regular expressions can be pretty intimidating at first as they contain vast amounts of special characters. In this chapter, you'll learn to decipher these and write your own patterns to find exactly what you're looking for.

Reproduzir Capítulo Agora
Welcome
50 xp
Starts with, ends with
100 xp
If you don't know what you're looking for
100 xp
Character classes and repetitions
50 xp
Digits, words and spaces
100 xp
Match repetitions
100 xp
Which special character did what again?
100 xp
The pipe and the question mark
50 xp
This or that
100 xp
The question mark and its two meanings
100 xp
50 xp
2. 2

### Creating Strings with Data

In this chapter, we will slightly move away from regular expressions and focus on string manipulation by creating strings from other data structures like vectors or lists.

3. 3

### Extracting Structured Data From Text

One task where regular expressions really shine is making sense from a blob of text. In this chapter, you'll learn to extract the information from messy data that doesn't come in neatly arranged tables but in plain text.

4. 4

### Similarities Between Strings

In the last chapter, we will shift gears away from regular expressions to understanding string distances. By calculating the differences of multiple strings, we can match those that are similar. This will help us to find duplicates even when they contain small errors like typos. This is an important part to record linkage where we combine datasets from multiple sources.

Para Empresas

### GroupTreinar 2 ou mais pessoas?

Obtenha acesso à biblioteca completa do DataCamp, com relatórios, atribuições, projetos e muito mais centralizados

Pre Requisitos

Introduction to the TidyverseString Manipulation with stringr in R
Benja Zehr

Data Journalist

Veja Mais