Luiz Felipe Pereira Figueiredo has completed

Introduction to the Tidyverse

4 hr

4,150 XP

Loved by learners at thousands of companies

Course Description

This is an introduction to the programming language R, focused on a powerful set of tools known as the Tidyverse. You'll learn the intertwined processes of data manipulation and visualization using the tools dplyr and ggplot2. You'll learn to manipulate data by filtering, sorting, and summarizing a real dataset of historical country data in order to answer exploratory questions. You'll then learn to turn this processed data into informative line plots, bar plots, histograms, and more with the ggplot2 package. You’ll get a taste of the value of exploratory data analysis and the power of Tidyverse tools. This is a suitable introduction for those who have no previous experience in R and are interested in performing data analysis.The videos contain live transcripts you can reveal by clicking "Show transcript" at the bottom left of the videos. The course glossary can be found on the right in the resources section. To obtain CPE credits you need to complete the course and reach a score of 70% on the qualified assessment. You can navigate to the assessment by clicking on the CPE credits callout on the right.

For Business

Training 2 or more people?

Get your team access to the full DataCamp platform, including all the features.

1
Data wrangling
Free
In this chapter, you'll learn to do three things with a table: filter for particular observations, arrange the observations in a desired order, and mutate to add or change a column. You'll see how each of these steps allows you to answer questions about your data.
Play Chapter Now
The gapminder dataset
50 xp
Loading the gapminder and dplyr packages
100 xp
Understanding a data frame
50 xp
The filter verb
50 xp
Filtering for one year
100 xp
Filtering for one country and one year
100 xp
The arrange verb
50 xp
Arranging observations by life expectancy
100 xp
Filtering and arranging
100 xp
The mutate verb
50 xp
Using mutate to change or create a column
100 xp
Combining filter, mutate, and arrange
100 xp
2
Data visualization
Often a better way to understand and present data as a graph. In this chapter, you'll learn the essential skills of data visualization using the ggplot2 package, and you'll see how the dplyr and ggplot2 packages work closely together to create informative graphs.
Play Chapter Now
Visualizing with ggplot2
50 xp
Variable assignment
100 xp
Comparing population and GDP per capita
100 xp
Comparing population and life expectancy
100 xp
Log scales
50 xp
Putting the x-axis on a log scale
100 xp
Putting the x- and y- axes on a log scale
100 xp
Additional aesthetics
50 xp
Adding color to a scatter plot
100 xp
Adding size and color to a plot
100 xp
Faceting
50 xp
Creating a subgraph for each continent
100 xp
Faceting by year
100 xp
3
Grouping and summarizing
So far you've been answering questions about individual country-year pairs, but you may be interested in aggregations of the data, such as the average life expectancy of all countries within each year. Here you'll learn to use the group by and summarize verbs, which collapse large datasets into manageable summaries.
Play Chapter Now
The summarize verb
50 xp
Summarizing the median life expectancy
100 xp
Summarizing the median life expectancy in 1957
100 xp
Summarizing multiple variables in 1957
100 xp
The group_by verb
50 xp
Summarizing by year
100 xp
Summarizing by continent
100 xp
Summarizing by continent and year
100 xp
Visualizing summarized data
50 xp
Visualizing median life expectancy over time
100 xp
Visualizing median GDP per capita per continent over time
100 xp
Comparing median life expectancy and median GDP per continent in 2007
100 xp
4
Types of visualizations
In this chapter, you'll learn how to create line plots, bar plots, histograms, and boxplots. You'll see how each plot requires different methods of data manipulation and preparation, and you’ll understand how each of these plot types plays a different role in data analysis.
Play Chapter Now
Line plots
50 xp
Visualizing median GDP per capita over time
100 xp
Visualizing median GDP per capita by continent over time
100 xp
Bar plots
50 xp
Visualizing median GDP per capita by continent
100 xp
Visualizing GDP per capita by country in Oceania
100 xp
Histograms
50 xp
Visualizing population
100 xp
Visualizing population with x-axis on a log scale
100 xp
Boxplots
50 xp
Comparing GDP per capita across continents
100 xp
Adding a title to your graph
100 xp
Conclusion
50 xp

For Business

Training 2 or more people?

Get your team access to the full DataCamp platform, including all the features.

In other tracks

Tidyverse Fundamentals

datasets

Gapminder Course Glossary

collaborators

Yashas Roy

Chester Ismay

David Robinson

Principal Data Scientist at Heap

Dave is the Principal Data Scientist at Heap. He has worked as a data scientist at DataCamp and Stack Overflow, and received his PhD in Quantitative and Computational Biology from Princeton University. Follow him at @drob on Twitter or on his blog, Variance Explained.

Join over 18 million learners and start Introduction to the Tidyverse today!

Create Your Free Account

By continuing, you accept our Terms of Use, our Privacy Policy and that your data is stored in the USA.

Introduction to the Tidyverse

Loved by learners at thousands of companies

Course Description

.css-10r9e5n{-webkit-margin-end:8px;margin-inline-end:8px;}.css-1309hh9{-webkit-flex-shrink:0;-ms-flex-negative:0;flex-shrink:0;-webkit-margin-end:8px;margin-inline-end:8px;}Training 2 or more people?

Data wrangling

Data visualization

Grouping and summarizing

Types of visualizations

Training 2 or more people?

Join over .css-ou6dz6{color:#03ef62;}18 million learners and start Introduction to the Tidyverse today!

Create Your Free Account

Training 2 or more people?

Join over 18 million learners and start Introduction to the Tidyverse today!