Premium project

Importing and Cleaning Data

Apply your importing and data cleaning skills to real-world soccer data.

Start Project
10 Tasks1,500 XP

Loved by learners at thousands of companies


Project Description

Your boss at _Crunching Numbers_ needs you to determine which match and stadium had the highest attendance during the 2019 FIFA Women's World Cup. Use your data import and cleaning skills to dig through the dirty data, clean it up, and present your boss with polished graphs. These data come from the online 2019 FIFA Women's World Cup [match reports](https://www.fifa.com/womensworldcup/matches/?#groupphase).

Project Tasks

  1. 1
    Importing data part 1
  2. 2
    Importing data part 2
  3. 3
    Removing rows of NA
  4. 4
    Replacing NA
  5. 5
    separate() and replace_na()
  6. 6
    Plotting for outliers
  7. 7
    What to do with the outlier?
  8. 8
    A pretty boxplot
  9. 9
    A pretty line plot
  10. 10
    Wrap up
Technologies
R R
Topics
Data ManipulationImporting & Cleaning DataCase Studies
Erin LaBrecque Headshot

Erin LaBrecque

Instructor at DataCamp
Erin is a marine geospatial research ecologist who combines physical and biological spatiotemporal data to understand marine ecosystems. She received her Ph.D. in Marine Science and Conservation from Duke University and is passionate about science communication and data visualization. When she is not playing with environmental and species datasets, Erin can be found hiking with her dog.
See More

What do other learners have to say?

I've used other sites—Coursera, Udacity, things like that—but DataCamp's been the one that I've stuck with.

Devon Edwards Joseph
Lloyds Banking Group

DataCamp is the top resource I recommend for learning data science.

Louis Maiden
Harvard Business School

DataCamp is by far my favorite website to learn from.

Ronald Bowers
Decision Science Analytics, USAA