There's a commonly cited statistic that whenever you get a new dataset, 80% of the project time is just spent cleaning the dataset. Even though it's such a commonly used skill, it's one of the toughest data analysis skills to crack. Of all the candidates who go through DataCamp Certifications, data cleaning is the step that is the most common reason to fail.
In this session, you'll learn about the common types of "data dirtiness" and the techniques you need to clean them. You'll also learn about the common mistakes made when cleaning data, and how to avoid them.
This session is essential for anyone considering taking any of the DataCamp Certifications or anyone who has to deal with dirty datasets.
Presenter Bio
Aimee GottHead of Certification Content
Aimée leads the Certification Content team in the design and development of the certification curriculum. Before joining DataCamp she worked at a data science consulting firm where she specialized in teaching data science skills across a range of industries.