Course

Cleaning Data in PostgreSQL Databases

Skill Level: Intermediate
Rating: 4.8+ (415 reviews)
Updated 09/2022
Learn to tame your raw, messy data stored in a PostgreSQL database to extract accurate insights.
SQL | Data Preparation | 4 hr | 15 videos | 49 Exercises | 4,050 XP | 13,950 | Statement of Accomplishment


Course Description

If you surveyed a large number of data scientists and data analysts about which tasks are most common in their workday, cleaning data would likely be in almost all responses. This is the case because real-world data is messy. To help you tame messy data, this course teaches you how to clean data stored in a PostgreSQL database. You’ll learn how to solve common problems such as how to clean messy strings, deal with empty values, compare the similarity between strings, and much more. You’ll get hands-on practice with these tasks using interesting (but messy) datasets made available by New York City's Open Data program. Are you ready to whip that messy data into shape?

Prerequisites

Data Manipulation in SQL

1. Data Cleaning Basics

In this chapter, you’ll gain an understanding of data cleaning approaches when working with PostgreSQL databases and learn the value of cleaning data as early as possible in the pipeline. You’ll also learn basic string editing approaches such as removing unnecessary spaces as well as more involved topics such as pattern matching and string similarity to identify string values in need of cleaning.
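As a rough illustration of the string-cleaning ideas described above, the sketch below flags values with stray padding using a LIKE pattern and then trims and normalizes them. It is not taken from the course: the `street_raw` table and its contents are hypothetical, and Python's built-in `sqlite3` module stands in for a PostgreSQL connection so the example runs without a server. The SQL uses only TRIM, UPPER, and LIKE, which PostgreSQL also provides.

```python
import sqlite3

# In-memory SQLite database as a stand-in for PostgreSQL (assumption).
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE street_raw (street TEXT)")  # hypothetical table
conn.executemany(
    "INSERT INTO street_raw (street) VALUES (?)",
    [("  Broadway ",), ("Main St",), ("MAIN ST  ",)],
)

# Pattern matching: count values with a leading or trailing space.
needs_trim = conn.execute(
    "SELECT COUNT(*) FROM street_raw "
    "WHERE street LIKE ' %' OR street LIKE '% '"
).fetchone()[0]

# Trim the padding and normalize case so equivalent values compare equal.
cleaned = [
    row[0]
    for row in conn.execute(
        "SELECT DISTINCT UPPER(TRIM(street)) AS street_clean "
        "FROM street_raw ORDER BY street_clean"
    )
]

print(needs_trim)  # number of rows that need trimming
print(cleaned)     # distinct street names after cleaning
```

Running the pattern check before the cleanup gives a quick sense of how much of a column is affected, which helps decide whether a fix belongs in the query or earlier in the pipeline.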

2. Missing, Duplicate, and Invalid Data

3. Converting Data

4. Transforming Data



FAQs

What PostgreSQL functions will I learn for cleaning messy data?

You'll learn COALESCE for missing data, pattern matching and string similarity functions, CAST for type conversion, and CONCAT, SUBSTRING, and REGEXP_SPLIT_TO_TABLE for transforming data.

What real-world datasets are used in the exercises?

You work with datasets from New York City's Open Data program, including postal data that you split into city, state, and zip code components in the final chapter.

Does the course cover handling missing and duplicate data?

Yes. Chapter 2 is dedicated to solving problems with missing, duplicate, and invalid data using techniques like COALESCE, targeted SELECT queries, and WHERE clause filtering.
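To make these techniques concrete, here is a minimal sketch of COALESCE filling in missing values and a targeted SELECT that surfaces duplicates. It is an illustration rather than course material: the `inspection` table and its rows are invented, and Python's built-in `sqlite3` module is used in place of a PostgreSQL connection so the snippet is self-contained. The SQL statements themselves are written to be valid in PostgreSQL as well.

```python
import sqlite3

# In-memory SQLite database as a stand-in for PostgreSQL (assumption).
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE inspection (id INTEGER, borough TEXT)")  # hypothetical
conn.executemany(
    "INSERT INTO inspection (id, borough) VALUES (?, ?)",
    [(1, "Queens"), (2, None), (2, None), (3, "Bronx")],
)

# COALESCE returns its first non-NULL argument, so missing boroughs
# are reported as 'Unknown' instead of NULL. DISTINCT collapses the
# repeated row for id 2.
filled = conn.execute(
    "SELECT DISTINCT id, COALESCE(borough, 'Unknown') "
    "FROM inspection ORDER BY id"
).fetchall()

# Targeted SELECT: list ids that appear more than once.
duplicate_ids = [
    row[0]
    for row in conn.execute(
        "SELECT id FROM inspection GROUP BY id HAVING COUNT(*) > 1"
    )
]

print(filled)
print(duplicate_ids)
```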

Will I learn to convert data types in PostgreSQL?

Yes. Chapter 3 covers converting text to numeric types and formatting strings as temporal data, which are common tasks when cleaning data stored in PostgreSQL databases.
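The sketch below shows why converting text to a numeric type matters: aggregates over a text column compare values character by character, while CAST makes them compare as numbers. The `fine_raw` table and its values are hypothetical, and Python's built-in `sqlite3` module stands in for PostgreSQL so the example runs anywhere. Formatting strings as temporal data is not shown, because the date functions differ between SQLite and PostgreSQL.

```python
import sqlite3

# In-memory SQLite database as a stand-in for PostgreSQL (assumption).
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE fine_raw (amount TEXT)")  # hypothetical table
conn.executemany(
    "INSERT INTO fine_raw (amount) VALUES (?)",
    [("115",), ("65",), ("9",)],
)

# Without conversion, MAX compares the values as text.
text_max = conn.execute(
    "SELECT MAX(amount) FROM fine_raw"
).fetchone()[0]

# CAST converts each value to an integer before aggregating.
numeric_max = conn.execute(
    "SELECT MAX(CAST(amount AS INTEGER)) FROM fine_raw"
).fetchone()[0]
total = conn.execute(
    "SELECT SUM(CAST(amount AS INTEGER)) FROM fine_raw"
).fetchone()[0]

print(text_max, numeric_max, total)
```

Comparing `text_max` with `numeric_max` is a quick way to check whether a column that looks numeric is actually stored as text.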

What SQL background do I need for this course?

You need Introduction to SQL, Intermediate SQL, Data Manipulation in SQL, and Joining Data in SQL. This intermediate course builds on solid SQL foundations.
