Interactive Course

Cleaning Data in SQL Server Databases

Develop the skills you need to clean raw data and transform it into accurate insights.

  • 4 hours
  • 13 Videos
  • 48 Exercises
  • 3,204 Participants
  • 3,750 XP

Loved by learners at thousands of top companies:


Course Description

Did you know that data scientists and data analysts spend a large amount of time cleaning data before they can analyze it? This is because real-world data is messy. To help you navigate messy data this course teaches you how to clean data stored in an SQL Server database. You’ll learn how to solve common problems such as how to clean messy strings, deal with empty values, compare the similarity between strings, and much more. You’ll get hands-on with all these tasks using a wide range of interesting and messy datasets, including monthly airline flights by airport, TV series and paper shop sales. Are you ready to get your hands messy?

  1. 1

    Starting with Cleaning Data


    To begin the course, you will learn why cleaning data is important. You will solve simple problems such as leading and trailing spaces in strings, unifying formats for flight registrations, combining strings and more.

  2. Dealing with missing data, duplicate data, and different date formats

    In this chapter, you will deepen your data cleaning knowledge. You will learn how to deal with missing data, avoid duplicate data in your datasets, and work with different formats of dates.

  3. Dealing with out of range values, different data types, and pattern matching

    In this chapter, you will deal with out of range values and inaccurate data. You will also practice converting data with different types. Finally, you will work on matching patterns to your data to find outliers.

  4. Combining, splitting, and transforming data

    In this final chapter, you will learn how to combine and aggregate data of some columns into one, split data of one column into more columns, and transform rows into columns and vice versa.

What do other learners have to say?


“I've used other sites, but DataCamp's been the one that I've stuck with.”

Devon Edwards Joseph

Lloyd's Banking Group


“DataCamp is the top resource I recommend for learning data science.”

Louis Maiden

Harvard Business School


“DataCamp is by far my favorite website to learn from.”

Ronald Bowers

Decision Science Analytics @ USAA

Miriam Antona
Miriam Antona

Software Engineer

Miriam Antona has 10+ years of experience working as a Software Engineer. She is focused on analyzing, designing, and developing software applications for Civil Services and Justice Administration. Miriam loves programming and experimenting with different technologies. She is passionate about databases and enjoys playing with data. She holds a Master of Science Degree in Computer Engineering.

See More
Icon Icon Icon professional info