Subscribe now. Save 50% on DataCamp and commit to learning data science and analytics.

Offer ends in  days  hrs  mins  secs
Interactive Course

SQL for Exploratory Data Analysis

Learn how to explore what's available in a database: the tables, relationships between them, and data stored in them.

  • 4 hours
  • 16 Videos
  • 58 Exercises
  • 5,840 Participants
  • 4,800 XP

Loved by learners at thousands of top companies:

forrester-grey.svg
ikea-grey.svg
ebay-grey.svg
uber-grey.svg
deloitte-grey.svg
intel-grey.svg

Course Description

You have access to a database. Now what do you do? Building on your existing skills joining tables, using basic functions, grouping data, and using subqueries, the next step in your SQL journey is learning how to explore a database and the data in it. Using data from Stack Overflow, Fortune 500 companies, and 311 help requests from Evanston, IL, you'll get familiar with numeric, character, and date/time data types. You'll use functions to aggregate, summarize, and analyze data without leaving the database. Errors and inconsistencies in the data won't stop you! You'll learn common problems to look for and strategies to clean up messy data. By the end of this course, you'll be ready to start exploring your own PostgreSQL databases and analyzing the data in them.

  1. 1

    What's in the database?

    Free

    Start exploring a database by identifying the tables and the foreign keys that link them. Look for missing values, count the number of observations, and join tables to understand how they're related. Learn about coalescing and casting data along the way.

  2. Exploring categorical data and unstructured text

    Text, or character, data can get messy, but you'll learn how to deal with inconsistencies in case, spacing, and delimiters. Learn how to use a temporary table to recode messy categorical data to standardized values you can count and aggregate. Extract new variables from unstructured text as you explore help requests submitted to the city of Evanston, IL.

  3. Summarizing and aggregating numeric data

    You'll build on functions like min and max to summarize numeric data in new ways. Add average, variance, correlation, and percentile functions to your toolkit, and learn how to truncate and round numeric values too. Build complex queries and save your results by creating temporary tables.

  4. Working with dates and timestamps

    What time is it? In this chapter, you'll learn how to find out. You'll aggregate date/time data by hour, day, month, or year and practice both constructing time series and finding gaps in them.

  1. 1

    What's in the database?

    Free

    Start exploring a database by identifying the tables and the foreign keys that link them. Look for missing values, count the number of observations, and join tables to understand how they're related. Learn about coalescing and casting data along the way.

  2. Summarizing and aggregating numeric data

    You'll build on functions like min and max to summarize numeric data in new ways. Add average, variance, correlation, and percentile functions to your toolkit, and learn how to truncate and round numeric values too. Build complex queries and save your results by creating temporary tables.

  3. Exploring categorical data and unstructured text

    Text, or character, data can get messy, but you'll learn how to deal with inconsistencies in case, spacing, and delimiters. Learn how to use a temporary table to recode messy categorical data to standardized values you can count and aggregate. Extract new variables from unstructured text as you explore help requests submitted to the city of Evanston, IL.

  4. Working with dates and timestamps

    What time is it? In this chapter, you'll learn how to find out. You'll aggregate date/time data by hour, day, month, or year and practice both constructing time series and finding gaps in them.

What do other learners have to say?

Devon

“I've used other sites, but DataCamp's been the one that I've stuck with.”

Devon Edwards Joseph

Lloyd's Banking Group

Louis

“DataCamp is the top resource I recommend for learning data science.”

Louis Maiden

Harvard Business School

Ronbowers

“DataCamp is by far my favorite website to learn from.”

Ronald Bowers

Decision Science Analytics @ USAA

Christina Maimone
Christina Maimone

Data Scientist, Northwestern University

Christina Maimone leads Research Data Services at Northwestern University with the IT Research Computing Services group. She enables innovative research by providing data science, programming, and software development support for researchers. Through consultations, project collaborations, user groups, and workshops, the Research Data Services team ensures researchers have the resources, services, and skills they need to overcome challenges in their work. Christina regularly uses R, Python, and SQL but enjoys the challenge of using a wide range of programs and languages in her work. She has a PhD in political science and an MS in statistics from Stanford.

See More
Icon Icon Icon professional info