Win Tun Lin has completed
Introduction to Data Quality with Great Expectations
Start course For Free4 hr
3,500 XP

Loved by learners at thousands of companies
Course Description
Great Expectations is a powerful tool for monitoring data quality in data science and data engineering workflows. The platform can be easily integrated into Python, making it a useful library for Python users to master.
At the core of Great Expectations are Expectations, or assertions that you'd like to verify about your data. You'll begin this course by learning how to connect to real-world datasets and apply Expectations to them. You'll then learn how to retrieve, edit, delete Expectations, and build pipelines for applying Expectations to new datasets in a production deployment.
Finally, you'll learn about specific types of Expectations, such as for numeric and string columns, and how to write Expectations of one column conditional on the values of other columns.
By the end of this course, you'll have a strong foundation in the Great Expectations Python library. You'll be able to use the platform's core functionalities to monitor the quality of your data, and you'll be able to use your data with confidence that it meets your data quality standards.
Training 2 or more people?
Get your team access to the full DataCamp platform, including all the features.- 1
Connecting to Data
FreeUnderstand why Great Expectations (GX) is such a powerful tool for monitoring data quality. Get familiar with the basics of GX, including how to start a session using a Data Context, and how to load in a pandas dataframe using a Data Source, Data Asset, and Batch Definition.
- 2
Establishing Expectations
Create and evaluate basic shape and schema Expectations. Validate your Expectations either individually, as part of an Expectation Suite with a Batch Definition, or using a Validation Definition.
Create Expectations50 xpEstablish a Column Count Expectation100 xpEvaluate Validation Results100 xpSchema Expectations50 xpEstablish a Column Name Expectation100 xpEstablish a Column Range Expectation100 xpCreate a Suite of Expectations50 xpCreate an Expectation Suite100 xpValidate your Expectation Suite100 xpValidate Expectation Suites50 xpCreate and Run a Validation Definition100 xpValidation Definition vs. Batch Definition100 xp - 3
GX in Practice
Learn practical skills that will help you dominate the dynamic nature of Expectations in the real world. Deploy Validation Definitions using Checkpoints; update your Expectation Suites; and learn how to add, retrieve, list, and delete key GX components.
- 4
All About Expectations
Dive head-first into the world of Expectations. Practice creating basic column Expectations, row- and aggregate-level numeric Expectations, string and string parseability Expectations, and more. Learn how to apply Expectations to only some rows of a dataframe.
Basic Column Expectations50 xpEstablish row-level Expectations100 xpEstablish aggregate-level Expectations100 xpType-Specific Expectations50 xpEstablish numeric Expectations100 xpEstablish string Expectations100 xpConditional Expectations50 xpWrite up a Conditional Expectation100 xpInvert a Conditional Expectation100 xpWrap-Up50 xp
Training 2 or more people?
Get your team access to the full DataCamp platform, including all the features.datasets
weather_new.csvmovie_dataset.csvlife_expectancy.csvrenewable_new.csvshein_footwear_clean.csvcollaborators


prerequisites
Data Manipulation with pandasData Scientist & CTO
I'm a data scientist by trade, having worked at companies like Allstate and AmFam. Now, I have my own startup, where I work as the CTO. I'm passionate about leveraging data for health equity, combating climate change, and general social good.
Join over 18 million learners and start Introduction to Data Quality with Great Expectations today!
Create Your Free Account
or
By continuing, you accept our Terms of Use, our Privacy Policy and that your data is stored in the USA.