课程

ETL and ELT in Python

中级技能水平

更新时间 2026年1月

Learn to build effective, performant, and reliable data pipelines using Extract, Transform, and Load principles.

免费开始课程

PythonData Engineering4 小时14 视频53 练习4,450 经验值35,851成就声明

创建您的免费帐户

或

继续操作即表示您接受我们的《使用条款》和《隐私政策》，并同意您的数据存储在美国。

深受数千家公司学习者的喜爱

培训2人或更多？

试用DataCamp for Business

课程描述

Empowering Analytics with Data Pipelines

Data pipelines are at the foundation of every strong data platform. Building these pipelines is an essential skill for data engineers, who provide incredible value to a business ready to step into a data-driven future. This introductory course will help you hone the skills to build effective, performant, and reliable data pipelines.

Building and Maintaining ETL Solutions

Throughout this course, you’ll dive into the complete process of building a data pipeline. You’ll grow skills leveraging Python libraries such as pandas and json to extract data from structured and unstructured sources before it’s transformed and persisted for downstream use. Along the way, you’ll develop confidence tools and techniques such as architecture diagrams, unit-tests, and monitoring that will help to set your data pipelines out from the rest. As you progress, you’ll put your new-found skills to the test with hands-on exercises.

Supercharge Data Workflows

After completing this course, you’ll be ready to design, develop and use data pipelines to supercharge your data workflow in your job, new career, or personal project.

先决条件

Data Warehousing Concepts Streamlined Data Ingestion with pandas

1

Introduction to Data Pipelines

Get ready to discover how data is collected, processed, and moved using data pipelines. You will explore the qualities of the best data pipelines, and prepare to design and build your own.

Introduction to ETL and ELT Pipelines

50 经验值

Running an ETL Pipeline

100 经验值

ELT in Action

100 经验值

ETL and ELT Pipelines

50 经验值

Building ETL and ELT Pipelines

50 经验值

Building an ETL Pipeline

100 经验值

The "T" in ELT

100 经验值

Extracting, Transforming, and Loading Student Scores Data

100 经验值

2

Building ETL Pipelines

Dive into leveraging pandas to extract, transform, and load data as you build your first data pipelines. Learn how to make your ETL logic reusable, and apply logging and exception handling to your pipelines.

Extracting data from structured sources

50 经验值

Extracting data from parquet files

100 经验值

Pulling data from SQL databases

100 经验值

Building functions to extract data

100 经验值

Transforming data with pandas

50 经验值

Filtering pandas DataFrames

100 经验值

Transforming sales data with pandas

100 经验值

Validating data transformations

100 经验值

Persisting data with pandas

50 经验值

Loading sales data to a CSV file

100 经验值

Customizing a CSV file

100 经验值

Persisting data to files

100 经验值

Monitoring a data pipeline

50 经验值

Logging within a data pipeline

100 经验值

Handling exceptions when loading data

100 经验值

Monitoring and alerting within a data pipeline

100 经验值

3

Advanced ETL Techniques

Supercharge your workflow with advanced data pipelining techniques, such as working with non-tabular data and persisting DataFrames to SQL databases. Discover tooling to tackle advanced transformations with pandas, and uncover best-practices for working with complex data.

Extracting non-tabular data

50 经验值

Ingesting JSON data with pandas

100 经验值

Reading JSON data into memory

100 经验值

Transforming non-tabular data

50 经验值

Iterating over dictionaries

100 经验值

Parsing data from dictionaries

100 经验值

Transforming JSON data

100 经验值

Transforming and cleaning DataFrames

100 经验值

Advanced data transformation with pandas

50 经验值

Filling missing values with pandas

100 经验值

Grouping data with pandas

100 经验值

Applying advanced transformations to DataFrames

100 经验值

Loading data to a SQL database with pandas

50 经验值

Loading data to a Postgres database

100 经验值

Validating data loaded to a Postgres Database

100 经验值

4

Deploying and Maintaining a Data Pipeline

In this final chapter, you’ll create frameworks to validate and test data pipelines before shipping them into production. After you’ve tested your pipeline, you’ll explore techniques to run your data pipeline end-to-end, all while allowing for visibility into pipeline performance.

Manually testing a data pipeline

50 经验值

Testing data pipelines

50 经验值

Validating a data pipeline at "checkpoints"

100 经验值

Testing a data pipeline end-to-end

100 经验值

Unit-testing a data pipeline

50 经验值

Validating a data pipeline with assert

100 经验值

Writing unit tests with pytest

100 经验值

Creating fixtures with pytest

100 经验值

Unit testing a data pipeline with fixtures

100 经验值

Running a data pipeline in production

50 经验值

Orchestration and ETL tools

50 经验值

Data pipeline architecture patterns

100 经验值

Running a data pipeline end-to-end

100 经验值

Congratulations!

50 经验值

ETL and ELT in Python

课程完成

获得成就证明

将此证书添加到你的 LinkedIn 档案、简历或履历中
在社交媒体和绩效评估中分享立即注册

加入超过19百万学习者，今天就开始ETL and ELT in Python！

创建您的免费帐户

或

继续操作即表示您接受我们的《使用条款》和《隐私政策》，并同意您的数据存储在美国。

通过 DataCamp for Mobile 提升您的数据技能

随时随地通过我们的移动课程和每日 5 分钟编程挑战提升技能。