Skip to main content
HomePythonIntroduction to Data Pipelines

Introduction to Data Pipelines

4.6+
11 reviews
Intermediate

This introductory course will help you hone the skills to build effective, performant, and reliable data pipelines.

Start Course for Free
4 Hours15 Videos57 Exercises
6,342 LearnersTrophyStatement of Accomplishment

Create Your Free Account

GoogleLinkedInFacebook

or

By continuing, you accept our Terms of Use, our Privacy Policy and that your data is stored in the USA.
GroupTraining 2 or more people?Try DataCamp For Business

Loved by learners at thousands of companies


Course Description

Empowering Analytics with Data Pipelines

Data pipelines are at the foundation of every strong data platform. Building these pipelines is an essential skill for data engineers, who provide incredible value to a business ready to step into a data-driven future. This introductory course will help you hone the skills to build effective, performant, and reliable data pipelines.

Building and Maintaining ETL Solutions

Throughout this course, you’ll dive into the complete process of building a data pipeline. You’ll grow skills leveraging Python libraries such as `pandas` and `json` to extract data from structured and unstructured sources before it’s transformed and persisted for downstream use. Along the way, you’ll grow confidence tools and techniques such as architecture diagrams, unit-tests, and monitoring that will help to set your data pipelines out from the rest. As you progress, you’ll put your new-found skills to the test with hands-on exercises.

Supercharge Data Workflows

After completing this course, you’ll be ready to design, develop and use data pipelines to supercharge your data workflow in your job, new career, or personal project.

For Business

GroupTraining 2 or more people?

Get your team access to the full DataCamp library, with centralized reporting, assignments, projects and more
Try DataCamp for BusinessFor a bespoke solution book a demo.
  1. 1

    Introduction to Data Pipelines

    Free

    Get ready to discover how data is collected, processed, and moved using data pipelines. You will explore the qualities of the best data pipelines, and prepare to design and build your own.

    Play Chapter Now
    Introducing data pipelines
    50 xp
    What is a data pipeline?
    50 xp
    Components of a data pipeline
    100 xp
    Producers and consumers of data pipelines
    100 xp
    Designing data pipelines
    50 xp
    Architecture diagrams for data pipelines
    50 xp
    Reading architecture diagrams
    50 xp
    Data pipeline design process
    100 xp
    Qualities of great data pipelines
    50 xp
    Building quality data pipelines
    50 xp
    Persisting data throughout a pipeline
    50 xp
    Qualities of sound data pipelines
    100 xp

In the following tracks

Data Engineer in PythonMachine Learning Engineer

Collaborators

Collaborator's avatar
George Boorman
Collaborator's avatar
Arne Warnke
Collaborator's avatar
Anastasia Dvoryanchikova
Collaborator's avatar
Katerina Zahradova
Jake Roach HeadshotJake Roach

Data Engineer

Jake is a Data Engineer at Delaware North, and DataCamp Instructor. He and his team are building a state-of-the-art data platform for a multi-billion dollar organization, powered by Astronomer, Airflow, AWS, and Databricks. Born and raised in Buffalo, NY, when he's not working with data, you can find him out at the golf course playing a quick nine holes before dark!
See More

Don’t just take our word for it

*4.6
from 11 reviews
73%
18%
9%
0%
0%
Sort by
  • Stefan C.
    5 months

    Good course

  • Alex N.
    9 months

    This course offers valuable insights into classes and inheritance in Python. There should be more software engineering and best coding practices design on the platform.

  • Sorin I.
    3 months

    Excellent courses and practice mode, very nice UI/UX and trainer

  • Robert T.
    5 months

    This was a great intro to the concepts. Overall it walked through a lot of key items all through the ETL process. The section on testing felt a bit rushed and didn't have a ton of explanation to it, but it did talk about the importance of testing the work before sending it to prod.

  • Mauricio P.
    9 months

    Very funny and useful

"Good course"

Stefan C.

"This course offers valuable insights into classes and inheritance in Python. There should be more software engineering and best coding practices design on the platform."

Alex N.

"Excellent courses and practice mode, very nice UI/UX and trainer"

Sorin I.

Join over 13 million learners and start Introduction to Data Pipelines today!

Create Your Free Account

GoogleLinkedInFacebook

or

By continuing, you accept our Terms of Use, our Privacy Policy and that your data is stored in the USA.