Skip to main content
HomeGoogle Cloud

Course

Introduction to Data Engineering on Google Cloud

BasicSkill Level
Updated 05/2026
Learn the data engineering role on Google Cloud. Explore data sources, storage solutions, ETL/ELT architectures, BigQuery, Dataform, and Dataproc.
Start Course for Free
Google CloudCloud
3 hr 41 min
42 videos
80 Exercises
4,350 XP
Statement of Accomplishment

Create Your Free Account

Continue with GoogleShow more options

or


By continuing, you accept our Terms of Use, our Privacy Policy and that your data is stored in the USA.

Loved by learners at thousands of companies

Group

Training a Team?

Try for Business

Course Description

This course introduces the data engineering role on Google Cloud. You'll learn about data sources, sinks, formats, and storage solutions, then explore replication, migration, and ETL/ELT architectures using BigQuery, Dataform, Dataproc, and Cloud Composer. The course includes hands-on labs with Datastream, BigLake, and Serverless Spark.

Prerequisites

There are no prerequisites for this course
1

Course Introduction

This section welcomes you to the Introduction to Data Engineering on Google Cloud course, and provides an overview of the course structure and goals.
Start Chapter
2

Data Engineering Tasks and Components

This module provides an introduction to the role of a data engineer. It covers key concepts such as data sources and sinks, data formats, storage options on Google Cloud, metadata management, and the use of Analytics Hub for data sharing within and outside an organization.
Start Chapter
3

Data Replication and Migration

This module provides an overview of data replication and migration on Google Cloud. It covers the basic architecture, the 'gcloud' command-line tool, Storage Transfer Service, Transfer Appliance, and Datastream, along with their functionalities and use cases.
Start Chapter
4

The Extract and Load Data Pipeline Pattern

This module focuses on data extraction and loading processes on Google Cloud, particularly with BigQuery. It covers the basic extraction and loading architecture, the bq command-line tool, BigQuery Data Transfer Service, and BigLake as an alternative to traditional extract-load patterns.
Start Chapter
5

The Extract, Load, and Transform Data Pipeline Pattern

This module provides an overview of ELT (extract, load, transform) processes on Google Cloud. It covers the basic ELT architecture, a common ELT pipeline example, BigQuery's capabilities for scripting and scheduling SQL, and the functionality and use cases of Dataform.
Start Chapter
7

Automation Techniques

This module focuses on automation patterns and options for pipelines on Google Cloud. It covers various tools and services like Cloud Scheduler, Workflows, Cloud Composer, Cloud Run functions, and Eventarc, along with their functionalities and use cases for automation.
Start Chapter
8

Course Summary

In this final section, we review what was presented in this course and discuss the next steps to continue your cloud learning journey.
Start Chapter
Introduction to Data Engineering on Google Cloud
Course
Complete

Earn Statement of Accomplishment

Add this credential to your LinkedIn profile, resume, or CV
Share it on social media and in your performance review
Enroll Now

Join over 19 million learners and start Introduction to Data Engineering on Google Cloud today!

Create Your Free Account

Continue with GoogleShow more options

or


By continuing, you accept our Terms of Use, our Privacy Policy and that your data is stored in the USA.

Grow your data skills with DataCamp for Mobile

Make progress on the go with our mobile courses and daily 5-minute coding challenges.