Профессиональный инженер данных на Python

Обновлено 05.2026

Освойте передовые навыки и современные инструменты, которые сегодня меняют роль data engineer, с нашим треком Professional Data Engineer.

Создать бесплатный аккаунт

Продолжить через Google

Показать больше вариантов

или

Продолжая, вы принимаете наши Условия использования, нашу Политику конфиденциальности и соглашаетесь с тем, что ваши данные хранятся в США.

Описание трека

Профессиональный инженер данных на Python

Поднимите свои навыки на новый уровень с нашим треком Professional Data Engineer. Этот продвинутый трек предназначен для развития на основе треков Associate Data Engineer in SQL и Data Engineer in Python. Он вооружает вас передовыми знаниями и инструментами, востребованными в современных ролях data engineering. На протяжении этого пути вы освоите современные архитектуры данных, улучшите навыки Python, углубившись в объектно-ориентированное программирование, изучите базы данных NoSQL и научитесь использовать dbt для бесшовной трансформации данных. Откройте секреты DevOps с помощью основных практик, продвинутых методов тестирования и таких инструментов, как Docker, чтобы оптимизировать процессы разработки и развертывания. Погрузитесь в технологии больших данных с PySpark и овладейте обработкой данных и автоматизацией с помощью shell scripting. Применяйте знания на практике в проектах и работайте с реальными наборами данных, чтобы оттачивать навыки, отлаживать сложные рабочие процессы и оптимизировать процессы обработки данных. Пройдя этот трек, вы не только освоите продвинутые навыки, необходимые для решения сложных задач data engineering, но и обретёте уверенность в их применении в динамичном мире data engineering.

Необходимые условия

Инженер данных

Course
1
Understanding Modern Data Architecture
Discover modern data architecture's key components, from ingestion and serving to governance and orchestration.
Course
2
Introduction to Shell
The Unix command line helps users combine existing programs in new ways, automate repetitive tasks, and run programs on clusters and clouds.
Course
3
Containerization and Virtualization Concepts
Learn the essentials of VMs, containers, Docker, and Kubernetes. Understand the differences to get started!
Course
4
Introduction to dbt
This course introduces dbt for data modeling, transformations, testing, and building documentation.
Course
5
Introduction to Object-Oriented Programming in Python
Discover the fundamental concepts of object-oriented programming (OOP), building custom classes and objects!
Course
6
Introduction to NoSQL
Conquer NoSQL and supercharge data workflows. Learn Snowflake to work with big data, Postgres JSON for handling document data, and Redis for key-value data.
Course
7
DevOps Concepts
In this Introduction to DevOps, you’ll master the DevOps basics and learn the key concepts, tools, and techniques to improve productivity.
Course
8
Introduction to Testing in Python
Master Python testing: Learn methods, create checks, and ensure error-free code with pytest and unittest.
Project
бонус
Debugging Code
Sharpen your debugging skills to enhance sales data accuracy.
Course
10
Introduction to Docker
Gain an introduction to Docker and discover its importance in the data professional’s toolkit. Learn about Docker containers, images, and more.
Course
11
Introduction to PySpark
Master PySpark to handle big data with ease—learn to process, query, and optimize massive datasets for powerful analytics!
Chapter
бонус
Introduction to Big Data analysis with Spark
This chapter introduces the exciting world of Big Data, as well as the various concepts and different frameworks for processing Big Data. You will understand why Apache Spark is considered the best framework for BigData.
Chapter
бонус
Programming in PySpark RDD’s
The main abstraction Spark provides is a resilient distributed dataset (RDD), which is the fundamental and backbone data type of this engine. This chapter introduces RDDs and shows how RDDs can be created and executed using RDD Transformations and Actions.
Chapter
бонус
PySpark SQL & DataFrames
In this chapter, you'll learn about Spark SQL which is a Spark module for structured data processing. It provides a programming abstraction called DataFrames and can also act as a distributed SQL query engine. This chapter shows how Spark SQL allows you to use DataFrames in Python.
Project
бонус
Cleaning an Orders Dataset with PySpark
Step into a data engineer's shoes and master data cleaning with PySpark on an e-commerce orders dataset!
Chapter
бонус
Downloading Data on the Command Line
In this chapter, we learn how to download data files from web servers via the command line. In the process, we also learn about documentation manuals, option flags, and multi-file processing.
Chapter
бонус
Data Pipeline on the Command Line
In the last chapter, we bridge the connection between command line and other data science languages and learn how they can work together. Using Python as a case study, we learn to execute Python on the command line, to install dependencies using the package manager pip, and to build an entire model pipeline using the command line.
Course
18
Streaming Concepts
Learn about the difference between batching and streaming, scaling streaming systems, and real-world applications.
Course
19
Introduction to Apache Kafka
Master Apache Kafka! From core concepts to advanced architecture, learn to create, manage, and troubleshoot Kafka for real-world data streaming challenges!
Course
20
Introduction to Kubernetes
In this course, you will learn the fundamentals of Kubernetes and deploy and orchestrate containers using Manifests and kubectl instructions.
Resource
бонус
Impactful Data Engineering—with Datadog's Wouter de Bie
Understand how data engineering can impact your business.

Профессиональный инженер данных на Python

13 Курсов

Трек
завершён

Получить сертификат об окончании

Добавьте эту квалификацию в профиль LinkedIn, резюме или CV
Поделитесь в социальных сетях и в обзоре эффективностиЗаписаться сейчас

Для бизнеса

Обучаете 2 и более человек?

Откройте вашей команде доступ ко всей платформе DataCamp, включая все функции.

инструкторов

Filip Schouwenaars

Data Science Instructor at DataCamp

Присоединяйтесь к более чем 19 миллионам обучающихся и начните Профессиональный инженер данных на Python уже сегодня!

Создать бесплатный аккаунт

Продолжить через Google Показать больше вариантов

или

Профессиональный инженер данных на Python

Обучаете команду?

Описание трека

Профессиональный инженер данных на Python

Необходимые условия

Understanding Modern Data Architecture

Introduction to Shell

Containerization and Virtualization Concepts

Introduction to dbt

Introduction to Object-Oriented Programming in Python

Introduction to NoSQL

DevOps Concepts

Introduction to Testing in Python

Debugging Code

Introduction to Docker

Introduction to PySpark

Introduction to Big Data analysis with Spark

Programming in PySpark RDD’s

PySpark SQL & DataFrames

Cleaning an Orders Dataset with PySpark

Downloading Data on the Command Line

Data Pipeline on the Command Line

Streaming Concepts

Introduction to Apache Kafka

Introduction to Kubernetes

Impactful Data Engineering—with Datadog's Wouter de Bie

Получить сертификат об окончании

Присоединяйтесь к более чем 19 миллионам обучающихся и начните Профессиональный инженер данных на Python уже сегодня!

Развивайте свои навыки работы с данными с помощью DataCamp для мобильных устройств.

Описание трека

Профессиональный инженер данных на Python

Получить сертификат об окончании

Присоединяйтесь к более чем .css-nklxlk{color:var(--wf-brand--main, #03EF62);}19 миллионам обучающихся и начните Профессиональный инженер данных на Python уже сегодня!

Создать бесплатный аккаунт

Развивайте свои навыки работы с данными с помощью DataCamp для мобильных устройств.

Присоединяйтесь к более чем 19 миллионам обучающихся и начните Профессиональный инженер данных на Python уже сегодня!