Program
Insinyur Data Profesional dalam Python
Termasuk denganPremium or Team
Buat Akun Gratis Anda
atau
Dengan melanjutkan, Anda menerima Ketentuan Penggunaan kami, Kebijakan Privasi kami dan bahwa data Anda disimpan di Amerika Serikat.Dipercaya oleh para pelajar di ribuan perusahaan
Pelatihan untuk 2 orang atau lebih?
Coba DataCamp for BusinessDeskripsi Track
Insinyur Data Profesional dalam Python
Persyaratan
Data EngineerCourse
Discover modern data architecture's key components, from ingestion and serving to governance and orchestration.
Course
The Unix command line helps users combine existing programs in new ways, automate repetitive tasks, and run programs on clusters and clouds.
Course
Learn the essentials of VMs, containers, Docker, and Kubernetes. Understand the differences to get started!
Course
Kursus ini memperkenalkan dbt untuk pemodelan data, transformasi, pengujian, dan pembuatan dokumentasi.
Course
Pelajari konsep dasar pemrograman berorientasi objek (OOP), membuat kelas dan objek kustom!
Course
Course
Dalam Pengenalan DevOps ini, Anda akan menguasai dasar-dasar DevOps dan mempelajari konsep-konsep kunci, alat, dan teknik untuk meningkatkan produktivitas.
Course
Project
bonusDebugging Code
Sharpen your debugging skills to enhance sales data accuracy.
Course
Dapatkan pengenalan tentang Docker dan temukan pentingnya dalam kotak alat profesional data. Pelajari tentang kontainer Docker, gambar, dan lainnya.
Course
Master PySpark to handle big data with ease—learn to process, query, and optimize massive datasets for powerful analytics!
Chapter
This chapter introduces the exciting world of Big Data, as well as the various concepts and different frameworks for processing Big Data. You will understand why Apache Spark is considered the best framework for BigData.
Chapter
The main abstraction Spark provides is a resilient distributed dataset (RDD), which is the fundamental and backbone data type of this engine. This chapter introduces RDDs and shows how RDDs can be created and executed using RDD Transformations and Actions.
Chapter
In this chapter, you'll learn about Spark SQL which is a Spark module for structured data processing. It provides a programming abstraction called DataFrames and can also act as a distributed SQL query engine. This chapter shows how Spark SQL allows you to use DataFrames in Python.
Project
Step into a data engineer's shoes and master data cleaning with PySpark on an e-commerce orders dataset!
Chapter
In this chapter, we learn how to download data files from web servers via the command line. In the process, we also learn about documentation manuals, option flags, and multi-file processing.
Chapter
In the last chapter, we bridge the connection between command line and other data science languages and learn how they can work together. Using Python as a case study, we learn to execute Python on the command line, to install dependencies using the package manager pip, and to build an entire model pipeline using the command line.
Course
Pelajari perbedaan antara batching dan streaming, skalabilitas sistem streaming, dan penerapan di dunia nyata.
Course
Course
Resource
Understand how data engineering can impact your business.
Selesai
Memperoleh Surat Keterangan Prestasi
Tambahkan kredensial ini ke profil LinkedIn, resume, atau CV AndaBagikan di media sosial dan dalam penilaian kinerja Anda
Termasuk denganPremium or Team
Daftar SekarangBergabung dengan 19 juta pelajar dan mulai Insinyur Data Profesional dalam Python Hari Ini!
Buat Akun Gratis Anda
atau
Dengan melanjutkan, Anda menerima Ketentuan Penggunaan kami, Kebijakan Privasi kami dan bahwa data Anda disimpan di Amerika Serikat.