Sariți la conținutul principal
AcasăPython

Curs

Parallel Programming with Dask in Python

IntermediarNivel de competențe
Actualizat 04.2024
Learn how to use Python parallel programming with Dask to upscale your workflows and efficiently handle big data.
Începe cursul gratuit
PythonProgramming
4 h
15 videoclipuri
51 Exerciții
4,150 XP
4,892
Certificat de realizare

Creează-ți contul gratuit

Continuă cu GoogleArată mai multe opțiuni

sau


Continuând, accepți Termenii de utilizare, Politica de confidențialitate și faptul că datele tale sunt stocate în SUA.

Îndrăgit de cursanți din mii de companii

Group

Formare pentru o echipă?

Încearcă pentru afaceri

Descrierea cursului

Use Parallel Processing to Speed Up Your Python Code

With this 4-hour course, you’ll discover how parallel processing with Dask in Python can make your workflows faster.

When working with big data, you’ll face two common obstacles: using too much memory and long runtimes. The Dask library can lower your memory use by loading chunks of data only when needed. It can lower runtimes by using all your available computing cores in parallel. Best of all, it requires very few changes to your existing Python code.

Analyze Big Structured Data Using Dask DataFrames

In this course, you use Dask to analyze Spotify song data, process images of sign language gestures, calculate trends in weather data, analyze audio recordings, and train machine learning models on big data.

You’ll start by learning the basics of Dask, exploring how parallel processing in Python can speed up almost any code. Next, you’ll explore Dask DataFrames and arrays and how to use them to analyze big structured data.

Train machine learning models using Dask-ML

As you progress through the 51 exercises in this course, you’ll learn how to process any type of data, using Dask bags to work with unstructured and structured data. Finally, you’ll learn how to use Dask in Python to train machine learning models and improve your computing speeds.

Cerințe prealabile

Data Manipulation with pandasPython Toolbox
1

Lazy Evaluation and Parallel Computing

This chapter will teach you the basics of Dask and lazy evaluation. At the end of this chapter, you'll be able to speed up almost any Python code by using parallel processing or multi-threading. You'll learn the difference between these two task scheduling methods and which one is better under which circumstances.
Începe capitolul
2

Parallel Processing of Big, Structured Data

Here you’ll learn how to analyze big structured data using Dask arrays and Dask DataFrames. You'll learn how everything you know about NumPy and pandas can easily be applied to data that is too large to fit in memory.
Începe capitolul
4

Dask Machine Learning and Final Pieces

Harness the power of Dask to train machine learning models. You'll learn how to train machine learning models on big data using the Dask-ML package, and how to split Dask calculations across a mixture of processes and threads for even greater computing speed.
Începe capitolul
Parallel Programming with Dask in Python
Curs
finalizat

Obține diploma de absolvire

Adaugă această acreditare la profilul tău LinkedIn, CV sau rezumat
Distribuie pe rețelele de socializare și în evaluarea ta de performanță
Înscrie-te acum

Alătură-te celor peste 19 de milioane de cursanți și începe Parallel Programming with Dask in Python astăzi!

Creează-ți contul gratuit

Continuă cu GoogleArată mai multe opțiuni

sau


Continuând, accepți Termenii de utilizare, Politica de confidențialitate și faptul că datele tale sunt stocate în SUA.

Dezvoltați-vă abilitățile de gestionare a datelor cu DataCamp pentru mobil

Fă progrese din mers cu cursurile noastre mobile și provocările zilnice de programare de 5 minute.