Skip to main content
HomePython

Course

Parallel Programming with Dask in Python

IntermediateSkill Level
4.8+
60 reviews
Updated 04/2024
Learn how to use Python parallel programming with Dask to upscale your workflows and efficiently handle big data.
Start Course for Free
PythonProgramming4 hr15 videos51 Exercises4,150 XP4,843Statement of Accomplishment

Create Your Free Account

or

By continuing, you accept our Terms of Use, our Privacy Policy and that your data is stored in the USA.

Loved by learners at thousands of companies

Group

Training 2 or more people?

Try DataCamp for Business

Course Description

Use Parallel Processing to Speed Up Your Python Code

With this 4-hour course, you’ll discover how parallel processing with Dask in Python can make your workflows faster.

When working with big data, you’ll face two common obstacles: using too much memory and long runtimes. The Dask library can lower your memory use by loading chunks of data only when needed. It can lower runtimes by using all your available computing cores in parallel. Best of all, it requires very few changes to your existing Python code.

Analyze Big Structured Data Using Dask DataFrames

In this course, you use Dask to analyze Spotify song data, process images of sign language gestures, calculate trends in weather data, analyze audio recordings, and train machine learning models on big data.

You’ll start by learning the basics of Dask, exploring how parallel processing in Python can speed up almost any code. Next, you’ll explore Dask DataFrames and arrays and how to use them to analyze big structured data.

Train machine learning models using Dask-ML

As you progress through the 51 exercises in this course, you’ll learn how to process any type of data, using Dask bags to work with unstructured and structured data. Finally, you’ll learn how to use Dask in Python to train machine learning models and improve your computing speeds.

Prerequisites

Data Manipulation with pandasPython Toolbox
1

Lazy Evaluation and Parallel Computing

This chapter will teach you the basics of Dask and lazy evaluation. At the end of this chapter, you'll be able to speed up almost any Python code by using parallel processing or multi-threading. You'll learn the difference between these two task scheduling methods and which one is better under which circumstances.
Start Chapter
2

Parallel Processing of Big, Structured Data

3

Dask Bags for Unstructured Data

4

Dask Machine Learning and Final Pieces

Harness the power of Dask to train machine learning models. You'll learn how to train machine learning models on big data using the Dask-ML package, and how to split Dask calculations across a mixture of processes and threads for even greater computing speed.
Start Chapter
Parallel Programming with Dask in Python
Course
Complete

Earn Statement of Accomplishment

Add this credential to your LinkedIn profile, resume, or CV
Share it on social media and in your performance review
Enroll Now

Don’t just take our word for it

*4.8
from 60 reviews
82%
18%
0%
0%
0%
  • Mohmed
    last week

  • Akram
    last week

  • Patience Divine Aimee
    3 weeks ago

  • Andrew
    3 weeks ago

  • Andrew
    6 weeks ago

  • Tung
    last month

    .

Mohmed

Patience Divine Aimee

Andrew

FAQs

Is Python good for parallel processing?

Yes, multiprocessing in Python allows your computer to utilize multiple cores of a CPU to run tasks/processes in parallel, helping to speed up your code.

What is Dask in Python used for?

Dask is a Python library used for parallel processing. It can help you scale your data science or machine learning workflows by improving the performance and speed of Python script execution.

Is Dask faster than pandas?

Yes, tasks in Dask generally execute faster than in pandas.

Is Dask faster than NumPy?

Yes, because of parallelization, tasks in Dask are generally faster than in NumPy.

Join over 19 million learners and start Parallel Programming with Dask in Python today!

Create Your Free Account

or

By continuing, you accept our Terms of Use, our Privacy Policy and that your data is stored in the USA.

Grow your data skills with DataCamp for Mobile

Make progress on the go with our mobile courses and daily 5-minute coding challenges.