Hoppa till huvudinnehållet
HemPython

Kurs

Efficient AI Model Training with PyTorch

AvanceradKunskapsnivå
Uppdaterad 2026-05
Learn how to reduce training times for large language models with Accelerator and Trainer for distributed training
Starta kursen gratis
PythonArtificial Intelligence
4 tim
13 videor
45 Övningar
3,850 XP
Intyg om genomförande

Skapa ditt kostnadsfria konto

Fortsätt med GoogleVisa fler alternativ

eller


Genom att fortsätta godkänner du våra Användarvillkor, vår Integritetspolicy och att dina uppgifter lagras i USA.

Omtyckt av lärande på tusentals företag

Group

Utbildar du ett team?

Prova för företag

Kursbeskrivning

Distributed training is an essential skill in large-scale machine learning, helping you to reduce the time required to train large language models with trillions of parameters. In this course, you will explore the tools, techniques, and strategies essential for efficient distributed training using PyTorch, Accelerator, and Trainer.

Preparing Data for Distributed Training

You'll begin by preparing data for distributed training by splitting datasets across multiple devices and deploying model copies to each device. You'll gain hands-on experience in preprocessing data for distributed environments, including images, audio, and text.

Exploring Efficiency Techniques

Once your data is ready, you'll explore ways to improve efficiency in training and optimizer use across multiple interfaces. You'll see how to address these challenges by improving memory usage, device communication, and computational efficiency with techniques like gradient accumulation, gradient checkpointing, local stochastic gradient descent, and mixed precision training. You'll understand the tradeoffs between different optimizers to help you decrease your model's memory footprint. By the end of this course, you'll be equipped with the knowledge and tools to build distributed AI-powered services.

Förkunskapskrav

Intermediate Deep Learning with PyTorchWorking with Hugging Face
1

Data Preparation with Accelerator

You'll prepare data for distributed training by splitting the data across multiple devices and copying the model on each device. Accelerator provides a convenient interface for data preparation, and you'll learn how to preprocess images, audio, and text as a first step in distributed training.
Starta kapitel
2

Distributed Training with Accelerator and Trainer

In distributed training, each device trains on its data in parallel. You'll investigate two methods for distributed training: Accelerator enables custom training loops, and Trainer simplifies the interface for training.
Starta kapitel
3

Improving Training Efficiency

Distributed training strains resources with large models and datasets, but you can address these challenges by improving memory usage, device communication, and computational efficiency. You'll discover the techniques of gradient accumulation, gradient checkpointing, local stochastic gradient descent, and mixed precision training.
Starta kapitel
4

Training with Efficient Optimizers

Efficient AI Model Training with PyTorch
Kurs
slutförd

Tjäna ett prestationsbevis

Lägg till det här beviset i din LinkedIn-profil, ditt CV eller din meritförteckning
Dela det i sociala medier och i din medarbetarutvärdering
Registrera dig nu

Gå med 19 miljoner lärande och börja Efficient AI Model Training with PyTorch idag!

Skapa ditt kostnadsfria konto

Fortsätt med GoogleVisa fler alternativ

eller


Genom att fortsätta godkänner du våra Användarvillkor, vår Integritetspolicy och att dina uppgifter lagras i USA.

Utveckla dina datakunskaper med DataCamp för mobilen

Gör framsteg när du är på språng med våra mobila kurser och dagliga 5-minuters kodningsutmaningar.