Перейти к основному содержимому

Course

Efficient AI Model Training with PyTorch

ПередовойУровень мастерства

Обновлено 04.2026

Learn how to reduce training times for large language models with Accelerator and Trainer for distributed training

Начать Курс Бесплатно

PythonArtificial Intelligence4 ч13 videos45 Exercises3,850 XPСвидетельство о достижениях

Создайте бесплатный аккаунт

или

Продолжая, вы принимаете наши Условия использования, нашу Политику конфиденциальности и подтверждаете, что ваши данные хранятся в США.

Пользуется популярностью среди обучающихся в тысячах компаний.

Обучение двух или более человек?

Попробуйте DataCamp for Business

Описание курса

Distributed training is an essential skill in large-scale machine learning, helping you to reduce the time required to train large language models with trillions of parameters. In this course, you will explore the tools, techniques, and strategies essential for efficient distributed training using PyTorch, Accelerator, and Trainer.

Preparing Data for Distributed Training

You'll begin by preparing data for distributed training by splitting datasets across multiple devices and deploying model copies to each device. You'll gain hands-on experience in preprocessing data for distributed environments, including images, audio, and text.

Exploring Efficiency Techniques

Once your data is ready, you'll explore ways to improve efficiency in training and optimizer use across multiple interfaces. You'll see how to address these challenges by improving memory usage, device communication, and computational efficiency with techniques like gradient accumulation, gradient checkpointing, local stochastic gradient descent, and mixed precision training. You'll understand the tradeoffs between different optimizers to help you decrease your model's memory footprint. By the end of this course, you'll be equipped with the knowledge and tools to build distributed AI-powered services.

Предварительные требования

Intermediate Deep Learning with PyTorch Working with Hugging Face

1

Data Preparation with Accelerator

You'll prepare data for distributed training by splitting the data across multiple devices and copying the model on each device. Accelerator provides a convenient interface for data preparation, and you'll learn how to preprocess images, audio, and text as a first step in distributed training.

Prepare models with AutoModel and Accelerator

Loading and inspecting pre-trained models

Automatic device placement with Accelerator

Preprocess images and audio for training

Preprocess image datasets

Preprocess audio datasets

Prepare datasets for distributed training

Preprocess text for training

Preprocess text with AutoTokenizer

Save and load the state of preprocessed text

Начало Главы

2

Distributed Training with Accelerator and Trainer

In distributed training, each device trains on its data in parallel. You'll investigate two methods for distributed training: Accelerator enables custom training loops, and Trainer simplifies the interface for training.

Fine-tune models with Trainer

Define evaluation metrics

Specify the TrainingArguments

Set up the Trainer

Train models with Accelerator

Prepare a model for distributed training

Training loops before and after Accelerator

Building a training loop with Accelerator

Evaluate models with Accelerator

Setting the model in evaluation mode

Logging evaluation metrics

Начало Главы

3

Improving Training Efficiency

Distributed training strains resources with large models and datasets, but you can address these challenges by improving memory usage, device communication, and computational efficiency. You'll discover the techniques of gradient accumulation, gradient checkpointing, local stochastic gradient descent, and mixed precision training.

Gradient accumulation

Gradient accumulation with Accelerator

Gradient accumulation with Trainer

Gradient checkpointing and local SGD

Gradient checkpointing with Accelerator

Gradient checkpointing with Trainer

Local SGD with Accelerator

Mixed precision training

Mixed precision training with basic PyTorch

Mixed precision training with Accelerator

Mixed precision training with Trainer

Начало Главы

4

Training with Efficient Optimizers

You'll focus on optimizers as levers to improve distributed training efficiency, highlighting tradeoffs between AdamW, Adafactor, and 8-bit Adam. Reducing the number of parameters or using low precision helps to decrease a model's memory footprint.

Balanced training with AdamW

AdamW with Trainer

AdamW with Accelerator

Compute the optimizer size

Memory-efficient training with Adafactor

Adafactor with Trainer

Adafactor with Accelerator

Mixed precision training with 8-bit Adam

Set up the 8-bit Adam optimizer

8-bit Adam with Trainer

8-bit Adam with Accelerator

Which optimizer is it?

Congratulations!

Начало Главы

Efficient AI Model Training with PyTorch

Курс
завершен

Получите свидетельство о достижениях

Добавьте эти данные в свой профиль LinkedIn, резюме или CV.
Поделитесь этим в социальных сетях и в своем отчете об оценке эффективности работы.Запишитесь Прямо Сейчас

Присоединяйтесь 19 миллионов учащихся и начните Efficient AI Model Training with PyTorch сегодня!

Создайте бесплатный аккаунт

или

Продолжая, вы принимаете наши Условия использования, нашу Политику конфиденциальности и подтверждаете, что ваши данные хранятся в США.

Развивайте свои навыки работы с данными с помощью DataCamp для мобильных устройств.

Успевайте в обучении на ходу с помощью наших мобильных курсов и ежедневных 5-минутных заданий по программированию.