Transformer Models with PyTorch

高级技能水平

更新时间 2025年1月

What makes LLMs tick? Discover how transformers revolutionized text modeling and kickstarted the generative AI boom.

课程描述

Deep-Dive into the Transformer Architecture

Transformer models have revolutionized text modeling, kickstarting the generative AI boom by enabling today's large language models (LLMs). In this course, you'll look at the key components in this architecture, including positional encoding, attention mechanisms, and feed-forward sublayers. You'll code these components in a modular way to build your own transformer step-by-step.

Implement Attention Mechanisms with PyTorch

The attention mechanism is a key development that helped formalize the transformer architecture. Self-attention allows transformers to better identify relationships between tokens, which improves the quality of generated text. Learn how to create a multi-head attention mechanism class that will form a key building block in your transformer models.

Build Your Own Transformer Models

Learn to build encoder-only, decoder-only, and encoder-decoder transformer models. Learn how to choose and code these different transformer architectures for different language tasks, including text classification and sentiment analysis, text generation and completion, and sequence-to-sequence translation.

先决条件

Deep Learning for Text with PyTorch

The Building Blocks of Transformer Models

Discover what makes the hottest deep learning architecture in AI tick! Learn about the components that make up Transformer models, including the famous self-attention mechanisms described in the renowned paper "Attention is All You Need."

Transformers with PyTorch

课程描述

Deep-Dive into the Transformer Architecture

Implement Attention Mechanisms with PyTorch

Build Your Own Transformer Models

获得成就证明

加入超过.css-nklxlk{color:var(--wf-brand--main, #03EF62);}19百万学习者，今天就开始Transformer Models with PyTorch！

创建您的免费帐户

通过 DataCamp for Mobile 提升您的数据技能

加入超过19百万学习者，今天就开始Transformer Models with PyTorch！