Hoppa till huvudinnehåll
HemPyTorch

course

Transformer Models with PyTorch

AvanceradFärdighetsnivå
Uppdaterad 2025-01
What makes LLMs tick? Discover how transformers revolutionized text modeling and kickstarted the generative AI boom.
Börja Kursen Gratis
PyTorchArtificial Intelligence2 timmar7 videos23 exercises1,900 XP7,113Uttalande om prestation

Skapa ditt gratiskonto

eller

Genom att fortsätta accepterar du våra Användarvillkor, vår Integritetspolicy och att dina uppgifter lagras i USA.

Älskad av elever på tusentals företag

Group

Utbilda 2 eller fler personer?

Testa DataCamp for Business

Kursbeskrivning

Deep-Dive into the Transformer Architecture

Transformer models have revolutionized text modeling, kickstarting the generative AI boom by enabling today's large language models (LLMs). In this course, you'll look at the key components in this architecture, including positional encoding, attention mechanisms, and feed-forward sublayers. You'll code these components in a modular way to build your own transformer step-by-step.

Implement Attention Mechanisms with PyTorch

The attention mechanism is a key development that helped formalize the transformer architecture. Self-attention allows transformers to better identify relationships between tokens, which improves the quality of generated text. Learn how to create a multi-head attention mechanism class that will form a key building block in your transformer models.

Build Your Own Transformer Models

Learn to build encoder-only, decoder-only, and encoder-decoder transformer models. Learn how to choose and code these different transformer architectures for different language tasks, including text classification and sentiment analysis, text generation and completion, and sequence-to-sequence translation.

Förkunskapskrav

Deep Learning for Text with PyTorch
1

The Building Blocks of Transformer Models

Discover what makes the hottest deep learning architecture in AI tick! Learn about the components that make up Transformer models, including the famous self-attention mechanisms described in the renowned paper "Attention is All You Need."
Starta Kapitel
2

Building Transformer Architectures

Design transformer encoder and decoder blocks, and combine them with positional encoding, multi-headed attention, and position-wise feed-forward networks to build your very own Transformer architectures. Along the way, you'll develop a deep understanding and appreciation for how transformers work under the hood.
Starta Kapitel
Transformer Models with PyTorch
Kursen
är

Få ett prestationsutlåtande

Lägg till denna inloggningsuppgifter i din LinkedIn-profil, ditt CV eller ditt CV
Dela det på sociala medier och i ditt prestationssamtal
Registrera Dig Nu

Gå med över 19 miljoner elever och börja Transformer Models with PyTorch idag!

Skapa ditt gratiskonto

eller

Genom att fortsätta accepterar du våra Användarvillkor, vår Integritetspolicy och att dina uppgifter lagras i USA.

Utveckla dina datakunskaper med DataCamp för mobilen

Gör framsteg när du är på språng med våra mobila kurser och dagliga 5-minuters kodningsutmaningar.