Transformer Models with PyTorch

2 hr
1,900 XP

Course Description

Deep-Dive into the Transformer Architecture

Transformer models have revolutionized text modeling, kickstarting the generative AI boom by enabling today's large language models (LLMs). In this course, you'll look at the key components in this architecture, including positional encoding, attention mechanisms, and feed-forward sublayers. You'll code these components in a modular way to build your own transformer step-by-step.
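As a preview of one of these components, here is a minimal sketch of sinusoidal positional encoding, the scheme introduced in "Attention Is All You Need". The class name, default sizes, and layout below are illustrative assumptions, not the course's exact code.

```python
import math

import torch
import torch.nn as nn


class PositionalEncoding(nn.Module):
    """Sinusoidal positional encoding (illustrative names and defaults)."""

    def __init__(self, d_model: int, max_len: int = 512):
        super().__init__()
        position = torch.arange(max_len).unsqueeze(1)                      # (max_len, 1)
        div_term = torch.exp(torch.arange(0, d_model, 2) * (-math.log(10000.0) / d_model))
        pe = torch.zeros(max_len, d_model)
        pe[:, 0::2] = torch.sin(position * div_term)                       # even dimensions
        pe[:, 1::2] = torch.cos(position * div_term)                       # odd dimensions
        self.register_buffer("pe", pe.unsqueeze(0))                        # (1, max_len, d_model)

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        # x: (batch, seq_len, d_model) token embeddings; add the first seq_len encodings
        return x + self.pe[:, : x.size(1)]
```

Because the encodings are registered as a buffer rather than parameters, they move with the model between devices but are never updated during training.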

Implement Attention Mechanisms with PyTorch

The attention mechanism is the key innovation behind the transformer architecture. Self-attention lets transformers capture relationships between tokens, which improves the quality of generated text. Learn how to create a multi-head attention class that will form a key building block in your transformer models.
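To give a flavour of what such a class looks like, here is a compact multi-head self-attention module built around scaled dot-product attention. The class name, projection layout, and helper method are assumptions for illustration, not the course's exact implementation.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F


class MultiHeadAttention(nn.Module):
    """Compact multi-head self-attention (a sketch; the layout is an assumption)."""

    def __init__(self, d_model: int, num_heads: int):
        super().__init__()
        assert d_model % num_heads == 0, "d_model must be divisible by num_heads"
        self.num_heads = num_heads
        self.head_dim = d_model // num_heads
        self.q_proj = nn.Linear(d_model, d_model)
        self.k_proj = nn.Linear(d_model, d_model)
        self.v_proj = nn.Linear(d_model, d_model)
        self.out_proj = nn.Linear(d_model, d_model)

    def split_heads(self, x: torch.Tensor) -> torch.Tensor:
        # (batch, seq, d_model) -> (batch, num_heads, seq, head_dim)
        batch, seq, _ = x.shape
        return x.view(batch, seq, self.num_heads, self.head_dim).transpose(1, 2)

    def forward(self, x: torch.Tensor, mask: torch.Tensor = None) -> torch.Tensor:
        q = self.split_heads(self.q_proj(x))
        k = self.split_heads(self.k_proj(x))
        v = self.split_heads(self.v_proj(x))
        scores = q @ k.transpose(-2, -1) / self.head_dim ** 0.5   # scaled dot-product
        if mask is not None:
            scores = scores.masked_fill(mask == 0, float("-inf"))
        weights = F.softmax(scores, dim=-1)                       # attention weights per head
        context = weights @ v                                     # (batch, heads, seq, head_dim)
        batch, _, seq, _ = context.shape
        context = context.transpose(1, 2).contiguous().view(batch, seq, -1)
        return self.out_proj(context)
```

Splitting the model dimension across heads lets each head attend to different token relationships before the outputs are concatenated and projected back to `d_model`.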

Build Your Own Transformer Models

Learn to build encoder-only, decoder-only, and encoder-decoder transformer models, and how to choose and code the right architecture for different language tasks, including text classification and sentiment analysis, text generation and completion, and sequence-to-sequence translation.
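As one example of matching architecture to task, here is a rough encoder-only classifier built from PyTorch's built-in nn.TransformerEncoder. Positional encoding is omitted for brevity, and every name and size is illustrative rather than taken from the course.

```python
import torch
import torch.nn as nn


class EncoderOnlyClassifier(nn.Module):
    """Encoder-only transformer for text classification (all sizes illustrative)."""

    def __init__(self, vocab_size: int, d_model: int = 128, num_heads: int = 4,
                 num_layers: int = 2, num_classes: int = 2):
        super().__init__()
        self.embedding = nn.Embedding(vocab_size, d_model)
        layer = nn.TransformerEncoderLayer(d_model=d_model, nhead=num_heads, batch_first=True)
        self.encoder = nn.TransformerEncoder(layer, num_layers=num_layers)
        self.classifier = nn.Linear(d_model, num_classes)

    def forward(self, token_ids: torch.Tensor) -> torch.Tensor:
        x = self.embedding(token_ids)           # (batch, seq, d_model)
        x = self.encoder(x)                     # contextualized token representations
        return self.classifier(x.mean(dim=1))   # mean-pool over tokens, then classify


# Hypothetical usage: logits for a batch of 8 sequences of 16 token ids
logits = EncoderOnlyClassifier(vocab_size=10_000)(torch.randint(0, 10_000, (8, 16)))
```

A decoder-only model for text generation would instead apply a causal mask so each position can only attend to earlier tokens, and an encoder-decoder model would add cross-attention for sequence-to-sequence translation.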
1. The Building Blocks of Transformer Models

    Free

    Discover what makes the hottest deep learning architecture in AI tick! Learn about the components that make up Transformer models, including the famous self-attention mechanism described in the renowned paper "Attention Is All You Need."

    Transformers with PyTorch (50 XP)
    Breaking down the Transformer (50 XP)
    PyTorch Transformers (100 XP)
    Embedding and positional encoding (50 XP)
    Creating input embeddings (100 XP)
    Creating positional encodings (100 XP)
    Multi-head self-attention (50 XP)
    Implementing multi-head attention (100 XP)
    Starting the MultiHeadAttention class (100 XP)
    Adding methods to the MultiHeadAttention class (100 XP)
2. Building Transformer Architectures

    Free

    Design transformer encoder and decoder blocks, and combine them with positional encoding, multi-headed attention, and position-wise feed-forward networks to build your very own Transformer architectures. Along the way, you'll develop a deep understanding and appreciation for how transformers work under the hood. A minimal sketch of one such encoder block follows this chapter outline.

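The sketch below combines multi-head attention with a position-wise feed-forward network into a single encoder block, assuming the standard residual-plus-layer-norm layout. It uses PyTorch's built-in nn.MultiheadAttention, and the class name and sizes are illustrative, not the course's exact code.

```python
import torch
import torch.nn as nn


class EncoderBlock(nn.Module):
    """One encoder block: self-attention plus position-wise feed-forward,
    each wrapped in a residual connection and layer norm (an assumed layout)."""

    def __init__(self, d_model: int, num_heads: int, d_ff: int, dropout: float = 0.1):
        super().__init__()
        self.attention = nn.MultiheadAttention(d_model, num_heads, dropout=dropout,
                                               batch_first=True)
        self.feed_forward = nn.Sequential(
            nn.Linear(d_model, d_ff),
            nn.ReLU(),
            nn.Linear(d_ff, d_model),
        )
        self.norm1 = nn.LayerNorm(d_model)
        self.norm2 = nn.LayerNorm(d_model)
        self.dropout = nn.Dropout(dropout)

    def forward(self, x: torch.Tensor, padding_mask: torch.Tensor = None) -> torch.Tensor:
        attn_out, _ = self.attention(x, x, x, key_padding_mask=padding_mask)
        x = self.norm1(x + self.dropout(attn_out))               # residual around attention
        x = self.norm2(x + self.dropout(self.feed_forward(x)))   # residual around feed-forward
        return x
```

Stacking several of these blocks on top of embeddings and positional encoding yields a full encoder; a decoder block follows the same pattern with an added causal mask and cross-attention.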

Collaborators

Michał Oleszak
Jasmin Ludolf

Prerequisites

Deep Learning for Text with PyTorch
James Chapman

AI Curriculum Manager, DataCamp

James is a Curriculum Manager at DataCamp, where he collaborates with experts from industry and academia to create courses on AI, data science, and analytics. He has led nine DataCamp courses on diverse topics in Python, R, AI developer tooling, and Google Sheets. He has a Master's degree in Physics and Astronomy from Durham University, where he specialized in high-redshift quasar detection. In his spare time, he enjoys restoring retro toys and electronics.


Join over 17 million learners and start Transformer Models with PyTorch today!
