Google DeepMind: Build Your Own Small Language Model

Intermediate skill level

Updated April 2026

In this Google DeepMind course, you will learn the fundamentals of language models and gain a high-level understanding of the machine learning development pipeline.

Google Cloud

6 hours

39 exercises

1,950 XP

Statement of Accomplishment

Course Description

In this Google DeepMind course, you will learn the fundamentals of language models and gain a high-level understanding of the machine learning development pipeline. You will consider the strengths and limitations of traditional n-gram models and advanced transformer models. Practical coding labs will enable you to develop insights into how machine learning models work and how they can be used to generate text and identify patterns in language. Through real-world case studies, you will build an understanding of how research engineers operate. Drawing on these insights, you will identify problems that you wish to tackle in your own community and consider how to leverage the power of machine learning responsibly to address these problems within a global and local context.

Prerequisites

There are no prerequisites for this course.

1

Introduction to the language modeling problem

In this module, you will explore the power of language models and their real-world applications. Starting with a manual method for modelling language, you will investigate the role that probabilities and randomness play in next word prediction. You will also consider the course learning objectives and how to most effectively study.
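As a rough illustration of the role that probabilities and randomness play in next word prediction, the sketch below samples a next word from a hand-written probability distribution. The prompt, the candidate words, the probabilities, and the function name `sample_next_word` are all made up for this example and are not taken from the course labs.

```python
import random

# Hypothetical, hand-written distribution over the word that follows
# "the cat sat on the". The probabilities are invented and sum to 1.
next_word_probs = {"mat": 0.5, "sofa": 0.3, "roof": 0.15, "moon": 0.05}


def sample_next_word(probs, rng):
    """Draw one word at random, weighted by its probability."""
    words = list(probs)
    weights = [probs[w] for w in words]
    return rng.choices(words, weights=weights, k=1)[0]


# A seeded generator keeps the run repeatable.
rng = random.Random(0)
samples = [sample_next_word(next_word_probs, rng) for _ in range(1000)]

# Count how often each candidate word was drawn.
counts = {w: samples.count(w) for w in next_word_probs}
print(counts)
```

Because the choice is random, a likely word such as "mat" is drawn most often, but less likely words still appear from time to time.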

The power of language models

50 XP

Predict the next word

50 XP

Learning objectives

50 XP

How to get the most out of this course

50 XP

The role of probabilities in language models

50 XP

Lab: Create Your Own Probability Distribution

50 XP

Reflect on your findings

50 XP

Quiz 1 - Question 1

50 XP

Quiz 1 - Question 2

50 XP

2

From n-grams to transformers

In this module, you will move beyond the manual method and explore how n-grams can be used to tokenize data. You will investigate how probabilities can be calculated to begin identifying language patterns. You will then build your own n-gram model using a small dataset and examine its limitations. Furthermore, you will consider the process researchers undertake when approaching real-world problems through the lens of Google DeepMind’s AlphaFold project. Finally, you will reflect on your own values and those of your community, as well as the role AI systems play in making decisions that involve ethical choices.
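To make the idea of calculating probabilities from n-grams concrete, here is a minimal sketch of a bigram (2-gram) model. The toy corpus, the whitespace splitting, and the function name `next_word_probs` are assumptions for illustration only; the course lab may use a different dataset and approach.

```python
from collections import Counter, defaultdict

# Toy corpus, invented for this example.
corpus = "the cat sat on the mat . the dog sat on the rug ."
tokens = corpus.split()  # naive whitespace tokenization (an assumption)

# For every word, count which words follow it.
bigram_counts = defaultdict(Counter)
for prev, nxt in zip(tokens, tokens[1:]):
    bigram_counts[prev][nxt] += 1


def next_word_probs(prev):
    """Estimate P(next word | prev) from relative bigram counts."""
    counts = bigram_counts.get(prev, Counter())
    total = sum(counts.values())
    return {word: count / total for word, count in counts.items()}


print(next_word_probs("the"))
print(next_word_probs("sat"))
print(next_word_probs("bird"))
```

The last call returns an empty dictionary because "bird" never appears in the corpus. This is one limitation of n-gram models: they cannot say anything about contexts they have not seen.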

50 XP

Lab: Experiment with N-grams

50 XP

The limitations of n-grams

50 XP

AlphaFold: The power of machine learning

50 XP

Weighing values: Culture and ethics in the trolley problem

50 XP

Applying a local ethical lens to the trolley problem

50 XP

Quiz 2 - Question 1

50 XP

Quiz 2 - Question 2

50 XP

3

Transformer models

In this module, you will experiment with more sophisticated transformer models and evaluate how they perform in comparison to n-gram models. You will take a deeper dive into the anatomy of language models and their core components. You will continue reflecting on the role that values play in guiding which technical problems you choose to solve. Specifically, you will consider the Ubuntu moral system and compare its characteristics with moral values popular in Europe and North America. Finally, you will design a values framework for guiding LLM development in your local community.

Lab: Compare N-Gram Models and Transformer Language Models

50 XP

Core aspects of Ubuntu

50 XP

Develop a local values framework

50 XP

Anatomy of a language model

50 XP

What does it mean to train a model?

50 XP

Quiz 3 - Question 1

50 XP

Quiz 3 - Question 2

50 XP

4

Training a model

In this module, you will contextualise the process of building language models within the machine learning development pipeline. You will preprocess your dataset and learn how to prepare a dataset to be used for training a transformer model. You will then train your own language model and evaluate its performance.
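One common way to prepare text for training a language model is to map it to integer IDs and cut it into input and target sequences, where each target is the input shifted by one position. The sketch below assumes character-level tokenization and a tiny example string; the names `stoi`, `itos`, and `make_examples` are hypothetical, and the course labs may use a different tokenizer and data format.

```python
# Tiny example text, invented for this sketch.
text = "hello world"

# Build a character-level vocabulary (an assumption, not the course's tokenizer).
vocab = sorted(set(text))
stoi = {ch: i for i, ch in enumerate(vocab)}  # character -> integer ID
itos = {i: ch for ch, i in stoi.items()}      # integer ID -> character

# Encode the whole text as a list of integer IDs.
ids = [stoi[ch] for ch in text]


def make_examples(ids, context_len):
    """Slice the ID sequence into (input, target) pairs.

    Each target is its input shifted one position to the right, so the
    model is asked to predict the next token at every position.
    """
    examples = []
    for start in range(len(ids) - context_len):
        x = ids[start:start + context_len]
        y = ids[start + 1:start + context_len + 1]
        examples.append((x, y))
    return examples


examples = make_examples(ids, context_len=4)
for x, y in examples[:2]:
    # Decode back to text to show the one-character shift.
    print("".join(itos[i] for i in x), "->", "".join(itos[i] for i in y))
```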

Machine learning development pipeline

50 XP

Lab: Prepare the Dataset for Training an SLM

50 XP

Lab: Train Your Own Small Language Model (SLM)

50 XP

Evaluating a model

50 XP

Quiz 4 - Question 1

50 XP

Quiz 4 - Question 2

50 XP

5

Challenge

In this module, you will consider the specific benefits that transformer LLMs can bring about for different sectors in your local context. You will then explore what makes a good problem statement before developing your own problem statement for a challenge around language models that you have identified in your community.

Anticipating benefits

50 XP

Challenge: Develop your problem statement

50 XP

Quiz 5 - Question 1

50 XP

Quiz 5 - Question 2

50 XP

6

Continue your journey

In this module, you will have the opportunity to consult additional resources and further reading to investigate the topics you have covered in more detail. Finally, you will consider your next steps and how you can build on what you have learned in the course.

50 XP

Looking forward

50 XP

Additional resources and further reading

50 XP

50 XP

50 XP
