Course

CI/CD for Machine Learning

Advanced
Updated 03/2025
Elevate your Machine Learning Development with CI/CD using GitHub Actions and Data Version Control
Included with Premium or Teams

Shell | Machine Learning | 5 hours | 15 videos | 46 exercises | 3,500 XP | 4,461 | Statement of Accomplishment

Course Description

The course will empower you to streamline your machine learning development processes, enhancing efficiency, reliability, and reproducibility in your projects. Throughout the course, you'll develop a comprehensive understanding of CI/CD workflows and YAML syntax, utilizing GitHub Actions (GA) for automation, training models in a pipeline, versioning datasets with DVC, performing hyperparameter tuning, and automating testing and pull requests.

Fundamentals of CI/CD, YAML, and Machine Learning

You'll be introduced to the fundamental concepts of CI/CD and YAML, and gain an understanding of the software development life cycle and key terms like build, test, and deploy. You'll define Continuous Integration, Continuous Delivery, and Continuous Deployment while examining their distinctions. You'll also explore the utility of CI/CD in machine learning and experimentation.

GitHub Actions for CI/CD Automation

You'll learn about GA, a powerful platform for implementing CI/CD workflows. You'll discover the various elements of GA, including events, actions, jobs, steps, runners, and contexts. You'll learn how to define workflows triggered by events such as pushes and pull requests, and how to customize runner machines. You'll also gain practical experience by setting up basic CI pipelines and reading the GA log.

Versioning Datasets with Data Version Control

You'll use Data Version Control (DVC) to version datasets, from initializing DVC in a repository to tracking dataset files. Using DVC pipelines, you'll learn how to train classification models and generate metrics in a reproducible manner.
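To make the idea of dataset tracking concrete, here is a minimal sketch of content hashing, the mechanism DVC relies on when it records an MD5 checksum for a tracked file. This is not DVC's implementation; the helper names `file_md5` and `dataset_changed` and the file name `data.csv` are hypothetical and chosen only for illustration.

```python
import hashlib
import tempfile
from pathlib import Path


def file_md5(path, chunk_size=1 << 20):
    """Return the hex MD5 digest of a file, reading it in chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        # Read fixed-size chunks so large datasets are never loaded whole.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def dataset_changed(path, recorded_hash):
    """Report whether the file content differs from a recorded hash."""
    return file_md5(path) != recorded_hash


# Illustration with a throwaway file (hypothetical dataset name).
with tempfile.TemporaryDirectory() as tmp:
    dataset = Path(tmp) / "data.csv"
    dataset.write_bytes(b"feature,label\n1,0\n")
    recorded = file_md5(dataset)                          # hash at "tracking" time
    unchanged = not dataset_changed(dataset, recorded)    # same content, same hash
    dataset.write_bytes(b"feature,label\n1,1\n")
    changed = dataset_changed(dataset, recorded)          # content edited, hash differs
```

Because the hash depends only on file content, a small text record of the hash can be committed to Git while the dataset itself lives elsewhere.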

Optimizing Model Performance and Hyperparameter Tuning

You'll then focus on model performance analysis and hyperparameter tuning. You'll gain practical skills in diffing metrics and plots across branches to compare changes in model performance, learn how to download artifacts using GA, and perform hyperparameter tuning using scikit-learn's GridSearchCV. Finally, you'll explore automating pull requests with the best model configuration.
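The core idea behind an exhaustive grid search can be sketched in plain Python. This is a simplified stand-in, not scikit-learn's GridSearchCV: it enumerates every parameter combination and keeps the best score, but it does no cross-validation and trains no model. The names `parameter_grid`, `grid_search`, and `toy_score`, as well as the parameter values, are hypothetical.

```python
from itertools import product


def parameter_grid(param_grid):
    """Yield one dict per combination of the values in param_grid."""
    names = sorted(param_grid)
    for values in product(*(param_grid[name] for name in names)):
        yield dict(zip(names, values))


def grid_search(score_fn, param_grid):
    """Return (best_params, best_score), maximizing score_fn over the grid."""
    best_params, best_score = None, float("-inf")
    for params in parameter_grid(param_grid):
        score = score_fn(params)
        if score > best_score:
            best_params, best_score = params, score
    return best_params, best_score


def toy_score(params):
    """Hypothetical stand-in for 'train a model and return a validation score'."""
    depth_penalty = (params["max_depth"] - 5) ** 2
    size_penalty = (params["n_estimators"] - 100) ** 2 / 1000
    return -depth_penalty - size_penalty


grid = {"max_depth": [2, 5, 10], "n_estimators": [50, 100, 200]}
best_params, best_score = grid_search(toy_score, grid)
```

In a real pipeline the scoring function would fit a model and evaluate it on held-out data, and the winning configuration is what an automated pull request would propose.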

Prerequisites

MLOps Concepts
Supervised Learning with scikit-learn
Intermediate Git

1. Introduction to Continuous Integration/Continuous Delivery and YAML

2. GitHub Actions

3. Continuous Integration in Machine Learning

4. Comparing training runs and Hyperparameter (HP) tuning