Pular para o conteúdo principal
Página inicialMachine LearningCI/CD for Machine Learning

CI/CD for Machine Learning

Elevate your Machine Learning Development with CI/CD using GitHub Actions and Data Version Control

Comece O Curso Gratuitamente
5 Horas15 Videos46 Exercicios
2.123 AprendizesTrophyDeclaração de Realização

Crie sua conta gratuita

GoogleLinkedInFacebook

ou

Ao continuar, você aceita nossos Termos de Uso, nossa Política de Privacidade e que seus dados são armazenados nos EUA.
GroupTreinar 2 ou mais pessoas?Experimente o DataCamp For Business

Amado por alunos de milhares de empresas


Descrição do Curso

The course will empower you to streamline your machine learning development processes, enhancing efficiency, reliability, and reproducibility in your projects. Throughout the course, you'll develop a comprehensive understanding of CI/CD workflows and YAML syntax, utilizing GitHub Actions (GA) for automation, training models in a pipeline, versioning datasets with DVC, performing hyperparameter tuning, and automating testing and pull requests.

Fundamentals of CI/CD, YAML, and Machine Learning

You'll be introduced to the fundamental concepts of CI/CD and YAML, and gain an understanding of the software development life cycle and key terms like build, test, and deploy. You'll define Continuous Integration, Continuous Delivery, and Continuous Deployment while examining their distinctions. You'll also explore the utility of CI/CD in machine learning and experimentation.

GitHub Actions for CI/CD Automation

You'll learn about GA, a powerful platform for implementing CI/CD workflows. You'll discover the various elements of GA, including events, actions, jobs, steps, runners, and context. You'll learn how to define workflows triggered by events such as push and pull requests and customize runner machines. You'll also gain practical experience by setting up basic CI pipelines and understanding the GA log.

Versioning Datasets with Data Version Control

You'll delve deep into Data Version Control (DVC) for versioning datasets, initializing DVC, and tracking datasets. Using DVC pipelines, you'll learn how to train classification models and generate metrics in a reproducible manner.

Optimizing Model Performance and Hyperparameter Tuning

You'll now focus on model performance analysis and hyperparameter tuning and gain practical skills in diffing metrics and plots across branches to compare changes in model performance. You'll learn how to download artifacts using GA and perform hyperparameter tuning using scikit-learn's GridSearchCV. Additionally, you'll explore automating pull requests with the best model configuration.
Para Empresas

GroupTreinar 2 ou mais pessoas?

Obtenha acesso à biblioteca completa do DataCamp, com relatórios, atribuições, projetos e muito mais centralizados
Experimente O DataCamp for BusinessPara uma solução sob medida , agende uma demonstração.

Nas seguintes faixas

Engenheiro de aprendizado de máquina

Ir para a trilha
  1. 1

    Introduction to Continuous Integration/Continuous Delivery and YAML

    Grátis

    In this chapter, you will explore the essential principles of Continuous Integration/Continuous Delivery (CI/CD) and YAML. You'll grasp the software development life cycle and key terms like build, test, and deploy. Discover the differences between Continuous Integration, Continuous Delivery, and Continuous Deployment. Moreover, you'll investigate the significance of CI/CD in machine learning and experimentation.

    Reproduzir Capítulo Agora
    Introduction to Continuous Integration/Continuous Delivery for Machine Learning
    50 xp
    Continuous deployment and delivery
    50 xp
    Machine learning workflow
    100 xp
    Introduction to YAML
    50 xp
    YAML syntax
    100 xp
    YAML mappings and sequences
    100 xp
    Introduction to GitHub Actions
    50 xp
    Utility of GitHub Actions
    100 xp
    Anatomy of GitHub Actions
    100 xp
  2. 2

    GitHub Actions

    Get ready to explore GitHub Actions (GHA), an influential platform for executing CI/CD workflows. Uncover the diverse components of GHA, encompassing events, actions, jobs, steps, runners, and context. Gain insights into crafting workflows that activate upon events like push and pull requests, and tailor runner machines. Dive into hands-on learning as you establish fundamental CI pipelines and grasp the intricacies of the GHA log.

    Reproduzir Capítulo Agora
  3. 3

    Continuous Integration in Machine Learning

    In this chapter, you'll explore the integration of machine learning model training into a GitHub Action pipeline using Continuous Machine Learning GitHub Action. You'll generate a comprehensive markdown report including model metrics and plots. You will also delve into data versioning in Machine Learning by adopting Data Version Control (DVC) to track data changes. The chapter also covers setting DVC remotes and dataset transfers. Finally, you'll explore DVC pipelines, configuring a DVC YAML file to orchestrate reproducible model training.

    Reproduzir Capítulo Agora
  4. 4

    Comparing training runs and Hyperparameter (HP) tuning

    In this chapter, you will direct your attention towards the analysis of model performance and the fine-tuning of hyperparameters. You will acquire practical expertise in comparing metrics and visualizations across different branches to assess changes in model performance. You will conduct hyperparameter tuning using scikit-learn's GridSearchCV. Furthermore, you will delve into the automation of pull requests using the optimal model configuration.

    Reproduzir Capítulo Agora
Para Empresas

GroupTreinar 2 ou mais pessoas?

Obtenha acesso à biblioteca completa do DataCamp, com relatórios, atribuições, projetos e muito mais centralizados

Nas seguintes faixas

Engenheiro de aprendizado de máquina

Ir para a trilha

Conjuntos De Dados

Weather

Colaboradores

Collaborator's avatar
George Boorman
Collaborator's avatar
Arne Warnke
Collaborator's avatar
Katerina Zahradova
Ravi Bhadauria HeadshotRavi Bhadauria

Senior Machine Learning Engineer

Veja Mais

O que os outros alunos têm a dizer?

Cadastre-se mais 13 milhões de alunos e comece CI/CD for Machine Learning Hoje!

Crie sua conta gratuita

GoogleLinkedInFacebook

ou

Ao continuar, você aceita nossos Termos de Uso, nossa Política de Privacidade e que seus dados são armazenados nos EUA.