
Course Description
This course dives deep into Git's advanced features and is geared toward data engineering and data science workflows. You'll master complex merging strategies, learn to manipulate repository history, and optimize Git for large-scale data projects. Key topics include advanced rebasing, git reflog for disaster recovery, efficient debugging with git bisect, and managing large files with Git LFS. You'll also explore parallel development using worktrees and learn how to modularize project structures with submodules. By the end of this course, you'll have the skills to handle complex version control scenarios and issues in data pipeline development and collaborative data projects.
Training 2 or more people?
Get your team access to the full DataCamp platform, including all the features.
- 1
Advanced Merging Strategies
In this chapter, you will learn practical Git merging techniques for managing complex data engineering projects. You'll explore different merge strategies and understand how to integrate code changes while maintaining a clean project history. By the end, you'll know how to handle various merge scenarios and reorganize commit histories.
- 2
Git History and Exploration
In this chapter, you will develop skills for investigating and managing your project's Git history. You'll learn techniques for selectively applying changes, identifying and fixing bugs, and managing sensitive information in your repository. These tools will help you maintain clean, traceable code in data engineering workflows.
Cherry-Picking (50 XP)
Cherry-Pick single commit (100 XP)
Handling cherry-pick abort (100 XP)
Bisect (50 XP)
Bisect Initialization (100 XP)
Bisect Search (100 XP)
Git Filter Repo (50 XP)
Filtering sensitive data (50 XP)
Remove secrets file from repo (100 XP)
Git reflog (50 XP)
Finding deleted branches (50 XP)
Undoing a commit (100 XP)
Lost commit (100 XP)
- 3
Advanced Repository Management
In this chapter, you will explore advanced Git techniques for managing complex software projects. You'll learn how to work on multiple features simultaneously, organize code dependencies, handle large files, and implement efficient development workflows. These skills are essential for managing modern data engineering and software development projects.
Git Worktrees (50 XP)
Create a worktree (100 XP)
Remove a worktree (100 XP)
Git Submodules (50 XP)
Adding a Git submodule (100 XP)
Show all submodules (100 XP)
Git Large File Storage (50 XP)
Setting Up Git LFS (100 XP)
Identifying Git LFS File Storage (50 XP)
Trunk Based Development (50 XP)
Principles of Trunk Based Development (50 XP)
Trunk Based Development Workflow (50 XP)
ETL Repository Management (100 XP)
Wrap-up (50 XP)


Prerequisites
Intermediate Git
Software and Data Engineer
Amanda is an experienced software and data engineer with over a decade of hands-on experience in version control, particularly Git, applied across software development and data engineering projects. Amanda's expertise spans from her time at tech giants like Dropbox and Microsoft to her current role as a freelance consultant and educator. She holds a Master's degree in Computer Science from the University of Pittsburgh.
Join over 18 million learners and start Advanced Git today!