Skip to main content

Collaborative Data Engineering with Git

Amanda Crawford-Adamo, an experienced data engineer, walks you through collaborative data engineering practices using Git.
Jun 11, 2025

Collaboration is at the heart of successful data engineering, and Git has become an essential tool for managing complex, multi-developer workflows. Yet many data practitioners are still getting up to speed with how to apply version control best practices in the context of data pipelines and analytics infrastructure. Mastering Git not only improves collaboration, but also ensures reproducibility, traceability, and long-term maintainability of data projects.

In this hands-on code-along, Amanda Crawford-Adamo, an experienced data engineer, walks you through collaborative data engineering practices using Git. You’ll learn how to manage data workflows with version control, explore advanced Git techniques like branching strategies and repository organization, and dive into a real-world case study about collaborating on a data pipeline. This session is perfect for data and software engineers looking to improve the way they build, share, and scale data infrastructure.

Key Takeaways:

  • Learn best practices for collaborative data engineering using Git.
  • Discover advanced Git techniques for managing shared repositories and resolving conflicts.
  • Explore a real-world example of using Git to build a data pipeline as a team.

Session Resources (Including GitHub Repo + Slides)

Topics
Related

blog

Git vs. GitHub: Differences Every Developer Should Know

Understand the difference between Git and GitHub, how they work together in modern workflows, and when to use each for solo and team projects.
Oluseye Jeremiah's photo

Oluseye Jeremiah

9 min

Tutorial

Introduction to GitHub Codespaces

Discover GitHub Codespaces, the development environment that allows you to write, run, and deploy your code anywhere.
Adejumo Ridwan Suleiman's photo

Adejumo Ridwan Suleiman

10 min

Tutorial

Building an ETL Pipeline with Airflow

Master the basics of extracting, transforming, and loading data with Apache Airflow.
Jake Roach's photo

Jake Roach

15 min

Tutorial

Git Branch: A Guide to Creating, Managing, and Merging Branches

Master the power of Git branches for smoother development and better collaboration.
Oluseye Jeremiah's photo

Oluseye Jeremiah

12 min

code-along

Creating Data Pipelines with Airflow

Join Mike, an experienced data engineering consultant, as he guides you through the fundamentals of data pipelines with Airflow and Python.
Mike Metzger's photo

Mike Metzger

code-along

Pair Programming with GitHub Copilot

In this session, Nuno, DataCamp's Director of Engineering, demonstrates how to make use of GitHub Copilot. You'll see how to perform a simple data analysis in conjunction with AI, and learn how to make the most of Copilot's features.
Nuno Rocha's photo

Nuno Rocha

See MoreSee More