Category
Topics
Data Engineering Tutorials | Read, Learn, & Grow Your Skills
Read our data engineering blog to gain extra insight into how to build the tools, infrastructure, & frameworks to support data fluency in your business.
Other topics:
Training 2 or more people?Try DataCamp for Business
The Difference Between WHERE and HAVING in SQL
Discover how WHERE filters row-level data in SQL queries, while HAVING filters grouped data after aggregation, and master their distinct uses in SQL querying.
Islam Salahuddin
October 10, 2024
SQL Order of Execution: Understanding How Queries Run
Understand the SQL order of execution and how its different from the order of writing. Write accurate and optimized queries for improved performance and avoid common mistakes in query design.
Allan Ouko
October 10, 2024
Jenkins for MLOps: A Complete CI/CD Tutorial
Learn how to install Jenkins on Windows, set up and run Jenkins agents and pipelines, and build MLOps projects with Jenkins pipelines from model training to model serving.
Abid Ali Awan
October 1, 2024
Moving Data with Python and dlt: A Guide for Data Engineers
Python dlt is a great choice for data engineers who want to move data fast and confidently. It integrates with almost any source and destination and lets you write your ELT/ETL pipelines quickly. This tutorial shows you how to control every step of the pipeline and work with local files, cloud, databases, and REST APIs.
Dario Radečić
October 1, 2024
What is Transitive Dependency?
A transitive dependency occurs when one attribute in a database indirectly relies on another through a third attribute, causing redundancy. Keep reading to learn how to identify and eliminate transitive dependencies in database design .
Allan Ouko
September 26, 2024
Introduction to GitHub Codespaces
Discover GitHub Codespaces, the development environment that allows you to write, run, and deploy your code anywhere.
Adejumo Ridwan Suleiman
September 24, 2024
Amazon Simple Queue Service (SQS): A Comprehensive Tutorial
This tutorial teaches you how to create, manage, and use Amazon SQS queues for building scalable distributed applications on AWS, with practical examples using both the console and the CLI.
Zoumana Keita
September 22, 2024
DuckDB for Data Engineers: Speed Up Your Data Pipelines 10x and More
DuckDB is a powerful analytical engine that lives on your laptop. You can use it to speed up data reading and processing and reduce your pipeline runtimes from minutes to seconds. Follow this hands-on guide to learn how.
Dario Radečić
September 21, 2024
SQL UPDATE with JOIN: How it Works
Learn how UPDATE with JOIN in SQL simplifies cross-table updates in SQL Server. Understand how INNER JOIN and LEFT JOIN differ for specific use cases, and explore alternatives using subqueries or the MERGE statement.
Allan Ouko
September 11, 2024
DynamoDB to Redshift: Three Methods for Migrating Data
Learn three methods to migrate data from Amazon DynamoDB to Redshift, with step-by-step instructions, the pros and cons of each method, and best practices for data integrity and performance optimization.
Aashish Nair
September 10, 2024
Getting Started with AWS Glue: A Step-by-Step Guide
Learn how to set up AWS Glue, create a crawler, catalog your data, and run jobs to convert CSV files into Parquet format, optimizing your ETL processes.
Zoumana Keita
September 9, 2024
What is a Database Schema? A Guide on the Types and Uses
A database schema provides a comprehensive blueprint for the organization of data, detailing how tables, fields, and relationships are structured. Read to learn about the schema types, such as star, snowflake, and relational schemas.
Laiba Siddiqui
August 30, 2024