Data Engineering Tutorials | Read, Learn, & Grow Your Skills

Read our data engineering blog to gain extra insight into how to build the tools, infrastructure, & frameworks to support data fluency in your business.
SQL

The Difference Between WHERE and HAVING in SQL

Discover how WHERE filters row-level data in SQL queries, while HAVING filters grouped data after aggregation, and master their distinct uses in SQL querying.
Islam Salahuddin

October 10, 2024

SQL

SQL Order of Execution: Understanding How Queries Run

Understand the SQL order of execution and how it differs from the order in which queries are written. Write accurate, optimized queries and avoid common mistakes in query design.
Allan Ouko

October 10, 2024

Data Engineering

Moving Data with Python and dlt: A Guide for Data Engineers

Python dlt is a great choice for data engineers who want to move data fast and confidently. It integrates with almost any source and destination and lets you write your ELT/ETL pipelines quickly. This tutorial shows you how to control every step of the pipeline and work with local files, cloud, databases, and REST APIs.
Dario Radečić

October 1, 2024

SQL

What is Transitive Dependency?

A transitive dependency occurs when one attribute in a database indirectly relies on another through a third attribute, causing redundancy. Keep reading to learn how to identify and eliminate transitive dependencies in database design.
Allan Ouko

September 26, 2024

AWS

Amazon Simple Queue Service (SQS): A Comprehensive Tutorial

This tutorial teaches you how to create, manage, and use Amazon SQS queues for building scalable distributed applications on AWS, with practical examples using both the console and the CLI.
Zoumana Keita

September 22, 2024

Data Engineering

DuckDB for Data Engineers: Speed Up Your Data Pipelines 10x and More

DuckDB is a powerful analytical engine that lives on your laptop. You can use it to speed up data reading and processing and reduce your pipeline runtimes from minutes to seconds. Follow this hands-on guide to learn how.
Dario Radečić

September 21, 2024

SQL

SQL UPDATE with JOIN: How it Works

Learn how UPDATE with JOIN simplifies cross-table updates in SQL Server. Understand how INNER JOIN and LEFT JOIN differ for specific use cases, and explore alternatives using subqueries or the MERGE statement.
Allan Ouko

September 11, 2024

AWS

DynamoDB to Redshift: Three Methods for Migrating Data

Learn three methods to migrate data from Amazon DynamoDB to Redshift, with step-by-step instructions, the pros and cons of each method, and best practices for data integrity and performance optimization.
Aashish Nair

September 10, 2024

AWS

Getting Started with AWS Glue: A Step-by-Step Guide

Learn how to set up AWS Glue, create a crawler, catalog your data, and run jobs to convert CSV files into Parquet format, optimizing your ETL processes.
Zoumana Keita

September 9, 2024

Data Engineering

What is a Database Schema? A Guide on the Types and Uses

A database schema provides a comprehensive blueprint for the organization of data, detailing how tables, fields, and relationships are structured. Read on to learn about schema types such as star, snowflake, and relational schemas.
Laiba Siddiqui

August 30, 2024