Skip to main content
HomeDatabricks

Course

Introduction to Databricks Lakehouse

BasicSkill Level
4.7+
40 reviews
Updated 05/2026
Explore the Databricks Lakehouse - from medallion architecture and clusters to governance, sharing, and deployment.
Start Course for Free
DatabricksData Engineering
3 hr
15 videos
43 Exercises
3,550 XP
Statement of Accomplishment

Create Your Free Account

or

By continuing, you accept our Terms of Use, our Privacy Policy and that your data is stored in the USA.

Loved by learners at thousands of companies

Group

Training a Team?

Try for Business

Course Description

Data lakes offer flexibility but lack reliability. Data warehouses deliver performance but can't handle unstructured data. The lakehouse combines both — and Databricks is where it all comes together. In this course, you'll explore the Databricks Lakehouse from the ground up, gaining hands-on experience with the platform's core components.

Understand the Lakehouse Architecture

Start by discovering what sets the lakehouse apart from traditional approaches. You'll explore the medallion architecture — bronze, silver, and gold layers — that transforms raw, messy data into clean, business-ready insights. Then get oriented inside the Databricks workspace to understand how catalogs, schemas, and volumes organize everything.

Master Compute and Notebooks

Learn to choose the right cluster for the job, configure autoscaling and auto-termination to control costs, and build notebooks that mix Python, SQL, and Markdown. You'll also connect your work to Git through Databricks Repos for version control and team collaboration.

Govern and Share Data Securely

Explore Unity Catalog to manage access controls and track data lineage across your organization. Then use Delta Sharing to distribute data to partners — on Databricks or any other platform — and query external sources with Lakehouse Federation, all without copying a single byte.

Deploy to Production with Asset Bundles

Wrap up by packaging your notebooks, pipelines, and jobs into Databricks Asset Bundles for repeatable, automated deployments. A capstone scenario brings everything together so you leave ready to apply these skills on the job.

What you'll learn

  • Identify how the lakehouse architecture and medallion pattern (bronze, silver, gold) organize data from raw ingestion through to business-ready insights.
  • Recognize how to configure and manage Databricks clusters, including selecting runtimes, enabling autoscaling, and controlling costs with auto-termination.
  • Identify how to build multi-language notebooks, use magic commands, and connect work to Git through Databricks Repos for version control.
  • Recognize how Unity Catalog, Delta Sharing, and Lakehouse Federation work together to govern access, share data securely, and query external sources without copying data.
  • Identify how to package notebooks, pipelines, and jobs into Databricks Asset Bundles for repeatable, automated production deployments.

Feels like what you want to learn?

Start Course for Free

Prerequisites

Introduction to Databricks
1

The Lakehouse Paradigm

Discover what makes the lakehouse different from traditional architectures, how the medallion pattern organizes data, and where things live inside the Databricks platform.
Start Chapter
2

Compute and Notebooks

Spin up the right cluster for the job, configure it for cost and performance, master the notebook environment, and connect your work to Git - all inside the Databricks workspace.
Start Chapter
Introduction to Databricks Lakehouse
Course
Complete

Earn Statement of Accomplishment

Add this credential to your LinkedIn profile, resume, or CV
Share it on social media and in your performance review
Enroll Now

Don’t just take our word for it

*4.7
from 40 reviews
80%
18%
3%
0%
0%
  • Giang
    8 hours ago

  • Mohan
    2 days ago

  • Sayan
    3 days ago

  • Erickson
    3 days ago

  • Donal
    4 days ago

  • Sun
    4 days ago

Mohan

Sayan

Donal

FAQs

What is the medallion architecture covered in this course?

The medallion architecture is a pattern that organizes data into bronze, silver, and gold layers, progressively refining raw data into trusted, analytics-ready insights within the lakehouse.

Does this course cover Unity Catalog and data governance?

Yes, Chapter 3 teaches you to lock down data with Unity Catalog, share it securely using Delta Sharing, and federate queries to external sources without copying data.

What is Delta Sharing and is it covered here?

Delta Sharing is an open protocol for securely sharing data across organizations. Chapter 3 covers how to use it alongside Unity Catalog for governed data sharing.

Do I need prior Databricks experience for this course?

Yes, Introduction to Databricks is a prerequisite. This course builds on that foundation to cover the lakehouse platform, compute configuration, governance, and deployment.

What are Databricks Asset Bundles and will I learn to use them?

Asset Bundles let you package and deploy your Databricks work to production. Chapter 4 teaches you how to use them and brings all course concepts together in a capstone scenario.

Join over 19 million learners and start Introduction to Databricks Lakehouse today!

Create Your Free Account

or

By continuing, you accept our Terms of Use, our Privacy Policy and that your data is stored in the USA.

Grow your data skills with DataCamp for Mobile

Make progress on the go with our mobile courses and daily 5-minute coding challenges.