Course
Introduction to Databricks Lakehouse
Skill Level: Basic
Updated 04/2026
Included with Premium or Teams
Databricks | Data Engineering | 3 hr | 15 videos | 43 exercises | 3,550 XP | Statement of Accomplishment
Course Description
Understand the Lakehouse Architecture
Start by discovering what sets the lakehouse apart from traditional approaches. You'll explore the medallion architecture (bronze, silver, and gold layers) that transforms raw, messy data into clean, business-ready insights. Then get oriented inside the Databricks workspace to understand how catalogs, schemas, and volumes organize everything.
Master Compute and Notebooks
Learn to choose the right cluster for the job, configure autoscaling and auto-termination to control costs, and build notebooks that mix Python, SQL, and Markdown. You'll also connect your work to Git through Databricks Repos for version control and team collaboration.
Govern and Share Data Securely
Explore Unity Catalog to manage access controls and track data lineage across your organization. Then use Delta Sharing to distribute data to partners (on Databricks or any other platform) and query external sources with Lakehouse Federation, all without copying a single byte.
Deploy to Production with Asset Bundles
Wrap up by packaging your notebooks, pipelines, and jobs into Databricks Asset Bundles for repeatable, automated deployments. A capstone scenario brings everything together so you leave ready to apply these skills on the job.
What you'll learn
- Identify how the lakehouse architecture and medallion pattern (bronze, silver, gold) organize data from raw ingestion through to business-ready insights.
- Recognize how to configure and manage Databricks clusters, including selecting runtimes, enabling autoscaling, and controlling costs with auto-termination.
- Identify how to build multi-language notebooks, use magic commands, and connect work to Git through Databricks Repos for version control.
- Recognize how Unity Catalog, Delta Sharing, and Lakehouse Federation work together to govern access, share data securely, and query external sources without copying data.
- Identify how to package notebooks, pipelines, and jobs into Databricks Asset Bundles for repeatable, automated production deployments.
Prerequisites
Introduction to Databricks
1
The Lakehouse Paradigm
Discover what makes the lakehouse different from traditional architectures, how the medallion pattern organizes data, and where things live inside the Databricks platform.
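As a rough illustration of the medallion pattern named above, the sketch below moves a few records through bronze, silver, and gold stages using plain Python lists and dictionaries. It is not Databricks code: on the platform each layer would typically be a table rather than an in-memory list, and the record fields (`order_id`, `region`, `amount`) and the helper names `to_silver` and `to_gold` are hypothetical.

```python
from collections import defaultdict

# Bronze: raw records exactly as ingested (hypothetical sample data).
bronze = [
    {"order_id": "1", "region": " emea ", "amount": "10.50"},
    {"order_id": "2", "region": "AMER", "amount": "not-a-number"},
    {"order_id": "1", "region": " emea ", "amount": "10.50"},  # duplicate
    {"order_id": "3", "region": "emea", "amount": "4.50"},
]


def to_silver(rows):
    """Clean bronze rows: drop duplicates and bad amounts, normalize types."""
    seen = set()
    cleaned = []
    for row in rows:
        if row["order_id"] in seen:
            continue  # skip an order that was already accepted
        try:
            amount = float(row["amount"])
        except ValueError:
            continue  # skip rows whose amount cannot be parsed
        seen.add(row["order_id"])
        cleaned.append({
            "order_id": int(row["order_id"]),
            "region": row["region"].strip().upper(),
            "amount": amount,
        })
    return cleaned


def to_gold(rows):
    """Aggregate silver rows into a business-ready total per region."""
    totals = defaultdict(float)
    for row in rows:
        totals[row["region"]] += row["amount"]
    return dict(totals)


silver = to_silver(bronze)
gold = to_gold(silver)
print(gold)
```

The point of the layering is that each stage has one job: bronze keeps the raw input untouched, silver applies cleaning rules, and gold holds the aggregate a report would read.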
2
Compute and Notebooks
Spin up the right cluster for the job, configure it for cost and performance, master the notebook environment, and connect your work to Git - all inside the Databricks workspace.
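To make the cost controls mentioned above more concrete, here is a minimal sketch of a cluster definition that combines autoscaling with auto-termination. The field names (`cluster_name`, `spark_version`, `node_type_id`, `autoscale`, `autotermination_minutes`) follow the request body of the Databricks Clusters API as best understood here; check them against the current API reference before relying on them. The helper `build_cluster_spec`, its validation rules, and the placeholder values are hypothetical, and the code only builds and prints the definition without contacting any workspace.

```python
import json


def build_cluster_spec(name, min_workers, max_workers, idle_minutes):
    """Return a cluster definition dict with autoscaling and auto-termination.

    Hypothetical helper: the checks below are illustrative, not platform rules.
    """
    if not 1 <= min_workers <= max_workers:
        raise ValueError("need 1 <= min_workers <= max_workers")
    if idle_minutes <= 0:
        raise ValueError("idle_minutes must be positive")
    return {
        "cluster_name": name,
        # Placeholders: fill in a runtime version and node type
        # that are available in your own workspace.
        "spark_version": "<runtime-version>",
        "node_type_id": "<node-type>",
        # Autoscaling: the cluster grows and shrinks between these bounds.
        "autoscale": {"min_workers": min_workers, "max_workers": max_workers},
        # Auto-termination: shut the cluster down after this many idle minutes.
        "autotermination_minutes": idle_minutes,
    }


spec = build_cluster_spec("etl-dev", min_workers=1, max_workers=4, idle_minutes=30)
print(json.dumps(spec, indent=2))
```

A small worker range with a short idle timeout suits development work, while a production job would usually justify a wider range.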
3
Governance and Sharing
Lock down your data with Unity Catalog, share it securely with Delta Sharing, and federate queries to external sources - all without copying a single byte.
4
Deployment and Next Steps
Package your work with Databricks Asset Bundles, deploy to production, and bring everything together in a capstone scenario.