Learn to build data pipelines on the Databricks Lakehouse Platform — from architecture concepts to hands-on Spark and Delta Lake. This beginner course starts with why the lakehouse pattern replaced separate data warehouses and data lakes, then moves directly into the Databricks workspace where you'll configure compute, write PySpark and SQL queries, and manage data with Unity Catalog's three-level namespace.
Week by week, you'll progress from navigating the platform to transforming DataFrames with select, filter, groupBy, and joins, then to creating Delta Lake tables with ACID transactions, schema enforcement, and time travel. You'll perform real DML operations — INSERT, UPDATE, DELETE, and MERGE — and learn to schedule production pipelines using Databricks Jobs with DAG-based orchestration.
The course runs entirely on Databricks Free Edition, so there's no cloud billing. Six hands-on labs reinforce each module: explore the workspace, write notebook-based transformations, build Delta tables, and wire up an automated workflow. By the end, you'll have built a complete data engineering pipeline from raw ingestion through Delta Lake to scheduled production jobs.
This module introduces the lakehouse paradigm and the Databricks platform. You'll learn about the structure of lakehouse architecture, explore the Databricks workspace and its core tools, and understand how compute and storage work together.
涵盖的内容
6个视频7篇阅读材料1个作业
显示有关单元内容的信息
6个视频•总计24分钟
Data Architecture Evolution•5分钟
Lakehouse Architecture•5分钟
Databricks and the Lakehouse•3分钟
Databricks Overview•4分钟
Workspace, Catalog & Data•4分钟
Compute Resources•4分钟
7篇阅读材料•总计7分钟
About This Course•1分钟
Key Terms•1分钟
Reflection•1分钟
Key Terms•1分钟
Reflection•1分钟
Key Terms•1分钟
Reflection•1分钟
1个作业•总计2分钟
Quiz: Lakehouse Architecture & Platform•2分钟
Apache Spark on Databricks
第 2 单元•小时 后完成
单元详情
This module covers notebooks and hands-on data manipulation using PySpark. You'll create and organize notebooks, load data from the Catalog, and write PySpark transformations to select, filter, aggregate, and join datasets.
涵盖的内容
6个视频6篇阅读材料1个作业
显示有关单元内容的信息
6个视频•总计28分钟
Using Notebooks•4分钟
Magic Commands & Utilities•4分钟
Loading & Previewing Data•5分钟
Spark Core Concepts•3分钟
Select & Filter Operations•7分钟
GroupBy, Aggregations & Joins•5分钟
6篇阅读材料•总计6分钟
Key Terms•1分钟
Reflection•1分钟
Databricks Free Edition•1分钟
Key Terms•1分钟
Lazy Evaluation•1分钟
Reflection•1分钟
1个作业•总计2分钟
Quiz: Spark Fundamentals•2分钟
Delta Lake Essentials
第 3 单元•小时 后完成
单元详情
This module introduces Delta Lake, where you'll create Delta tables, perform transactional operations like updates, deletes, and merges, use time travel to query previous versions, and see how Delta Lake connects to governance and automation features.
涵盖的内容
6个视频7篇阅读材料1个作业
显示有关单元内容的信息
6个视频•总计25分钟
What Is Delta Lake•4分钟
Delta Lake Concepts•4分钟
Creating Delta Tables•6分钟
Insert, Update & Merge•5分钟
Time Travel•3分钟
Jobs, Dashboards & Workflows•4分钟
7篇阅读材料•总计7分钟
Key Terms•1分钟
Reflection•1分钟
Key Terms•1分钟
Reflection•1分钟
Hands-On: MERGE, Updates, and Time Travel (Pure Python Mental Model)•1分钟
Key Terms•1分钟
Reflection•1分钟
1个作业•总计3分钟
Quiz: Delta Lake & Workflows•3分钟
Capstone
第 4 单元•6分钟 后完成
单元详情
Build an end-to-end lakehouse data pipeline integrating every concept from the course. Starting from raw data files, you will construct a complete medallion architecture (bronze → silver → gold) with Delta Lake, implement incremental MERGE logic, and orchestrate the pipeline as a scheduled Databricks Job. Six hands-on lab notebooks guide you through the project using the course GitHub repository.
Do I need a paid Databricks account to take this course?
o. The entire course runs on Databricks Free Edition, which gives you full platform access — notebooks, clusters, Delta Lake, Unity Catalog, and Jobs — with zero cloud billing. You can start immediately without a credit card.
When will I have access to the lectures and assignments?
To access the course materials, assignments and to earn a Certificate, you will need to purchase the Certificate experience when you enroll in a course. You can try a Free Trial instead, or apply for Financial Aid. The course may offer 'Full Course, No Certificate' instead. This option lets you see all course materials, submit required assessments, and get a final grade. This also means that you will not be able to purchase a Certificate experience.
What will I get if I subscribe to this Specialization?
When you enroll in the course, you get access to all of the courses in the Specialization, and you earn a certificate when you complete the work. Your electronic Certificate will be added to your Accomplishments page - from there, you can print your Certificate or add it to your LinkedIn profile.
Is financial aid available?
Yes. In select learning programs, you can apply for financial aid or a scholarship if you can’t afford the enrollment fee. If fin aid or scholarship is available for your learning program selection, you’ll find a link to apply on the description page.