You'll master the art of building production-ready data pipelines that automatically process millions of records. In this hands-on course, you'll design end-to-end workflows that integrate diverse data sources—from databases and APIs to real-time streams—using industry-standard tools like Apache Spark, dbt, and Apache Airflow. You'll learn to create robust data models that preserve historical changes, implement performance optimizations that reduce processing time by 30% or more, and build automated workflows with intelligent retry logic and monitoring alerts.
By the end, you'll have created a complete data pipeline system that demonstrates the technical skills data engineering teams need most. You'll know how to unify fragmented data sources, apply advanced transformation techniques, and ensure your pipelines run reliably at scale. This practical experience directly translates to the challenges you'll face as a data engineer, data analyst, or anyone working with large-scale data systems in modern organizations.
You will learn the foundational concepts and tools needed to create systematic visual documentation of data pipeline architectures.
涵盖的内容
3个视频2篇阅读材料1个作业
显示有关单元内容的信息
3个视频•总计15分钟
Why Data Flow Visualization Drives Engineering Success•4分钟
Systematic Approach to Identifying Sources and Destinations•8分钟
Creating Your First Data Flow Diagram•3分钟
2篇阅读材料•总计11分钟
Essential Components of Professional Data Flow Diagrams•6分钟
Transformation Mapping Principles for Complex Data Pipelines•5分钟
1个作业•总计3分钟
Data Flow Fundamentals Knowledge Check•3分钟
Creating Comprehensive Data Flow Diagrams
第 2 单元•小时 后完成
单元详情
You will apply advanced techniques to create professional-quality data flow diagrams that accurately represent complex enterprise data systems and support stakeholder collaboration.
涵盖的内容
2个视频2篇阅读材料3个作业
显示有关单元内容的信息
2个视频•总计12分钟
Advanced Diagramming Techniques for Complex Data Systems•9分钟
Mapping Complex Multi-System Data Pipelines•3分钟
2篇阅读材料•总计13分钟
Enterprise Data Flow Best Practices and Industry Standards•7分钟
Validation and Review Processes for Data Flow Documentation•6分钟
3个作业•总计25分钟
Comprehensive Data Flow Mastery Assessment•10分钟
Create Complete Enterprise Data Flow Diagram•12分钟
Advanced Data Flow Concepts Knowledge Check•3分钟
Modular Pipeline Development - Foundation & Core
第 3 单元•22分钟 后完成
单元详情
You will establish the foundational understanding and core skills for creating modular data pipeline stages, focusing on the principles of separation of concerns and tool integration fundamentals.
涵盖的内容
1个视频1篇阅读材料1个作业
显示有关单元内容的信息
1个视频•总计7分钟
Open Source Tool Ecosystem: Spark, dbt, and Airflow Integration•7分钟
1篇阅读材料•总计12分钟
Fundamentals of Modular Data Pipeline Architecture•12分钟
You will implement complete end-to-end data pipelines by integrating modular components with industry-standard tools, culminating in comprehensive assessment of their pipeline development capabilities.
涵盖的内容
2篇阅读材料3个作业
显示有关单元内容的信息
2篇阅读材料•总计20分钟
End-to-End Pipeline Integration Patterns•12分钟
Implementing Complete Pipeline Integration with Spark, dbt, and Airflow•8分钟
3个作业•总计38分钟
Comprehensive Modular Pipeline Development Assessment•15分钟
End-to-End Pipeline Development Project•20分钟
Modular Pipeline Integration and Coordination Quiz•3分钟
Connector Configuration Foundations
第 5 单元•30分钟 后完成
单元详情
You will establish foundational knowledge of connector architecture and complete their first database connector configuration using Airbyte.
涵盖的内容
2个视频2篇阅读材料1个作业
显示有关单元内容的信息
2个视频•总计10分钟
Why Data Source Unification Matters for Enterprise Success•4分钟
Airbyte Connector Fundamentals - Your Integration Foundation•6分钟
2篇阅读材料•总计17分钟
Understanding Connector Architecture and Integration Patterns•8分钟
Professional Guide: Configuring Your First Database Connector Step-by-Step•9分钟
1个作业•总计3分钟
Connector Configuration Foundation Knowledge Check •3分钟
Unified Data Integration Implementation
第 6 单元•小时 后完成
单元详情
You will implement complete multi-source data integration by configuring streaming and API connectors, applying enterprise security patterns, and demonstrating mastery through comprehensive connector configuration.
Streaming and API Connector Configuration Mastery•6分钟
2篇阅读材料•总计15分钟
Authentication and Security Patterns for Production Connectors •7分钟
Multi-Source Data Integration: A How-To Guide•8分钟
2个作业•总计18分钟
Connector Configuration Mastery Assessment •15分钟
Multi-Source Integration Configuration Check•3分钟
SCD2 Historical Tracking Fundamentals
第 7 单元•25分钟 后完成
单元详情
You will understand the fundamental concepts of SCD2 logic and begin applying these principles to create data models that preserve historical context in enterprise data warehouses.
涵盖的内容
3个视频1篇阅读材料1个作业
显示有关单元内容的信息
3个视频•总计14分钟
Why SCD2 Matters in Enterprise Data Warehouses•4分钟
Understanding SCD2 Core Components and Business Logic •7分钟
Building Your First SCD2 Table Structure in SQL•4分钟
1篇阅读材料•总计8分钟
SCD2 Implementation Patterns and Data Model Design•8分钟
1个作业•总计3分钟
SCD2 Fundamentals Knowledge Check•3分钟
dbt SCD2 Model Implementation
第 8 单元•小时 后完成
单元详情
You will implement production-ready SCD2 models using dbt, creating automated historical tracking systems with proper change detection, validity periods, and current status management.
涵盖的内容
2个视频2篇阅读材料3个作业
显示有关单元内容的信息
2个视频•总计13分钟
dbt Snapshots for Automated SCD2 Change Detection•8分钟
Building Complete dbt SCD2 Model with Validity Periods•5分钟
2篇阅读材料•总计18分钟
Why dbt Transforms SCD2 Implementation for Data Teams•8分钟
dbt SCD2 Implementation Patterns and Production Considerations •10分钟
3个作业•总计36分钟
SCD2 Implementation Mastery Assessment •15分钟
Build Production SCD2 Data Model for Product Dimensions •18分钟
dbt SCD2 Implementation Knowledge Check •3分钟
Workflow Design Principles - Foundation
第 9 单元•28分钟 后完成
单元详情
You will understand the foundational concepts and design principles for creating robust data workflows with Apache Airflow.
涵盖的内容
3个视频1篇阅读材料1个作业
显示有关单元内容的信息
3个视频•总计15分钟
The Cost of Fragile Data Pipelines•2分钟
Apache Airflow Fundamentals for Production Workflows•6分钟
Building Your First Production-Ready DAG Structure•7分钟
1篇阅读材料•总计10分钟
Design Principles for Robust Data Workflows•10分钟
1个作业•总计3分钟
Workflow Design Principles Assessment•3分钟
Production Implementation - Core Application & Assessment
第 10 单元•小时 后完成
单元详情
You will implement production-grade Airflow workflows with retry mechanisms, SLA monitoring, and parameterization for enterprise-ready data pipeline resilience.
涵盖的内容
2个视频1篇阅读材料2个作业1个非评分实验室
显示有关单元内容的信息
2个视频•总计12分钟
When Production Workflows Save Business Operations•3分钟
Implementing Advanced Production Patterns in Airflow•9分钟
1篇阅读材料•总计10分钟
Production Implementation Patterns and Best Practices•10分钟
2个作业•总计13分钟
Production Workflow Mastery Assessment•10分钟
Production Implementation Patterns Assessment•3分钟
1个非评分实验室•总计20分钟
Building Production-Ready Airflow DAGs with Retry Logic and SLA Monitoring•20分钟
Project: Building Automated Data Pipelines with Spark, dbt, and Airflow
第 11 单元•小时 后完成
单元详情
You will integrate data engineering skills to build a complete automated data pipeline that processes diverse data sources, applies historical tracking, and orchestrates workflows. This project synthesizes mapping, transformation, integration, modeling, and automation capabilities into a production-ready data system.
涵盖的内容
4篇阅读材料1个作业
显示有关单元内容的信息
4篇阅读材料•总计90分钟
Why This Project Matters•10分钟
Project Requirements•10分钟
Assignment: Data Pipeline Automation System•60分钟
Solution Key•10分钟
1个作业•总计15分钟
Graded Quiz: Building Automated Data Pipelines with Spark, dbt, and Airflow•15分钟
Coursera brings together a diverse network of subject matter experts who have demonstrated their expertise through professional industry experience or strong academic backgrounds. These instructors design and teach courses that make practical, career-relevant skills accessible to learners worldwide.
When will I have access to the lectures and assignments?
To access the course materials, assignments and to earn a Certificate, you will need to purchase the Certificate experience when you enroll in a course. You can try a Free Trial instead, or apply for Financial Aid. The course may offer 'Full Course, No Certificate' instead. This option lets you see all course materials, submit required assessments, and get a final grade. This also means that you will not be able to purchase a Certificate experience.
What will I get if I subscribe to this Certificate?
When you enroll in the course, you get access to all of the courses in the Certificate, and you earn a certificate when you complete the work. Your electronic Certificate will be added to your Accomplishments page - from there, you can print your Certificate or add it to your LinkedIn profile.