In this hands-on, one-hour project-based course, you will master real-time data processing using Apache Spark Structured Streaming. This course is designed for data engineers and developers who want to gain practical experience in building streaming data pipelines. You will begin by setting up the Spark environment and learning how to configure micro-batches and fault-tolerance mechanisms through checkpointing. Next, you'll dive into transforming streaming data by applying filters, maps, and aggregations to extract meaningful insights. You'll also handle out-of-order data with watermarks, ensuring the accuracy of your real-time analytics. The course will introduce you to querying streaming data using SQL, allowing you to perform transformations and aggregations on live data. Finally, you will learn to deploy your streaming pipeline to production by writing results to an external sink such as Parquet files.

This is an intermediate-level project. To succeed in this course, it is recommended that you have a basic understanding of Apache Spark and the PySpark API, proficiency in programming and big data concepts, and some basic knowledge of writing SQL queries. This is the perfect opportunity for anyone looking to dive into real-time data processing and Spark Structured Streaming!
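To give a feel for what the project covers, here is a minimal PySpark sketch of a pipeline with a micro-batch trigger, checkpointing, a watermark for out-of-order data, and a Parquet sink. It uses Spark's built-in rate source as a stand-in for real event data, and the output and checkpoint paths are placeholders rather than paths used in the course workspace.

```python
from pyspark.sql import SparkSession
from pyspark.sql import functions as F

spark = SparkSession.builder.appName("structured-streaming-demo").getOrCreate()

# Read a micro-batched stream; the built-in rate source emits (timestamp, value) rows.
events = spark.readStream.format("rate").option("rowsPerSecond", 10).load()

# Transform: filter, then aggregate over event-time windows.
# The watermark bounds how long late, out-of-order rows are accepted.
windowed_counts = (
    events
    .filter(F.col("value") % 2 == 0)               # keep even values only
    .withWatermark("timestamp", "1 minute")        # tolerate up to 1 minute of lateness
    .groupBy(F.window("timestamp", "30 seconds"))  # 30-second event-time windows
    .count()
)

# Write to a Parquet sink with a checkpoint location for fault tolerance.
# The file sink requires append mode, which the watermarked aggregation allows.
query = (
    windowed_counts.writeStream
    .format("parquet")
    .option("path", "/tmp/stream_output")              # placeholder output path
    .option("checkpointLocation", "/tmp/stream_ckpt")  # placeholder checkpoint path
    .outputMode("append")
    .trigger(processingTime="30 seconds")              # micro-batch every 30 seconds
    .start()
)

query.awaitTermination()
```

In the project itself, the rate source would be replaced by a real streaming source such as a socket or Kafka topic.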


What you'll learn
Set up and configure a real-time data processing pipeline
Perform transformations, aggregations, and SQL queries on streaming data
Implement fault-tolerance mechanisms and ensure the pipeline remains resilient under high workloads and data inconsistencies
Skills you'll practice
Details to know

Add to your LinkedIn profile
Available on desktop only

Learn, practice, and apply job-ready skills in less than 2 hours
- Receive training from industry experts
- Gain hands-on experience solving real-world job tasks
- Build confidence using the latest tools and technologies

About this Guided Project
Learn step-by-step
In a video that plays in a split-screen with your work area, your instructor will guide you through these steps (a short code sketch of the streaming SQL step follows this task list):
Task 1: Setting Up the Environment for Real-Time Data Streaming
Task 2: Managing Triggers and Checkpoints
Task 3: Transforming Streaming Data
Practice Activity
Task 4: Performing Transformations, Aggregations, and Advanced SQL Queries
Task 5: Writing and Deploying the Pipeline
Cumulative Challenge
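As a preview of the SQL-based querying covered in Task 4, the sketch below registers a streaming DataFrame as a temporary view and aggregates it with Spark SQL. The rate source, view name, and column aliases are illustrative, not the exact ones used in the project.

```python
from pyspark.sql import SparkSession

spark = SparkSession.builder.appName("streaming-sql-demo").getOrCreate()

# Stream from the built-in rate source, which produces (timestamp, value) rows.
stream = spark.readStream.format("rate").option("rowsPerSecond", 5).load()

# Register the streaming DataFrame as a temporary view so it can be queried with SQL.
stream.createOrReplaceTempView("events")

# The SQL aggregation runs incrementally on each micro-batch and returns
# another streaming DataFrame.
summary = spark.sql("""
    SELECT window(timestamp, '1 minute') AS time_window,
           COUNT(*)                      AS event_count,
           AVG(value)                    AS avg_value
    FROM events
    GROUP BY window(timestamp, '1 minute')
""")

# The console sink in complete mode prints the full aggregated result each batch,
# which is handy while developing before switching to a durable sink.
query = summary.writeStream.format("console").outputMode("complete").start()
query.awaitTermination()
```

A production pipeline would typically replace the console sink with a durable sink and a checkpoint location, as covered in Task 5.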
Recommended experience
Experience with Apache Spark and the PySpark API. Python coding skills. SQL query proficiency. Big Data concepts. Kafka basics.

How you'll learn
Skill-based, hands-on learning
Practice new skills by completing job-related tasks.
Expert guidance
Follow along with pre-recorded videos from experts using a unique side-by-side interface.
No downloads or installation required
Access the tools and resources you need in a pre-configured cloud workspace.
Available only on desktop
This Guided Project is designed for laptops or desktop computers with a reliable Internet connection, not mobile devices.
Frequently asked questions
When you purchase a Guided Project, you'll get everything you need to complete it, including access through your web browser to a cloud desktop workspace containing the files and software you need, along with step-by-step video instruction from a subject-matter expert.
Because your workspace contains a cloud desktop sized for a laptop or desktop computer, Guided Projects are not available on mobile devices.
Guided Project instructors are subject-matter experts who are experienced in the project's skill, tool, or domain and are passionate about sharing their knowledge to impact millions of learners around the world.