Building Realtime Pipelines in Cloud Data Fusion
About this Course
This is a self-paced lab that takes place in the Google Cloud console. In addition to batch pipelines, Data Fusion lets you create real-time pipelines that process events as they are generated. Currently, real-time pipelines execute using Apache Spark Streaming on Cloud Dataproc clusters. In this lab, you will learn how to build a streaming pipeline using Data Fusion.
Created by: Google Cloud
