Lecture

Stream Processing and Fault Tolerance

Description

This lecture covers the concepts of stream processing and fault tolerance in big data analytics. It discusses the measurement of time in data streams, efficient stream management techniques, scaling-out platforms like Spark Streaming and Apache Flink, fault tolerance strategies such as replication and upstream backup, and the use of DStreams for discretized stream processing. The instructor explains fault tolerance techniques for stream processing systems, including state partitioning and immutable tasks. Examples of streaming word count and sliding window operations are provided, showcasing the combination of batch and streaming computations. The lecture concludes with a vision of unifying batch and stream processing models in a single stack.

This video is available exclusively on Mediaspace for a restricted audience. Please log in to MediaSpace to access it if you have the necessary permissions.

Watch on Mediaspace
About this result
This page is automatically generated and may contain information that is not correct, complete, up-to-date, or relevant to your search query. The same applies to every other page on this website. Please make sure to verify the information with EPFL's official sources.