Are you an EPFL student looking for a semester project?
Work with us on data science and visualisation projects, and deploy your project as an app on top of Graph Search.
This lecture introduces the concepts of data stream processing, focusing on the differences between batch and stream processing, the relevance of stream processing for real-time insights, applications in various domains, constraints, challenges, and related concepts such as sliding windows, event time vs processing time, and window operations. The lecture also covers tools like Apache Kafka and Spark Streaming, explaining their functionalities and how they can be used for scalable, fault-tolerant stream processing applications.