Lecture

Resource Management and Fault Tolerance

Description

This lecture covers the design choices behind Big Data systems, focusing on the storage layer, programming model, execution engine, resource management, and fault tolerance. It explains how resource managers such as YARN let multiple frameworks coexist on the same cluster, how resource-management decisions can be made at varying granularities, and which architectural choices Spark makes. It then turns to why fault tolerance matters in the face of hardware and software failures, covering data safety, job recovery in Spark, and the impact of failures on performance.
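As a rough illustration of the job-recovery topic, the sketch below mimics lineage-based recomputation, the mechanism Spark is generally known to use to rebuild lost partitions. It is plain Python, not Spark code: `Partition`, `materialize`, and the `cache` dictionary are hypothetical names introduced here, and the lecture itself may present recovery differently.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List, Optional


@dataclass
class Partition:
    """A dataset partition described by its lineage (hypothetical model, not Spark API)."""
    name: str
    parent: Optional["Partition"]               # None for a source partition
    compute: Callable[[List[int]], List[int]]   # deterministic transformation of the parent data


def materialize(p: Partition, cache: Dict[str, List[int]]) -> List[int]:
    """Return the partition's data, recomputing it from its lineage if it is not cached."""
    if p.name in cache:
        return cache[p.name]
    parent_data = materialize(p.parent, cache) if p.parent else []
    data = p.compute(parent_data)
    cache[p.name] = data
    return data


# Lineage: source -> doubled -> filtered
source = Partition("source", None, lambda _: [1, 2, 3, 4])
doubled = Partition("doubled", source, lambda xs: [2 * x for x in xs])
filtered = Partition("filtered", doubled, lambda xs: [x for x in xs if x % 4 == 0])

cache: Dict[str, List[int]] = {}
before = materialize(filtered, cache)

# Simulate a worker failure that loses the derived partitions but not the source.
del cache["doubled"]
del cache["filtered"]

# Recovery: only the lost partitions are recomputed, starting from the surviving source.
after = materialize(filtered, cache)
```

Because every transformation is deterministic, the recomputed result matches the original. The cost of recovery is the recomputation time, which is one way failures affect performance.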

This video is available exclusively on MediaSpace for a restricted audience. Log in to MediaSpace to access it if you have the necessary permissions.

Official source

https://mediaspace.epfl.ch/media/0_l0ytyt7c
