Lecture

Big Data Challenges: Distributed Computing with Spark

Description

This lecture covers the challenges posed by big data: the growth of data sources and the limits of single-machine processing. It introduces RDDs in Spark and explains how they are distributed over a cluster and processed in parallel. The instructor discusses the hardware used for big data workloads, emphasizing clusters of inexpensive commodity machines and the failures and network latency that come with them. The lecture also explores the MapReduce paradigm, explaining how work is divided across machines and how failures are handled. It then covers the basics of RDD transformations and actions, lazy execution, and RDD persistence, and closes with broadcast variables, accumulators, and Spark DataFrames.
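The way MapReduce divides work across machines can be sketched in plain Python. This is not Spark code and not the lecture's own example. It is a minimal single-process sketch in which each inner list stands in for a partition held by one machine. The function names (`map_phase`, `shuffle`, `reduce_phase`, `word_count`) and the sample lines are illustrative assumptions.

```python
from collections import defaultdict


def map_phase(partition):
    """Emit a (word, 1) pair for every word in one partition of lines."""
    return [(word.lower(), 1) for line in partition for word in line.split()]


def shuffle(mapped_partitions):
    """Group values by key across all partitions.

    On a real cluster this step moves data over the network so that all
    values for a given key end up on the same machine.
    """
    groups = defaultdict(list)
    for pairs in mapped_partitions:
        for key, value in pairs:
            groups[key].append(value)
    return groups


def reduce_phase(groups):
    """Combine the values of each key into a single count."""
    return {key: sum(values) for key, values in groups.items()}


def word_count(partitions):
    # Each partition is mapped independently, so on a cluster these calls
    # could run in parallel, and a failed one could be rerun on its own.
    mapped = [map_phase(partition) for partition in partitions]
    return reduce_phase(shuffle(mapped))


# Hypothetical input: two partitions, as if stored on two machines.
partitions = [
    ["spark makes big data simple", "big data needs many machines"],
    ["machines fail", "spark recovers"],
]
counts = word_count(partitions)
```

Because the map step only reads its own partition, losing one machine means recomputing that partition's map output rather than the whole job. This is a simplified view of the failure handling the lecture describes.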

Official source

https://mediaspace.epfl.ch/media/0_ciij1na9
