Apache Hadoop
Related lectures (29)
Fault Tolerance and Recovery: Data Safety in Distributed Computing
Explores fault tolerance, data safety, and job recovery in distributed computing systems.
Data Wrangling with Hadoop: Storage Formats and Hive
Explores data wrangling with Hadoop, emphasizing storage formats and Hive for big data processing.
Spark Storage Layer
Explores the Spark ecosystem, Resilient Distributed Datasets, and the storage layer abstraction in Spark.
Big Data Best Practices and Guidelines
Covers best practices and guidelines for big data, including data lakes, architecture, challenges, and technologies like Hadoop and Hive.
Distributed Computing Execution Models: Spark Ecosystem
Explores the Spark ecosystem in distributed computing and critiques the limitations of MapReduce.
Big Data Challenges: Distributed Computing with Spark
Explores big data challenges, distributed computing with Spark, RDDs, hardware requirements, MapReduce, transformations, and Spark DataFrames.
Data-Parallel Programming: Vector & SIMD Processors
Explores data-parallel programming with vector processors and SIMD, and introduces MapReduce, Pregel, and TensorFlow.
Scaling up: Spark and Big Data
Explores the challenges of big data processing and introduces Spark as a solution.
Data Wrangling with Hadoop: Advanced Techniques
Covers advanced data wrangling techniques using Hadoop, focusing on Hive and HBase integration.
MapReduce: Execution Models for Distributed Computing
Introduces the MapReduce programming model for distributed computing, focusing on its vision and under-the-hood mechanisms.