Skip to main content
Graph
Search
fr
|
en
Login
Search
All
Categories
Concepts
Courses
Lectures
MOOCs
People
Practice
Publications
Startups
Units
Show all results for
Home
Lecture
Big Data Challenges: Scaling to Massive Data
Graph Chatbot
Related lectures (31)
Previous
Page 1 of 4
Next
Data Wrangling with Hive: Managing Big Data Efficiently
Covers data wrangling techniques using Apache Hive for efficient big data management.
Big Data Best Practices and Guidelines
Covers best practices and guidelines for big data, including data lakes, architecture, challenges, and technologies like Hadoop and Hive.
Scaling up: Spark and Big Data
Explores the challenges of big data processing and introduces Spark as a solution.
General Introduction to Big Data
Covers data science tools, Hadoop, Spark, data lake ecosystems, CAP theorem, batch vs. stream processing, HDFS, Hive, Parquet, ORC, and MapReduce architecture.
Introduction to Spark Runtime Architecture
Introduces Apache Spark, covering its architecture, RDDs, transformations, actions, fault tolerance, deployment options, and practical exercises in Jupyter notebooks.
Integrating Scalable Data Storage and Map Reduce Processing with Hadoop
Covers the integration of scalable data storage and map reduce processing using Hadoop, including HDFS, Hive, Parquet, ORC, Spark, and HBase.
Introduction to Spark Runtime Architecture
Covers the Spark runtime architecture, including RDDs, transformations, actions, and caching for performance optimization.
Data Analysis to AI and ML, Social Media
Explores the evolution from data analysis to AI and ML, emphasizing big data, machine learning, and social media interaction.
Big Data Challenges: Distributed Computing with Spark
Explores big data challenges, distributed computing with Spark, RDDs, hardware requirements, MapReduce, transformations, and Spark DataFrames.
Data Wrangling with Hadoop: Advanced Techniques
Covers advanced data wrangling techniques using Hadoop, focusing on Hive and HBase integration.