Skip to main content
Graph
Search
fr
en
Login
Search
All
Categories
Concepts
Courses
Lectures
MOOCs
People
Practice
Publications
Startups
Units
Show all results for
Home
Lecture
Integrating Scalable Data Storage and Map Reduce Processing with Hadoop
Graph Chatbot
Related lectures (32)
Previous
Page 1 of 4
Next
Data Wrangling with Hive: Managing Big Data Efficiently
Covers data wrangling techniques using Apache Hive for efficient big data management.
Data Wrangling with Hadoop
Covers data wrangling techniques using Hadoop, focusing on row versus column-oriented databases, popular storage formats, and HBase-Hive integration.
Big Data Best Practices and Guidelines
Covers best practices and guidelines for big data, including data lakes, architecture, challenges, and technologies like Hadoop and Hive.
General Introduction to Big Data
Covers data science tools, Hadoop, Spark, data lake ecosystems, CAP theorem, batch vs. stream processing, HDFS, Hive, Parquet, ORC, and MapReduce architecture.
Data Wrangling Techniques: HBase and Hive Integration
Covers data wrangling techniques using HBase and Hive, focusing on integration and practical applications.
Big Data Challenges: Scaling to Massive Data
Explores challenges of handling massive data in the era of big data, discussing solutions like MapReduce and Spark.
Data Wrangling with Hadoop: Advanced Techniques
Covers advanced data wrangling techniques using Hadoop, focusing on Hive and HBase integration.
Big Data Ecosystems: Technologies and Challenges
Covers the fundamentals of big data ecosystems, focusing on technologies, challenges, and practical exercises with Hadoop's HDFS.
Big Data: Best Practices and Guidelines
Covers best practices and guidelines for big data, including data lakes, typical architecture, challenges, and technologies used to address them.
Introduction to Spark Runtime Architecture
Introduces Apache Spark, covering its architecture, RDDs, transformations, actions, fault tolerance, deployment options, and practical exercises in Jupyter notebooks.