Exam Instructions: Types of Questions and Code Grading

About
Privacy
Disclaimer

Graph Chatbot

Related lectures (32)

Page 3 of 4

Data Wrangling with Hive: Managing Big Data Efficiently

Covers data wrangling techniques using Apache Hive for efficient big data management.

The Power of Registers

Covers wait-free implementations of atomic objects, focusing on counters and snapshots, discussing key ideas for enforcing atomicity and wait-freedom.

General Introduction to Big Data

Covers data science tools, Hadoop, Spark, data lake ecosystems, CAP theorem, batch vs. stream processing, HDFS, Hive, Parquet, ORC, and MapReduce architecture.

Concurrency Control & Recovery in Databases

Delves into transaction management, concurrency control, and recovery in databases to ensure data integrity and system resilience.

Data Wrangling with Hadoop: Storage Formats and Hive

Explores data wrangling with Hadoop, emphasizing storage formats and Hive for big data processing.

Data Warehousing and Decision Support

Explores data warehousing, decision support systems, and the importance of statistics in data analysis.

General-Purpose Distributed Execution System

Explores the design of a general-purpose distributed execution system, covering challenges, specialized frameworks, decentralized control logic, and high-performance shuffle.

Database Management Systems: Overview

Covers database management systems principles, design, implementation, and storage options like flat CSV files.

Untitled

Big Data Best Practices and Guidelines

Covers best practices and guidelines for big data, including data lakes, architecture, challenges, and technologies like Hadoop and Hive.