Lecture

Data Ingestion Layer: SmartDataLake

Related lectures (34)
Data Warehousing: Overview and Challenges
Introduces data warehousing fundamentals, challenges, and the innovative concept of a 'lakehouse'.
Data Warehouses and Decision Support Systems
Explores data warehouses, decision support systems, OLAP, data lakes, multidimensional data models, and query optimizations.
Dose Management in Electron Microscopy
Explores the challenges and solutions for managing electron dose in microscopy, emphasizing the importance of accurate dose tracking and analysis.
Data Modeling: Concepts and Applications
Explores data modeling concepts, SQL implementations, and practical applications in handling missing data.
Data Wrangling with Hadoop
Covers data wrangling techniques using Hadoop, focusing on row versus column-oriented databases, popular storage formats, and HBase-Hive integration.
Big Data Best Practices and Guidelines
Covers best practices and guidelines for big data, including data lakes, architecture, challenges, and technologies like Hadoop and Hive.
Data formats and data wrangling with Hadoop
Explores Apache Hive for data warehousing, data formats, and partitioning, with practical exercises in querying and connecting to Hive.
Water Consumption in Geneva
Explores water consumption data in Geneva, including charts on consumption and losses, available datasets, and data processing phases.
Principal Component Analysis: Dimension Reduction
Covers Principal Component Analysis for dimension reduction in biological data, focusing on visualization and pattern identification.
Spark Data Frames
Covers Spark Data Frames (distributed collections of data organized into named columns) and the benefits of using them over RDDs.
