Publication

High-dimensional Data Cubes

Christoph Koch, Sachin Basil John
2022
Journal paper

Abstract

This paper introduces an approach to supporting high-dimensional data cubes at interactive query speeds and moderate storage cost. The approach is based on binary(-domain) data cubes that are judiciously partially materialized; the missing information can be quickly reconstructed using statistical or linear programming techniques. This enables new applications such as exploratory data analysis for feature engineering and other fields of data science. Moreover, it removes the need to compromise when building a data cube - all columns that we might ever wish to use can be included as dimensions. Our approach also speeds up certain dice, roll-up, and drill-down operations on data cubes with hierarchical dimensions compared to traditional data cubes.

Official source

https://infoscience.epfl.ch/record/303012?ln=en

About this result

This page is automatically generated and may contain information that is not correct, complete, up-to-date, or relevant to your search query. The same applies to every other page on this website. Please make sure to verify the information with EPFL's official sources.

Related concepts (32)

Related publications (32)

Related MOOCs (6)

Graph Chatbot

Chat with Graph Search

Ask any question about EPFL courses, lectures, exercises, research, news, etc. or try the example questions below.

DISCLAIMER: The Graph Chatbot is not programmed to provide explicit or categorical answers to your questions. Rather, it transforms your questions into API requests that are distributed across the various IT services officially administered by EPFL. Its purpose is solely to collect and recommend relevant references to content that you can explore to help you answer your questions.

High-dimensional Data Cubes

Graph Chatbot

Chat with Graph Search

Quantifying the Unknown: Data-Driven Approaches and Applications in Energy Systems

Interactive-time Exploration, Querying, and Analysis of Large High-dimensional Datasets

Reinforcement Learning for Joint Design and Control of Battery-PV Systems

Quantifying the Unknown: Data-Driven Approaches and Applications in Energy Systems

Interactive-time Exploration, Querying, and Analysis of Large High-dimensional Datasets

Reinforcement Learning for Joint Design and Control of Battery-PV Systems