System and Method for Optimizing Data Storage in a Distributed Data Storage Environment
Related publications (53)
Data lakes are complex ecosystems where heterogeneity prevails. Raw data of diverse formats are stored and processed, while long and expensive ETL processes are avoided. Apart from data heterogeneity, data lakes also entail hardware heterogeneity. Typical ...
As the volume of produced data is exponentially increasing, companies tend to rely on distributed systems to meet the surging demand for storage capacity. With the business workflows becoming more and more complex, such systems often consist of or are acce ...
Amid a data revolution that is transforming industries around the globe, computing systems have undergone a paradigm shift where many applications are scaled out to run on multiple computers in a computing cluster. As the storage and processing capabilitie ...
Hydropower (HP) is the backbone of the Swiss electricity system providing around 60 % (36 TWh/a) of the total electricity generated on a yearly average. With the planned phase-out of nuclear power plants, HP and other Renewable Energy Sources (RES) will ne ...
Whether it be for environmental sensing or Internet of Things (IoT) applications, sensor networks are of growing use thanks to their large-scale sensing and distributed data storage abilities. However, when used in hazardous conditions and thus undergoing ...
Exa-scale simulations are on the horizon, but almost no new design for the output has been proposed in recent years. In simulations using individual time steps, the traditional snapshots are over-resolving particles/cells with large time steps and are under ...
ELSEVIER, 2022
Storage is an important domain of the energy sector, with its traditional, classical solutions for smaller and larger amounts of energy. Energy storage has grown in importance with the development of alternative energy sources, leading ...
UNIV NIS, 2021
In the current era of big data, aggregation queries on high-dimensional datasets are frequently utilized to uncover hidden patterns, trends, and correlations critical for effective business decision-making. Data cubes facilitate such queries by employing p ...
EPFL, 2023
Data-intensive systems are the backbone of today's computing and are responsible for shaping data centers. Over the years, cloud providers have relied on three principles to maintain cost-effective data systems: use disaggregation to decouple scaling, use ...
With the increasing dominance of SSDs for local storage, today's network mounted virtual disks can no longer offer competitive performance. We propose a Log-Structured Virtual Disk (LSVD) that couples log-structured approaches at both the cache and storage ...