Publication

Scaling Up Mixed Workloads: A Battle of Data Freshness, Flexibility, and Scheduling

Anastasia Ailamaki, Iraklis Psaroudakis
2015
Conference paper

Abstract

The common "one size does not fit all" paradigm isolates transactional and analytical workloads into separate, specialized database systems. Operational data is periodically replicated to a data warehouse for analytics. Competitiveness of enterprises today, however, depends on real-time reporting on operational data, necessitating an integration of transactional and analytical processing in a single database system. The mixed workload should be able to query and modify common data in a shared schema. The database needs to provide performance guarantees for transactional workloads, and, at the same time, efficiently evaluate complex analytical queries. In this paper, we share our analysis of the performance of two main-memory databases that support mixed workloads, SAP HANA and HyPer, while evaluating the mixed workload CHbenCHmark. By examining their similarities and differences, we identify the factors that affect performance while scaling the number of concurrent transactional and analytical clients. The three main factors are (a) data freshness, i.e., how recent is the data processed by analytical queries, (b) flexibility, i.e., restricting transactional features in order to increase optimization choices and enhance performance, and (c) scheduling, i.e., how the mixed workload utilizes resources. Specifically for scheduling, we show that the absence of workload management under cases of high con-currency leads to analytical workloads overwhelming the system and severely hurting the performance of transactional workloads.

Official source

https://infoscience.epfl.ch/record/212399?ln=en

About this result

This page is automatically generated and may contain information that is not correct, complete, up-to-date, or relevant to your search query. The same applies to every other page on this website. Please make sure to verify the information with EPFL's official sources.

Graph Chatbot

Chat with Graph Search

Ask any question about EPFL courses, lectures, exercises, research, news, etc. or try the example questions below.

DISCLAIMER: The Graph Chatbot is not programmed to provide explicit or categorical answers to your questions. Rather, it transforms your questions into API requests that are distributed across the various IT services officially administered by EPFL. Its purpose is solely to collect and recommend relevant references to content that you can explore to help you answer your questions.

Anastasia Ailamaki, Iraklis Psaroudakis
2015
Conference paper

Abstract

Official source

https://infoscience.epfl.ch/record/212399?ln=en

About this result

Ontological neighbourhood

Computer engineering

Databases: Relational databases

High-performance computing: Distributed computing

Related concepts (35)

Related publications (121)

Related MOOCs (5)

Scaling Up Mixed Workloads: A Battle of Data Freshness, Flexibility, and Scheduling

Graph Chatbot

Chat with Graph Search

Efficient Concurrent Analytical Query Processing using Data and Workload-conscious Sharing

Determining an optimum quantity of interleaved instruction streams of defined coroutines

Analytical Engines With Context-Rich Processing: Towards Efficient Next-Generation Analytics

Efficient Concurrent Analytical Query Processing using Data and Workload-conscious Sharing

Analytical Engines With Context-Rich Processing: Towards Efficient Next-Generation Analytics

Determining an optimum quantity of interleaved instruction streams of defined coroutines