Energy Proportionality and Workload Consolidation for Latency-Critical Applications

Edouard Bugnion, Christos Kozyrakis, Georgios Prekas, Mia Primorac
2015
Conference paper

Abstract

Energy proportionality and workload consolidation are important objectives towards increasing efficiency in large-scale datacenters. Our work focuses on achieving these goals in the presence of applications with microsecond-scale tail latency requirements. Such applications represent a growing subset of datacenter workloads and are typically deployed on dedicated servers, which is the simplest way to ensure low tail latency across all loads. Unfortunately, it also leads to low energy efficiency and low resource utilization during the frequent periods of medium or low load. We present the OS mechanisms and dynamic control needed to adjust core allocation and voltage/frequency settings based on the measured delays for latency-critical workloads. This allows for energy proportionality and frees the maximum amount of resources per server for other background applications, while respecting service-level objectives. The two key mechanism allow us to detect increases in queuing latencies and to re-assign flow groups between the threads of a latency-critical application in milliseconds without dropping or reordering packets. We compare the efficiency of our solution to the Pareto-optimal frontier of 224 distinct static configurations. Dynamic resource control saves 44%–54% of processor energy, which corresponds to 85%–93% of the Pareto-optimal upper bound. Dynamic resource control also allows background jobs to run at 32%–46% of their standalone throughput, which corresponds to 82%–92% of the Pareto bound.

Official source

https://infoscience.epfl.ch/record/210138?ln=en

About this result

This page is automatically generated and may contain information that is not correct, complete, up-to-date, or relevant to your search query. The same applies to every other page on this website. Please make sure to verify the information with EPFL's official sources.

Energy Proportionality and Workload Consolidation for Latency-Critical Applications

Graph Chatbot

Chat with Graph Search

Bayesian Optimization for Chemical Reactions

Active learning for multi-objective optimization of processes and energy systems

Large-scale traffic signal control and multimodal network design

Bayesian Optimization for Chemical Reactions

Active learning for multi-objective optimization of processes and energy systems

Large-scale traffic signal control and multimodal network design