Publication

ALLARM: Optimizing Sparse Directories for Thread-Local Data

Amitabha Roy
2014
Conference paper

Abstract

Large-scale cache-coherent systems often impose unnecessary overhead on data that is thread-private for the whole of its lifetime. These include resources devoted to tracking the coherence state of the data, as well as unnecessary coherence messages sent out over the interconnect. In this paper we show how the memory allocation strategy for non-uniform memory access (NUMA) systems can be exploited to remove any coherence-related traffic for thread-local data, as well removing the need to track those cache lines in sparse directories. Our strategy is to allocate directory state only on a miss from a node in a different affinity domain from the directory. We call this ALLocAte on Remote Miss, or ALLARM. Our solution is entirely backward compatible with existing operating systems and software, and provides a means to scale cache coherence into the many-core era. On a mix of SPLASH2 and Parsec workloads, ALLARM is able to improve performance by 13% on average while reducing dynamic energy consumption by 9% in the on-chip network and 15% in the directory controller. This is achieved through a 46% reduction in the number of sparse directory entries evicted.

Official source

https://infoscience.epfl.ch/record/192798?ln=en

About this result

This page is automatically generated and may contain information that is not correct, complete, up-to-date, or relevant to your search query. The same applies to every other page on this website. Please make sure to verify the information with EPFL's official sources.

Graph Chatbot

Chat with Graph Search

Ask any question about EPFL courses, lectures, exercises, research, news, etc. or try the example questions below.

DISCLAIMER: The Graph Chatbot is not programmed to provide explicit or categorical answers to your questions. Rather, it transforms your questions into API requests that are distributed across the various IT services officially administered by EPFL. Its purpose is solely to collect and recommend relevant references to content that you can explore to help you answer your questions.

ALLARM: Optimizing Sparse Directories for Thread-Local Data

Graph Chatbot

Chat with Graph Search

Rebooting Virtual Memory with Midgard

Rethinking Software Runtimes for Disaggregated Memory

Scalable Synchronization in Shared-Memory Systems: Extrapolating, Adapting, Tuning

Scalable Synchronization in Shared-Memory Systems: Extrapolating, Adapting, Tuning

Rethinking Software Runtimes for Disaggregated Memory

Rebooting Virtual Memory with Midgard