Reliability of clustered vs. declustered replica placement in data storage systems

Rüdiger Urbanke, Christina Fragouli
2011
Conference paper

Abstract

The placement of replicas across storage nodes in a replication-based storage system is known to affect rebuild times and therefore system reliability. Earlier work has shown that, for a replication factor of two, reliability is essentially unaffected by the replica placement scheme, because all placement schemes have mean times to data loss (MTTDLs) within a factor of two of one another for practical values of the failure rate, storage capacity, and rebuild bandwidth of a storage node. For higher replication factors, however, simulation results reveal that this no longer holds. Moreover, an analytical derivation of the MTTDL becomes intractable for general placement schemes. In this paper, we develop a theoretical model that is applicable to any replication factor and provides a good approximation of the MTTDL for small failure rates. The model characterizes system behavior using an analytically tractable measure of reliability: the probability of the shortest path to data loss following the first node failure. It is shown that, for highly reliable systems, this measure closely approximates the combined probability of all paths to data loss after the first node failure and prior to the completion of rebuild, and that it yields a rough estimate of the MTTDL. The results are of theoretical and practical importance and are confirmed by simulations. As our results show, the declustered placement scheme, contrary to intuition, offers, for replication factors greater than two, a reliability that does not decrease as the number of nodes in the system increases.
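The abstract does not define the two placement schemes, so the following is only an illustrative sketch under stated assumptions, not the paper's model. It assumes that clustered placement partitions the nodes into disjoint groups of r nodes that mirror the same data, and that declustered placement spreads replica sets over every r-node combination. The function names `clustered`, `declustered`, and `exposed_peers` are hypothetical. The sketch counts how many surviving nodes share data with the first failed node, that is, the nodes whose subsequent failures could lie on a shortest path to data loss (r - 1 further failures hitting the same replica set).

```python
from itertools import combinations


def clustered(n, r):
    """Assumed clustered placement: disjoint groups of r nodes mirror the same data."""
    return [tuple(range(i, i + r)) for i in range(0, n - n % r, r)]


def declustered(n, r):
    """Assumed declustered placement: every r-node combination holds some data."""
    return list(combinations(range(n), r))


def exposed_peers(placement, failed):
    """Nodes that share at least one replica set with the failed node."""
    peers = {node for group in placement if failed in group for node in group}
    return peers - {failed}


n, r = 12, 3  # hypothetical system size and replication factor
for name, scheme in (("clustered", clustered), ("declustered", declustered)):
    print(name, len(exposed_peers(scheme(n, r), failed=0)))
# prints:
# clustered 2
# declustered 11
```

Under these assumptions, a declustered system exposes far more nodes to the first failure, but it can also draw on all of them in parallel during rebuild; how these two effects trade off is what the paper's analysis addresses.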

Official source

https://infoscience.epfl.ch/record/174449?ln=en
