Publication

The Complexity of a Reliable Distributed System

Alexandre David Olivier Maurer
2017
Report or working paper
Abstract

Studying the complexity of distributed algorithms typically boils down to evaluating how the number of messages exchanged (resp. communication steps performed or shared memory operations executed) by nodes to reliably achieve some common task, evolves with the number nn of these nodes. But what about the complexity of building the distributed system itself? How does the number of physical network components (e.g., channels and intermediary nodes acting as routers), needed for building a system of nn nodes to ensure some global reliable connectivity property, evolves with nn? Addressing such a question lies at the heart of achieving the dream of \emph{elasticity} in so-called \emph{cloud computing}. In this paper, we show for the first time how to construct a distributed system of which any two of the nn nodes, for any nn, remain connected (i.e., able to communicate) with probability at least μ\mu, despite the very fact that (a) every other node or channel has an independent probability λ\lambda of failing, and (b) the number of channels connected to every node is physically bounded by a constant. We show however that if we also require any two of the nn nodes to maintain a balanced message throughput with a constant probability, then O(nlog1+ϵn)O(n \log^{1+\epsilon} n) additional intermediary nodes are necessary and sufficient, where ϵ\epsilon is an arbitrarily small constant. Our distributed system constructions, based on the composition of fractal and tree-like graphs, are not claimed to be simple and cost-effective enough to constitute the architectural blueprints of the next generation cloud data centers with millions of computers. Yet, they might constitute their theoretical backbone.

About this result
This page is automatically generated and may contain information that is not correct, complete, up-to-date, or relevant to your search query. The same applies to every other page on this website. Please make sure to verify the information with EPFL's official sources.