Are you an EPFL student looking for a semester project?
Work with us on data science and visualisation projects, and deploy your project as an app on top of Graph Search.
Studying the complexity of distributed algorithms typically boils down to evaluating how the number of messages exchanged (resp. communication steps performed or shared memory operations executed) by nodes to reliably achieve some common task, evolves with the number of these nodes. But what about the complexity of building the distributed system itself? How does the number of physical network components (e.g., channels and intermediary nodes acting as routers), needed for building a system of nodes to ensure some global reliable connectivity property, evolves with ? Addressing such a question lies at the heart of achieving the dream of \emph{elasticity} in so-called \emph{cloud computing}. In this paper, we show for the first time how to construct a distributed system of which any two of the nodes, for any , remain connected (i.e., able to communicate) with probability at least , despite the very fact that (a) every other node or channel has an independent probability of failing, and (b) the number of channels connected to every node is physically bounded by a constant. We show however that if we also require any two of the nodes to maintain a balanced message throughput with a constant probability, then additional intermediary nodes are necessary and sufficient, where is an arbitrarily small constant. Our distributed system constructions, based on the composition of fractal and tree-like graphs, are not claimed to be simple and cost-effective enough to constitute the architectural blueprints of the next generation cloud data centers with millions of computers. Yet, they might constitute their theoretical backbone.
Boi Faltings, Sujit Prakash Gujar, Dimitrios Chatzopoulos, Anurag Jain
Raphaël Gérard Théodore Michel Marie de Deloÿe et Fourcade de Fondeville
Rachid Guerraoui, Alexandre David Olivier Maurer