D-Cliques: Compensating for Data Heterogeneity with Topology in Decentralized Federated Learning

The convergence speed of machine learning models trained with Federated Learning is significantly affected by non-independent and identically distributed (non-IID) data partitions, even more so in a fully decentralized setting without a central server. In this paper, we show that the impact of local class bias, an important type of data non-IIDness, can be significantly reduced by carefully designing the underlying communication topology. We present D-Cliques, a novel topology that reduces gradient bias by grouping nodes in interconnected cliques such that the local joint distribution in a clique is representative of the global class distribution. We also show how to adapt the updates of decentralized SGD to obtain unbiased gradients and implement an effective momentum with D-Cliques. Our empirical evaluation on MNIST and CIFAR10 demonstrates that our approach provides similar convergence speed as a fully-connected topology with a significant reduction in the number of edges and messages. In a 1000-node topology, D-Cliques requires 98% less edges and 96% less total messages, with further possible gains using a small-world topology across cliques.

D-Cliques: Compensating for Data Heterogeneity with Topology in Decentralized Federated Learning

Graph Chatbot

Chattez avec Graph Search

Partial discharge localization in power transformer tanks using machine learning methods

Random matrix methods for high-dimensional machine learning models

Topics in statistical physics of high-dimensional machine learning

Partial discharge localization in power transformer tanks using machine learning methods

Random matrix methods for high-dimensional machine learning models

Topics in statistical physics of high-dimensional machine learning