D-Cliques: Compensating for Data Heterogeneity with Topology in Decentralized Federated Learning

被引:10
|
作者
Bellet, Aurelien [1 ]
Kermarrec, Anne-Marie [2 ]
Lavoie, Erick [2 ]
机构
[1] INRIA, Lille, France
[2] Ecole Polytech Fed Lausanne, Lausanne, Switzerland
关键词
Decentralized Learning; Federated Learning; Topology; Heterogeneous Data; Stochastic Gradient Descent;
D O I
10.1109/SRDS55811.2022.00011
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
The convergence speed of machine learning models trained with Federated Learning is significantly affected by heterogeneous data partitions, even more so in a fully decentralized setting without a central server. In this paper, we show that the impact of label distribution skew, an important type of data heterogeneity, can be significantly reduced by carefully designing the underlying communication topology. We present D-Cliques, a novel topology that reduces gradient bias by grouping nodes in sparsely interconnected cliques such that the label distribution in a clique is representative of the global label distribution. We also show how to adapt the updates of decentralized SGD to obtain unbiased gradients and implement an effective momentum with D-Cliques. Our extensive empirical evaluation on MNIST and CIFAR10 validates our design and demonstrates that our approach achieves similar convergence speed as a fully-connected topology, while providing a significant reduction in the number of edges and messages. In a 1000-node topology, D-Cliques require 98% less edges and 96% less total messages, with further possible gains using a small-world topology across cliques.
引用
收藏
页码:1 / 11
页数:11
相关论文
共 50 条
  • [21] Efficient Ring-Topology Decentralized Federated Learning with Deep Generative Models for Medical Data in eHealthcare Systems
    Wang, Zhao
    Hu, Yifan
    Yan, Shiyang
    Wang, Zhihao
    Hou, Ruijie
    Wu, Chao
    ELECTRONICS, 2022, 11 (10)
  • [22] Federated Learning: Collaborative Machine Learning Across Decentralized Data Sources
    Ramirez, Carlos
    Martinez, Ana
    CINEFORUM, 2024, 65 (03): : 148 - 151
  • [23] Local Learning Matters: Rethinking Data Heterogeneity in Federated Learning
    Mendieta, Matias
    Yang, Taojiannan
    Wang, Pu
    Lee, Minwoo
    Ding, Zhengming
    Chen, Chen
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2022, : 8387 - 8396
  • [24] Optimizing Decentralized Learning with Local Heterogeneity using Topology Morphing and Clustering
    Abebe, Waqwoya
    Jannesari, Ali
    2023 IEEE/ACM 23RD INTERNATIONAL SYMPOSIUM ON CLUSTER, CLOUD AND INTERNET COMPUTING, CCGRID, 2023, : 355 - 366
  • [25] Large-Scale Decentralized Asynchronous Federated Edge Learning with Device Heterogeneity
    Liang, Xuan
    Tang, Jianhua
    Quek, Tony Q. S.
    ICC 2024 - IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS, 2024, : 4566 - 4571
  • [26] Decentralized Federated Learning with Data Feature Transmission and Neighbor Selection
    Lou, Wenxiao
    Xu, Yang
    Xu, Hongli
    Liao, Yunming
    2022 IEEE 28TH INTERNATIONAL CONFERENCE ON PARALLEL AND DISTRIBUTED SYSTEMS, ICPADS, 2022, : 688 - 695
  • [27] Federated Learning for Data and Model Heterogeneity in Medical Imaging
    Madni, Hussain Ahmad
    Umer, Rao Muhammad
    Foresti, Gian Luca
    IMAGE ANALYSIS AND PROCESSING - ICIAP 2023 WORKSHOPS, PT II, 2024, 14366 : 167 - 178
  • [28] Federated Learning with complete service commitment of data heterogeneity
    Zhou, Yizhi
    Wang, Junxiao
    Qin, Yuchen
    Kong, Xiangyu
    Xie, Xin
    Qi, Heng
    Zeng, Deze
    KNOWLEDGE-BASED SYSTEMS, 2025, 310
  • [29] Tackling Data Heterogeneity in Federated Learning with Class Prototypes
    Dai, Yutong
    Chen, Zeyuan
    Li, Junnan
    Heinecke, Shelby
    Sun, Lichao
    Xu, Ran
    THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 6, 2023, : 7314 - 7322
  • [30] Towards Taming the Resource and Data Heterogeneity in Federated Learning
    Chai, Zheng
    Fayyaz, Hannan
    Fayyaz, Zeshan
    Anwar, Ali
    Zhou, Yi
    Baracaldo, Nathalie
    Ludwig, Heiko
    Cheng, Yue
    PROCEEDINGS OF THE 2019 USENIX CONFERENCE ON OPERATIONAL MACHINE LEARNING, 2019, : 19 - 21