How Asynchronous can Federated Learning Be?

被引:10
|
作者
Su, Ningxin [1 ]
Li, Baochun [1 ]
机构
[1] Univ Toronto, Dept Elect & Comp Engn, Toronto, ON, Canada
关键词
D O I
10.1109/IWQoS54832.2022.9812885
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
As a practical paradigm designed to involve large numbers of edge devices in distributed training of deep learning models, federated learning has witnessed a significant amount of research attention in the recent years. Yet, most existing mechanisms on federated learning assumed either fully synchronous or asynchronous communication strategies between clients and the federated learning server. Existing designs that were partially asynchronous in their communication were simple heuristics, and were evaluated using the number of communication rounds or updates required for convergence, rather than the wall-clock time in practice. In this paper, we seek to explore the entire design space between fully synchronous and asynchronous mechanisms of communication. Based on insights from our exploration, we propose PORT, a new partially asynchronous mechanism designed to allow fast clients to aggregate asynchronously, yet without waiting excessively for the slower ones. In addition, PORT is designed to adjust the aggregation weights based on both the staleness and divergence of model updates, with provable convergence guarantees. We have implemented PORT and its leading competitors in PLATO, an open-source scalable federated learning research framework designed from the ground up to emulate real-world scenarios. With respect to the wall-clock time it takes for converging to the target accuracy, PORT outperformed its closest competitor, FedBuff, by up to 40% in our experiments.
引用
收藏
页数:11
相关论文
共 50 条
  • [41] Multimodal Fusion With Block Term Decomposition for Asynchronous Federated Learning
    Gao, Min
    Zheng, Haifeng
    Du, Mengxuan
    Feng, Xinxin
    IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2024, 20 (12) : 14083 - 14093
  • [42] FedEem: a fairness-based asynchronous federated learning mechanism
    Gu, Wei
    Zhang, Yifan
    JOURNAL OF CLOUD COMPUTING-ADVANCES SYSTEMS AND APPLICATIONS, 2023, 12 (01):
  • [43] BAFL: A Blockchain-Based Asynchronous Federated Learning Framework
    Feng, Lei
    Zhao, Yiqi
    Guo, Shaoyong
    Qiu, Xuesong
    Li, Wenjing
    Yu, Peng
    IEEE TRANSACTIONS ON COMPUTERS, 2022, 71 (05) : 1092 - 1103
  • [44] Asynchronous Federated Learning Framework Based on Dynamic Selective Transmission
    Zhang, Ruizhuo
    Luo, Wenjian
    Luo, Yongkang
    Xue, Shaocong
    ADVANCES IN SWARM INTELLIGENCE, ICSI 2023, PT II, 2023, 13969 : 193 - 203
  • [45] Asynchronous Federated Learning via Over-the-air Computation
    Zheng, Zijian
    Deng, Yansha
    Liu, Xiaonan
    Nallanathan, Arumugam
    IEEE CONFERENCE ON GLOBAL COMMUNICATIONS, GLOBECOM, 2023, : 1345 - 1350
  • [46] An Asynchronous Federated Learning Optimization Scheme Based on Model Partition
    Xu, Jing
    Shi, Lei
    Shi, Yi
    Fang, Chen
    Xu, Juan
    WIRELESS ALGORITHMS, SYSTEMS, AND APPLICATIONS, PT III, 2022, 13473 : 367 - 379
  • [47] FedDGIC: Reliable and Efficient Asynchronous Federated Learning with Gradient Compensation
    Xie, Zaipeng
    Jiang, Junchen
    Chen, Ruifeng
    Qu, Zhihao
    Liu, Hanxiang
    2022 IEEE 28TH INTERNATIONAL CONFERENCE ON PARALLEL AND DISTRIBUTED SYSTEMS, ICPADS, 2022, : 98 - 105
  • [48] Time Efficient Federated Learning with Semi-asynchronous Communication
    Hao, Jiangshan
    Zhao, Yanchao
    Zhang, Jiale
    2020 IEEE 26TH INTERNATIONAL CONFERENCE ON PARALLEL AND DISTRIBUTED SYSTEMS (ICPADS), 2020, : 156 - 163
  • [49] Secure Bilevel Asynchronous Vertical Federated Learning with Backward Updating
    Zhang, Qingsong
    Gu, Bin
    Deng, Cheng
    Huang, Heng
    THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2021, 35 : 10896 - 10904
  • [50] FedEem: a fairness-based asynchronous federated learning mechanism
    Wei Gu
    Yifan Zhang
    Journal of Cloud Computing, 12