Federated learning with workload-aware client scheduling in heterogeneous systems

被引:13
|
作者
Li, Li [1 ]
Liu, Duo [1 ]
Duan, Moming [1 ]
Zhang, Yu [1 ]
Ren, Ao [1 ]
Chen, Xianzhang [1 ]
Tan, Yujuan [1 ]
Wang, Chengliang [1 ]
机构
[1] Chongqing Univ, Coll Comp Sci, Chongqing, Peoples R China
基金
中国国家自然科学基金;
关键词
Federated learning; Distributed machine learning; Heterogeneous systems; Neural Networks;
D O I
10.1016/j.neunet.2022.07.030
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Federated Learning (FL) is a novel distributed machine learning, which allows thousands of edge devices to train models locally without uploading data to the central server. Since devices in real federated settings are resource-constrained, FL encounters systems heterogeneity, which causes considerable stragglers and incurs significant accuracy degradation. To tackle the challenges of systems heterogeneity and improve the robustness of the global model, we propose a novel adaptive federated framework in this paper. Specifically, we propose FedSAE that leverages the workload completion history of clients to adaptively predict the affordable training workload for each device. Consequently, FedSAE can significantly reduce stragglers in highly heterogeneous systems. We incorporate Active Learning into FedSAE to dynamically schedule participants. The server evaluates the devices' training value based on their training loss in each round, and larger-value clients are selected with a higher probability. As a result, the model convergence is accelerated. Furthermore, we propose q-FedSAE that combines FedSAE and q-FFL to improve global fairness in highly heterogeneous systems. The evaluations conducted in a highly heterogeneous system demonstrate that both FedSAE and q-FedSAE converge faster than FedAvg. In particular, FedSAE outperforms FedAvg across multiple federated datasets - FedSAE improves testing accuracy by 22.19% and reduces stragglers by 90.69% on average. Moreover, holding the same accuracy as FedSAE, q-FedSAE allows for more robust convergence and fairer model performance than q-FedAvg, FedSAE.(c) 2022 Elsevier Ltd. All rights reserved.
引用
收藏
页码:560 / 573
页数:14
相关论文
共 50 条
  • [1] Workload-Aware Scheduling for Data Analytics upon Heterogeneous Storage
    Qian, Zhuzhong
    Gao, Yuan
    Ji, Mingtao
    Peng, Hui
    Chen, Peng
    Jin, Yibo
    Lu, Sanglu
    2019 IEEE INTL CONF ON PARALLEL & DISTRIBUTED PROCESSING WITH APPLICATIONS, BIG DATA & CLOUD COMPUTING, SUSTAINABLE COMPUTING & COMMUNICATIONS, SOCIAL COMPUTING & NETWORKING (ISPA/BDCLOUD/SOCIALCOM/SUSTAINCOM 2019), 2019, : 580 - 587
  • [2] Federated workload-aware quantized framework for secure learning in data-sensitive applications
    Narula, Manu
    Meena, Jasraj
    Vishwakarma, Dinesh Kumar
    FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2025, 168
  • [3] A Workload-aware Resources Scheduling Method for Virtual Machine
    Qu, Hongshan
    Liu, Xiaodong
    Xu, Huating
    INTERNATIONAL JOURNAL OF GRID AND DISTRIBUTED COMPUTING, 2015, 8 (01): : 247 - 258
  • [4] Workload-Aware Harmonic Partitioned Scheduling for Probabilistic Real-Time Systems
    Ren, Jiankang
    Bi, Ran
    Su, Xiaoyan
    Liu, Qian
    Wu, Guowei
    Tan, Guozhen
    PROCEEDINGS OF THE 2018 DESIGN, AUTOMATION & TEST IN EUROPE CONFERENCE & EXHIBITION (DATE), 2018, : 213 - 218
  • [5] Dynamic workload-aware DVFS for multicore systems using machine learning
    Manjari Gupta
    Lava Bhargava
    S. Indu
    Computing, 2021, 103 : 1747 - 1769
  • [6] CloudMap: Workload-aware Placement in Private Heterogeneous Clouds
    Viswanathan, Balaji
    Verma, Akshat
    Dutta, Sourav
    2012 IEEE NETWORK OPERATIONS AND MANAGEMENT SYMPOSIUM (NOMS), 2012, : 9 - 16
  • [7] A new scheduler strategy for heterogeneous workload-aware in hadoop
    Wang, Zhe
    Zhu, Zhengdong
    Zheng, Pengfei
    Liu, Qiang
    Dong, Xiaoshe
    2013 8TH CHINAGRID ANNUAL CONFERENCE (CHINAGRID), 2013, : 80 - 85
  • [8] Dynamic workload-aware DVFS for multicore systems using machine learning
    Gupta, Manjari
    Bhargava, Lava
    Indu, S.
    COMPUTING, 2021, 103 (08) : 1747 - 1769
  • [9] Workload-aware Power Management of Cluster Systems
    Liu, Zhuo
    Liang, Aihua
    Xiao, Limin
    Ruan, Li
    PROCEEDINGS OF THE NINTH INTERNATIONAL SYMPOSIUM ON DISTRIBUTED COMPUTING AND APPLICATIONS TO BUSINESS, ENGINEERING AND SCIENCE (DCABES 2010), 2010, : 603 - 608
  • [10] WISE: Workload-Aware Partitioning for RDF Systems
    Guo, Xintong
    Gao, Hong
    Zou, Zhaonian
    BIG DATA RESEARCH, 2020, 22