Multicenter Hierarchical Federated Learning With Fault-Tolerance Mechanisms for Resilient Edge Computing Networks

被引:4
|
作者
Chen, Xiaohong [1 ]
Xu, Guanying [1 ]
Xu, Xuesong [2 ]
Jiang, Haichong [3 ]
Tian, Zhiping [3 ]
Ma, Tao [4 ]
机构
[1] Cent South Univ, Xiang Jiang Lab, Business Sch, Changsha 410083, Peoples R China
[2] Hunan Univ Technol & Business, Changsha Social Lab Artificial Intelligence, Changsha 410205, Peoples R China
[3] Hunan Univ Technol & Business, Sch Adv Interdisciplinary Studies, Changsha 410205, Peoples R China
[4] Hope Innovat Co Ltd, Changsha 410205, Peoples R China
基金
中国国家自然科学基金;
关键词
Servers; Training; Computational modeling; Computer architecture; Fault tolerant systems; Fault tolerance; Real-time systems; federated learning (FL); hierarchical FL (HFL); multicenter; STOCHASTIC GRADIENT DESCENT; RESOURCE-ALLOCATION; INTELLIGENCE;
D O I
10.1109/TNNLS.2024.3362974
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In the realm of federated learning (FL), the conventional dual-layered architecture, comprising a central parameter server and peripheral devices, often encounters challenges due to its significant reliance on the central server for communication and security. This dependence becomes particularly problematic in scenarios involving potential malfunctions of devices and servers. While existing device-edge-cloud hierarchical FL (HFL) models alleviate some dependence on central servers and reduce communication overheads, they primarily focus on load balancing within edge computing networks and fall short of achieving complete decentralization and edge-centric model aggregation. Addressing these limitations, we introduce the multicenter HFL (MCHFL) framework. This innovative framework replaces the traditional single central server architecture with a distributed network of robust global aggregation centers located at the edge, inherently enhancing fault tolerance crucial for maintaining operational integrity amidst edge network disruptions. Our comprehensive experiments with the MNIST, FashionMNIST, and CIFAR-10 datasets demonstrate the MCHFL's superior performance. Notably, even under high paralysis ratios of up to 50%, the MCHFL maintains high accuracy levels, with maximum accuracy reductions of only 2.60%, 5.12%, and 16.73% on these datasets, respectively. This performance significantly surpasses the notable accuracy declines observed in traditional single-center models under similar conditions. To the best of our knowledge, the MCHFL is the first edge multicenter FL framework with theoretical underpinnings. Our extensive experimental results across various datasets validate the MCHFL's effectiveness, showcasing its higher accuracy, faster convergence speed, and stronger robustness compared to single-center models, thereby establishing it as a pioneering paradigm in edge multicenter FL.
引用
收藏
页码:47 / 61
页数:15
相关论文
共 50 条
  • [1] Multicenter Hierarchical Federated Learning With Fault-Tolerance Mechanisms for Resilient Edge Computing Networks
    Chen, Xiaohong
    Xu, Guanying
    Xu, Xuesong
    Jiang, Haichong
    Tian, Zhiping
    Ma, Tao
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2025, 36 (01) : 47 - 61
  • [2] Engineering Adaptive Fault-Tolerance Mechanisms for Resilient Computing on ROS
    Lauer, Michael
    Amy, Matthieu
    Fabre, Jean-Charles
    Roy, Matthieu
    Excoffon, William
    Stoicescu, Miruna
    2016 IEEE 17TH INTERNATIONAL SYMPOSIUM ON HIGH ASSURANCE SYSTEMS ENGINEERING (HASE), 2016, : 94 - 101
  • [3] Fault-tolerance schemes for hierarchical mesh networks
    Zurawski, J
    Wang, DJ
    PDCAT 2005: Sixth International Conference on Parallel and Distributed Computing, Applications and Technologies, Proceedings, 2005, : 498 - 502
  • [4] Construction Schemes for Edge Fault-Tolerance of Ring Networks
    Hung, Chun-Nan
    Kung, Tzu-Liang
    Zhang, En-Cheng
    INNOVATIVE MOBILE AND INTERNET SERVICES IN UBIQUITOUS COMPUTING, IMIS-2018, 2019, 773 : 626 - 631
  • [5] Hierarchical Personalized Federated Learning Over Massive Mobile Edge Computing Networks
    You, Chaoqun
    Guo, Kun
    Yang, Howard H.
    Quek, Tony Q. S.
    IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2023, 22 (11) : 8141 - 8157
  • [6] AN ANALYSIS OF EDGE FAULT-TOLERANCE IN RECURSIVELY DECOMPOSABLE REGULAR NETWORKS
    LAGMAN, A
    NAJJAR, WA
    SRIMANI, PK
    IEEE TRANSACTIONS ON COMPUTERS, 1994, 43 (04) : 470 - 475
  • [7] A Fault-Tolerance Shim for Serverless Computing
    Sreekanti, Vikram
    Wu, Chenggang
    Chhatrapati, Saurav
    Gonzalez, Joseph E.
    Hellerstein, Joseph M.
    Faleiro, Jose M.
    PROCEEDINGS OF THE FIFTEENTH EUROPEAN CONFERENCE ON COMPUTER SYSTEMS (EUROSYS'20), 2020,
  • [8] Dynamic Edge Association in Hierarchical Federated Learning Networks
    Lim, Wei Yang Bryan
    Ng, Jer Shyuan
    Xiong, Zehui
    Garg, Sahil
    Zhang, Yang
    Niyato, Dusit
    Miao, Chunyan
    2021 IEEE 20TH INTERNATIONAL CONFERENCE ON TRUST, SECURITY AND PRIVACY IN COMPUTING AND COMMUNICATIONS (TRUSTCOM 2021), 2021, : 1124 - 1131
  • [9] Fault-Tolerance in the Scope of Cloud Computing
    Rehman, A. U.
    Aguiar, Rui L.
    Barraca, Joao Paulo
    IEEE ACCESS, 2022, 10 : 63422 - 63441
  • [10] Hierarchical Federated Learning with Edge Optimization in Constrained Networks
    Zhang, Xiaoyang
    Tham, Chen-Khong
    Wang, Wenyi
    2024 IEEE 99TH VEHICULAR TECHNOLOGY CONFERENCE, VTC2024-SPRING, 2024,