Three-Stage Stackelberg Game Enabled Clustered Federated Learning in Heterogeneous UAV Swarms

被引:21
|
作者
He, Wenji [1 ]
Yao, Haipeng [1 ]
Mai, Tianle [1 ]
Wang, Fu [2 ]
Guizani, Mohsen [3 ]
机构
[1] Beijing Univ Posts & Telecommun, State Key Lab Networking & Switching Technol, Beijing 100876, Peoples R China
[2] Beijing Univ Posts & Telecommun, Sch Elect & Engn, Beijing 100876, Peoples R China
[3] Mohamed Bin Zayed Univ Artificial Intelligence MB, Machine Learning Dept, Abu Dhabi, U Arab Emirates
基金
中国国家自然科学基金;
关键词
UAV swarms; clustered federated learning; Stackelberg game; multi-agent reinforcement learning; INTERNET; THINGS;
D O I
10.1109/TVT.2023.3246636
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
In the past decade, the unmanned aerial vehicles (UAVs) swarm has become a disruptive force reshaping our lives and work. In particular, advances in artificial intelligence have allowed multiple UAVs to coordinate their operations and work together to accomplish various complex tasks, one of which is Federated Learning (FL). As a promising distributed learning paradigm, FL can be adopted well with the limited resources and dynamic network topology of UAV swarms. However, the current FL's training process relies on homogeneous data paradigms, which require distributed UAVs to hold the same structure data. This ideal hypothesis can not apply to the heterogeneous UAV swarms. To tackle this problem, in this paper, we design a clustered federated learning (CFL) architecture, in which we cluster UAV swarms based on the similarities between the participants' optimization directions. Then, we formulate the model trading among model owners, cluster heads, and UAV workers as a three-stage Stackelberg game to optimize the allocation of the limited resources. We design a hierarchical reinforcement learning algorithm to search for the Stackelberg equilibrium under the clustered federated learning system. The performance evaluation demonstrates the uniqueness and stability of the proposed three-stage leader-follower game under the clustered framework, as well as the convergence and effectiveness of the reinforcement learning algorithm.
引用
收藏
页码:9366 / 9380
页数:15
相关论文
共 50 条
  • [41] Federated Learning for UAV Swarms Under Class Imbalance and Power Consumption Constraints
    Mrad, Ilyes
    Samara, Lutfi
    Abdellatif, Alaa Awad
    Al-Abbasi, Abubakr
    Hamila, Ridha
    Erbad, Aiman
    2021 IEEE GLOBAL COMMUNICATIONS CONFERENCE (GLOBECOM), 2021,
  • [42] TSFed: A three-stage optimization mechanism for secure and efficient federated learning in industrial IoT networks
    Putra, Made Adi Paramartha
    Karna, Nyoman Bogi Aditya
    Zainudin, Ahmad
    Kim, Dong-Seong
    Lee, Jae-Min
    INTERNET OF THINGS, 2024, 27
  • [43] Economic and Energy-Efficient Wireless Federated Learning Based on Stackelberg Game
    Zhao, Haitao
    Zhou, Mengying
    Xia, Wenchao
    Ni, Yiyang
    Gui, Guan
    Zhu, Hongbo
    IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2024, 73 (02) : 2995 - 2999
  • [44] Incentive mechanism design for Federated Learning with Stackelberg game perspective in the industrial scenario
    Guo, Wei
    Wang, Yijin
    Jiang, Pingyu
    COMPUTERS & INDUSTRIAL ENGINEERING, 2023, 184
  • [45] FLCAP: Federated Learning with Clustered Adaptive Pruning for Heterogeneous and Scalable Systems
    Miralles, Hugo
    Tosic, Tamara
    Riveill, Michel
    2023 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, IJCNN, 2023,
  • [46] Clustered Federated Learning Based on Momentum Gradient Descent for Heterogeneous Data
    Zhao, Xiaoyi
    Xie, Ping
    Xing, Ling
    Zhang, Gaoyuan
    Ma, Huahong
    ELECTRONICS, 2023, 12 (09)
  • [47] Relay Selection for Three-Stage Relaying Scheme in Clustered Wireless Networks
    Liu, Lingya
    Hua, Cunqing
    Chen, Cailian
    Guan, Xinping
    IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2015, 64 (06) : 2398 - 2408
  • [48] SoFL: Clustered Federated Learning Based on Dual Clustering for Heterogeneous Data
    Zhang, Jianfei
    Qiao, Zhiming
    ELECTRONICS, 2024, 13 (18)
  • [49] Three-Stage Stackelberg Long-Term Incentive Mechanism and Monetization for Mobile Crowdsensing: An Online Learning Approach
    Li, Youqi
    Li, Fan
    Yang, Song
    Zhou, Pan
    Zhu, Liehuang
    Wang, Yu
    IEEE TRANSACTIONS ON NETWORK SCIENCE AND ENGINEERING, 2021, 8 (02): : 1385 - 1398
  • [50] Jointly optimizing resource and heterogeneity in IoT networks using a Three-Stage Asynchronous Federated Reinforcement Learning
    Sagar, A. S. M. Sharifuzzaman
    Chen, Yu
    Rob, Md. Abdur
    Kim, Hyung Seok
    INTERNET OF THINGS, 2024, 27