The Gradient Convergence Bound of Federated Multi-Agent Reinforcement Learning With Efficient Communication

Cited by: 4
Authors
Xu, Xing [1]
Li, Rongpeng [1]
Zhao, Zhifeng [2, 3]
Zhang, Honggang [2, 3]
Affiliations
[1] Zhejiang Univ, Hangzhou 310027, Peoples R China
[2] Zhejiang Lab, Hangzhou 311121, Peoples R China
[3] Zhejiang Univ, Coll Informat Sci & Elect Engn, Hangzhou 310027, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Independent reinforcement learning; federated learning; consensus algorithm; communication overheads
DOI
10.1109/TWC.2023.3279268
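Chinese Library Classification (CLC) number
TM [Electrical engineering]; TN [Electronic technology, communication technology]
Discipline classification code
0808; 0809
Abstract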
This paper considers independent reinforcement learning (IRL) for multi-agent collaborative decision-making under the federated learning (FL) paradigm. FL, however, incurs excessive communication overhead between the agents and a remote central server, especially when the number of agents or iterations is large. Moreover, because the independent learning environments are heterogeneous, agents may undergo asynchronous Markov decision processes (MDPs), which affects the training samples and the model's convergence performance. Building on the variation-aware periodic averaging (VPA) method and a policy-based deep reinforcement learning (DRL) algorithm, namely proximal policy optimization (PPO), this paper proposes two optimization schemes oriented toward stochastic gradient descent (SGD): 1) a decay-based scheme that gradually decays the weights of a model's local gradients as successive local updates progress, and 2) a consensus-based scheme that represents the agents as a graph and studies, from an algebraic connectivity perspective, the impact of exchanging a model's local gradients among nearby agents. The paper also provides novel convergence guarantees for both schemes and demonstrates, through theoretical analysis and simulation results, their effectiveness and efficiency in improving the system's utility value.
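As a rough illustration of the two ideas named in the abstract, the sketch below shows (a) local SGD steps whose gradient weights shrink with each successive local update and (b) one gradient-exchange round over an agent graph, together with the graph's algebraic connectivity. The geometric decay factor `lam`, the mixing rule `I - eps * L`, and every function name here are assumptions made for illustration only; the abstract does not state the paper's actual weighting rule or consensus update.

```python
import numpy as np


def decayed_local_update(theta, grads, lr=0.1, lam=0.9):
    """Run successive local SGD steps, weighting the k-th gradient by lam**k.

    The geometric decay lam**k is an assumed form, not the paper's rule.
    """
    theta = np.asarray(theta, dtype=float).copy()
    for k, g in enumerate(grads):
        theta -= lr * (lam ** k) * np.asarray(g, dtype=float)
    return theta


def graph_laplacian(adj):
    """Return L = D - A for a symmetric adjacency matrix A."""
    adj = np.asarray(adj, dtype=float)
    return np.diag(adj.sum(axis=1)) - adj


def algebraic_connectivity(adj):
    """Second-smallest Laplacian eigenvalue (the Fiedler value)."""
    eigvals = np.linalg.eigvalsh(graph_laplacian(adj))  # ascending order
    return eigvals[1]


def consensus_step(local_grads, adj, eps=0.3):
    """One neighbor-averaging round: G <- (I - eps * L) G.

    Each row of local_grads is one agent's gradient. The mixing matrix
    I - eps * L is a standard consensus choice, assumed here; eps should
    be smaller than 1 / (maximum node degree) for the round to contract.
    """
    lap = graph_laplacian(adj)
    mix = np.eye(lap.shape[0]) - eps * lap
    return mix @ np.asarray(local_grads, dtype=float)


# Hypothetical three-agent path graph: agent 0 - agent 1 - agent 2.
adj = [[0, 1, 0],
       [1, 0, 1],
       [0, 1, 0]]
grads = np.array([[1.0], [2.0], [6.0]])
mixed = consensus_step(grads, adj)
```

In this sketch the consensus round leaves the average gradient across agents unchanged while pulling individual gradients toward it, and a larger algebraic connectivity means the agents' gradients agree after fewer rounds. This is the general reason connectivity matters in consensus-style analyses, not a restatement of the paper's bound.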
Pages: 507-528
Number of pages: 22