The Gradient Convergence Bound of Federated Multi-Agent Reinforcement Learning With Efficient Communication

被引:4
|
作者
Xu, Xing [1 ]
Li, Rongpeng [1 ]
Zhao, Zhifeng [2 ,3 ]
Zhang, Honggang [2 ,3 ]
机构
[1] Zhejiang Univ, Hangzhou 310027, Peoples R China
[2] Zhejiang Lab, Hangzhou 311121, Peoples R China
[3] Zhejiang Univ, Coll Informat Sci & Elect Engn, Hangzhou 310027, Peoples R China
基金
中国国家自然科学基金;
关键词
Independent reinforcement learning; federated learning; consensus algorithm; communication overheads;
D O I
10.1109/TWC.2023.3279268
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
The paper considers independent reinforcement learning (IRL) for multi-agent collaborative decision-making in the paradigm of federated learning (FL). However, FL generates excessive communication overheads between agents and a remote central server, especially when it involves a large number of agents or iterations. Besides, due to the heterogeneity of independent learning environments, multiple agents may undergo asynchronous Markov decision processes (MDPs), which will affect the training samples and the model's convergence performance. On top of the variation-aware periodic averaging (VPA) method and the policy-based deep reinforcement learning (DRL) algorithm (i.e., proximal policy optimization (PPO)), this paper proposes two advanced optimization schemes orienting to stochastic gradient descent (SGD): 1) A decay-based scheme gradually decays the weights of a model's local gradients with the progress of successive local updates, and 2) By representing the agents as a graph, a consensus-based scheme studies the impact of exchanging a model's local gradients among nearby agents from an algebraic connectivity perspective. This paper also provides novel convergence guarantees for both developed schemes, and demonstrates their superior effectiveness and efficiency in improving the system's utility value through theoretical analyses and simulation results.
引用
收藏
页码:507 / 528
页数:22
相关论文
共 50 条
  • [41] A Communication-Efficient Multi-Agent Actor-Critic Algorithm for Distributed Reinforcement Learning
    Lin, Yixuan
    Zhang, Kaiqing
    Yang, Zhuoran
    Wang, Zhaoran
    Basar, Tamer
    Sandhu, Romeil
    Liu, Ji
    2019 IEEE 58TH CONFERENCE ON DECISION AND CONTROL (CDC), 2019, : 5562 - 5567
  • [42] Efficient Adversarial Attacks on Online Multi-agent Reinforcement Learning
    Liu, Guanlin
    Lai, Lifeng
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
  • [43] Efficient Communications in Multi-Agent Reinforcement Learning for Mobile Applications
    Lv, Zefang
    Xiao, Liang
    Du, Yousong
    Zhu, Yunjun
    Han, Shuai
    Liu, Yong-Jin
    IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2024, 23 (09) : 12440 - 12454
  • [44] Strategically Efficient Exploration in Competitive Multi-agent Reinforcement Learning
    Loftin, Robert
    Saha, Aadirupa
    Devlin, Sam
    Hofmann, Katja
    UNCERTAINTY IN ARTIFICIAL INTELLIGENCE, VOL 161, 2021, 161 : 1587 - 1596
  • [45] A Sample Efficient Multi-Agent Approach to Continuous Reinforcement Learning
    Corcoran, Diarmuid
    Kreuger, Per
    Boman, Magnus
    2022 18TH INTERNATIONAL CONFERENCE ON NETWORK AND SERVICE MANAGEMENT (CNSM 2022): INTELLIGENT MANAGEMENT OF DISRUPTIVE NETWORK TECHNOLOGIES AND SERVICES, 2022, : 338 - 344
  • [46] Efficient Communications for Multi-Agent Reinforcement Learning in Wireless Networks
    Lv, Zefang
    Du, Yousong
    Chen, Yifan
    Xiao, Liang
    Han, Shuai
    Ji, Xiangyang
    IEEE CONFERENCE ON GLOBAL COMMUNICATIONS, GLOBECOM, 2023, : 583 - 588
  • [47] QSOD: Hybrid Policy Gradient for Deep Multi-agent Reinforcement Learning
    Rehman, Hafiz Muhammad Raza Ur
    On, Byung-Won
    Ningombam, Devarani Devi
    Yi, Sungwon
    Choi, Gyu Sang
    IEEE ACCESS, 2021, 9 : 129728 - 129741
  • [48] DACOM: Learning Delay-Aware Communication for Multi-Agent Reinforcement Learning
    Yuan, Tingting
    Chung, Hwei-Ming
    Yuan, Jie
    Fu, Xiaoming
    THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 10, 2023, : 11763 - 11771
  • [49] Competitive-Cooperative Multi-Agent Reinforcement Learning for Auction-based Federated Learning
    Tang, Xiaoli
    Yu, Han
    PROCEEDINGS OF THE THIRTY-SECOND INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2023, 2023, : 4262 - 4270
  • [50] Efficient exploration by switching agents according to degree of convergence of learning on Heterogeneous Multi-Agent Reinforcement Learning in Single Robot
    Narita, Riku
    Matsushima, Tatsufumi
    Kurashige, Kentarou
    2021 IEEE SYMPOSIUM SERIES ON COMPUTATIONAL INTELLIGENCE (IEEE SSCI 2021), 2021,