MULTI-MODEL FEDERATED LEARNING OPTIMIZATION BASED ON MULTI-AGENT REINFORCEMENT LEARNING

被引:0
|
作者
Atapour, S. Kaveh [1 ]
Seyedmohammadi, S. Jamal [2 ]
Sheikholeslami, S. Mohammad [3 ]
Abouei, Jamshid [4 ]
Mohammadi, Arash [2 ]
Plataniotis, Konstantinos N. [3 ]
机构
[1] Tarbiat Modares Univ, Dept Comp & Elect Engn, Tehran, Iran
[2] Concordia Inst Informat Syst Engn CIISE, Montreal, PQ, Canada
[3] Univ Toronto, Edward S Rogers Sr Dept Elect Comp Engn, Toronto, ON, Canada
[4] Yazd Univ, Dept Elect Engn, Yazd, Iran
关键词
Muti-Model Federated Learning; MDP; Reinforcement Learning; Team-Q algorithm; Cooperative Multi-Agents; MODEL;
D O I
10.1109/CAMSAP58249.2023.10403421
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
This paper addresses the problem of Multi-Model Federated Learning (MMFL) in a typical wireless network, where a cellular Base Station (BS) cooperates with multiple clients to simultaneously train several Machine Learning (ML) models. Accordingly, the objective of this paper is to make an efficient joint decision for client association and communication-computation resource allocation to optimize the performance of the MMFL algorithm. In this regard, an optimization problem is formulated to minimize the average global loss of ML models under clients' energy and delay constraints. It is shown that the problem is a mixed-integer optimization whose objective is implicit in terms of the decision variables. To solve the optimization problem, we propose a Multi-Agent Multi-Model Federated Learning (MAMMFL) scheme based on a cooperative multi-agent configuration to intelligently assign models and resources to clients. Specifically, the problem is first converted to a Markov Decision Process (MDP) problem, then it is divided into four sub-MDP problems, where each problem relates to a phase in MMFL. The reinforcement learning algorithm solves each subproblem, and a team-Q algorithm is adopted to coordinate agents in a cooperative multi-agent setting. Simulation results show that the proposed method can outperform other baselines in terms of average global loss and resource consumption.
引用
收藏
页码:151 / 155
页数:5
相关论文
共 50 条
  • [41] Function approximation based multi-agent reinforcement learning
    Abul, O
    Polat, F
    Alhajj, R
    12TH IEEE INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE, PROCEEDINGS, 2000, : 36 - 39
  • [42] Hierarchical Multi-Agent Training Based on Reinforcement Learning
    Wang, Guanghua
    Li, Wenjie
    Wu, Zhanghua
    Guo, Xian
    2024 9TH ASIA-PACIFIC CONFERENCE ON INTELLIGENT ROBOT SYSTEMS, ACIRS, 2024, : 11 - 18
  • [43] Multi-agent reinforcement learning based on local communication
    Wenxu Zhang
    Lei Ma
    Xiaonan Li
    Cluster Computing, 2019, 22 : 15357 - 15366
  • [44] Cooperative multi-agent game based on reinforcement learning
    Liu, Hongbo
    HIGH-CONFIDENCE COMPUTING, 2024, 4 (01):
  • [45] Survey of Multi-Agent Strategy Based on Reinforcement Learning
    Chen, Liang
    Guo, Ting
    Liu, Yun-ting
    Yang, Jia-ming
    PROCEEDINGS OF THE 32ND 2020 CHINESE CONTROL AND DECISION CONFERENCE (CCDC 2020), 2020, : 604 - 609
  • [46] Multi-RAT Access based on Multi-Agent Reinforcement Learning
    Yan, Mu
    Feng, Gang
    Qin, Shuang
    GLOBECOM 2017 - 2017 IEEE GLOBAL COMMUNICATIONS CONFERENCE, 2017,
  • [47] Multi-Agent Visualization for Explaining Federated Learning
    Wei, Xiguang
    Li, Quan
    Liu, Yang
    Yu, Han
    Chen, Tianjian
    Yang, Qiang
    PROCEEDINGS OF THE TWENTY-EIGHTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2019, : 6572 - 6574
  • [48] An Optimization Method for Collaborative Radar Antijamming Based on Multi-Agent Reinforcement Learning
    Feng, Cheng
    Fu, Xiongjun
    Wang, Ziyi
    Dong, Jian
    Zhao, Zhichun
    Pan, Teng
    REMOTE SENSING, 2023, 15 (11)
  • [49] Proximal Policy Optimization based Decentralized Networked Multi-Agent Reinforcement Learning
    Liu, Jinyi
    Li, Fangyu
    Wang, Jingjing
    Han, Honggui
    2024 IEEE 18TH INTERNATIONAL CONFERENCE ON CONTROL & AUTOMATION, ICCA 2024, 2024, : 839 - 844
  • [50] TEAM POLICY LEARNING FOR MULTI-AGENT REINFORCEMENT LEARNING
    Cassano, Lucas
    Alghunaim, Sulaiman A.
    Sayed, Ali H.
    2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 3062 - 3066