MULTI-MODEL FEDERATED LEARNING OPTIMIZATION BASED ON MULTI-AGENT REINFORCEMENT LEARNING

被引:0
|
作者
Atapour, S. Kaveh [1 ]
Seyedmohammadi, S. Jamal [2 ]
Sheikholeslami, S. Mohammad [3 ]
Abouei, Jamshid [4 ]
Mohammadi, Arash [2 ]
Plataniotis, Konstantinos N. [3 ]
机构
[1] Tarbiat Modares Univ, Dept Comp & Elect Engn, Tehran, Iran
[2] Concordia Inst Informat Syst Engn CIISE, Montreal, PQ, Canada
[3] Univ Toronto, Edward S Rogers Sr Dept Elect Comp Engn, Toronto, ON, Canada
[4] Yazd Univ, Dept Elect Engn, Yazd, Iran
关键词
Muti-Model Federated Learning; MDP; Reinforcement Learning; Team-Q algorithm; Cooperative Multi-Agents; MODEL;
D O I
10.1109/CAMSAP58249.2023.10403421
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
This paper addresses the problem of Multi-Model Federated Learning (MMFL) in a typical wireless network, where a cellular Base Station (BS) cooperates with multiple clients to simultaneously train several Machine Learning (ML) models. Accordingly, the objective of this paper is to make an efficient joint decision for client association and communication-computation resource allocation to optimize the performance of the MMFL algorithm. In this regard, an optimization problem is formulated to minimize the average global loss of ML models under clients' energy and delay constraints. It is shown that the problem is a mixed-integer optimization whose objective is implicit in terms of the decision variables. To solve the optimization problem, we propose a Multi-Agent Multi-Model Federated Learning (MAMMFL) scheme based on a cooperative multi-agent configuration to intelligently assign models and resources to clients. Specifically, the problem is first converted to a Markov Decision Process (MDP) problem, then it is divided into four sub-MDP problems, where each problem relates to a phase in MMFL. The reinforcement learning algorithm solves each subproblem, and a team-Q algorithm is adopted to coordinate agents in a cooperative multi-agent setting. Simulation results show that the proposed method can outperform other baselines in terms of average global loss and resource consumption.
引用
收藏
页码:151 / 155
页数:5
相关论文
共 50 条
  • [31] Partitioning in multi-agent reinforcement learning
    Sun, R
    Peterson, T
    FROM ANIMALS TO ANIMATS 6, 2000, : 325 - 332
  • [32] Multi-agent reinforcement learning: A survey
    Busoniu, Lucian
    Babuska, Robert
    De Schutter, Bart
    2006 9TH INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION, ROBOTICS AND VISION, VOLS 1- 5, 2006, : 1133 - +
  • [33] Reinforcement learning model based on regret for multi-agent conflict games
    Department of Computer and Information Technology, Fudan University, Shanghai 200433, China
    Ruan Jian Xue Bao, 2008, 11 (2957-2967):
  • [34] Multi-Agent Cognition Difference Reinforcement Learning for Multi-Agent Cooperation
    Wang, Huimu
    Qiu, Tenghai
    Liu, Zhen
    Pu, Zhiqiang
    Yi, Jianqiang
    Yuan, Wanmai
    2021 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2021,
  • [35] Multi-Agent Uncertainty Sharing for Cooperative Multi-Agent Reinforcement Learning
    Chen, Hao
    Yang, Guangkai
    Zhang, Junge
    Yin, Qiyue
    Huang, Kaiqi
    2022 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2022,
  • [36] Multi-Agent Reinforcement Learning With Distributed Targeted Multi-Agent Communication
    Xu, Chi
    Zhang, Hui
    Zhang, Ya
    2023 35TH CHINESE CONTROL AND DECISION CONFERENCE, CCDC, 2023, : 2915 - 2920
  • [37] Multi-agent reinforcement learning based on local communication
    Zhang, Wenxu
    Ma, Lei
    Li, Xiaonan
    CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS, 2019, 22 (Suppl 6): : 15357 - 15366
  • [38] Collaborative Optimization of Multi-microgrid System Based on Multi-agent Game and Reinforcement Learning
    Liu, Junfeng
    Wang, Xiaosheng
    Lu, Junbo
    Zeng, Jun
    Dianwang Jishu/Power System Technology, 2022, 46 (07): : 2722 - 2732
  • [39] Multi-agent Cooperative Search based on Reinforcement Learning
    Sun, Yinjiang
    Zhang, Rui
    Liang, Wenbao
    Xu, Cheng
    PROCEEDINGS OF 2020 3RD INTERNATIONAL CONFERENCE ON UNMANNED SYSTEMS (ICUS), 2020, : 891 - 896
  • [40] Multi-objective optimization of turbine blade profiles based on multi-agent reinforcement learning
    Li, Lele
    Zhang, Weihao
    Li, Ya
    Jiang, Chiju
    Wang, Yufan
    ENERGY CONVERSION AND MANAGEMENT, 2023, 297