Cooperative Multi-Agent Q-Learning Using Distributed MPC

被引:2
|
作者
Esfahani, Hossein Nejatbakhsh [1 ]
Velni, Javad Mohammadpour [1 ]
机构
[1] Clemson Univ, Dept Mech Engn, Clemson, SC 29634 USA
来源
基金
美国国家科学基金会;
关键词
Q-learning; Approximation algorithms; Couplings; Costs; Predictive control; Multi-agent systems; Linear programming; Multi-agent Q-Learning; distributed MPC; cooperative control;
D O I
10.1109/LCSYS.2024.3407632
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this letter, we propose a cooperative Multi-Agent Reinforcement Learning (MARL) approach based on Distributed Model Predictive Control (DMPC). In the proposed framework, the local MPC schemes are formulated based on the dual decomposition method in the context of DMPC and will be used to derive the local state (and action) value functions required in a cooperative Q-learning algorithm. We further show that the DMPC scheme can yield a framework based on the Value Function Decomposition (VFD) principle so that the global state (and action) value functions can be decomposed into several local state (and action) value functions captured from the local MPCs. In the proposed cooperative MARL, the coordination between individual agents is then achieved based on the multiplier-sharing step, a.k.a inter-agent negotiation in the DMPC scheme.
引用
收藏
页码:2193 / 2198
页数:6
相关论文
共 50 条
  • [31] Multi-Sensor Cooperative Tracking Using Distributed Nash Q-Learning
    Cai, Jia
    Huang, Changqiang
    Guo, Haifeng
    MANUFACTURING ENGINEERING AND AUTOMATION II, PTS 1-3, 2012, 591-593 : 1475 - 1478
  • [32] Cooperative Output Regulation By Q-learning For Discrete Multi-agent Systems In Finite-time
    Wei, Wenjun
    Tang, Jingyuan
    JOURNAL OF APPLIED SCIENCE AND ENGINEERING, 2022, 26 (06): : 853 - 864
  • [33] Cooperative Multi-Agent Systems Using Distributed Reinforcement Learning Techniques
    Zemzem, Wiem
    Tagina, Moncef
    KNOWLEDGE-BASED AND INTELLIGENT INFORMATION & ENGINEERING SYSTEMS (KES-2018), 2018, 126 : 517 - 526
  • [34] Using Fuzzy Logic and Q-Learning for Trust Modeling in Multi-agent Systems
    Aref, Abdullah
    Tran, Thomas
    FEDERATED CONFERENCE ON COMPUTER SCIENCE AND INFORMATION SYSTEMS, 2014, 2014, 2 : 59 - 66
  • [35] Q-Learning based Protection Scheme for Microgrid using Multi-Agent System
    Satuyeva, Botazhan
    Sultankulov, Bekbol
    Nunna, H. S. V. S. Kumar
    Kalakova, Aidana
    Doolla, Suryanarayana
    2019 2ND INTERNATIONAL CONFERENCE ON SMART ENERGY SYSTEMS AND TECHNOLOGIES (SEST 2019), 2019,
  • [36] Distributed learning and cooperative control for multi-agent systems
    Choi, Jongeun
    Oh, Songhwai
    Horowitz, Roberto
    AUTOMATICA, 2009, 45 (12) : 2802 - 2814
  • [37] Distributed Multi-Agent Deep Q-Learning for Load Balancing User Association in Dense Networks
    Lim, Byungju
    Vu, Mai
    IEEE WIRELESS COMMUNICATIONS LETTERS, 2023, 12 (07) : 1120 - 1124
  • [38] Multi-Agent Q-Learning with Joint State Value Approximation
    Chen Gang
    Cao Weihua
    Chen Xin
    Wu Min
    2011 30TH CHINESE CONTROL CONFERENCE (CCC), 2011, : 4878 - 4882
  • [39] Real-Valued Q-learning in Multi-agent Cooperation
    Hwang, Kao-Shing
    Lo, Chia-Yue
    Chen, Kim-Joan
    2009 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN AND CYBERNETICS (SMC 2009), VOLS 1-9, 2009, : 395 - 400
  • [40] Continuous strategy replicator dynamics for multi-agent Q-learning
    Galstyan, Aram
    AUTONOMOUS AGENTS AND MULTI-AGENT SYSTEMS, 2013, 26 (01) : 37 - 53