Cooperative Multi-Agent Q-Learning Using Distributed MPC

被引：2

作者：

Esfahani, Hossein Nejatbakhsh ^{[1
]}

Velni, Javad Mohammadpour ^{[1
]}

机构：

[1] Clemson Univ, Dept Mech Engn, Clemson, SC 29634 USA

来源：

IEEE CONTROL SYSTEMS LETTERS | 2024年 / 8卷

基金：

美国国家科学基金会;

关键词：

Q-learning; Approximation algorithms; Couplings; Costs; Predictive control; Multi-agent systems; Linear programming; Multi-agent Q-Learning; distributed MPC; cooperative control;

D O I：

10.1109/LCSYS.2024.3407632

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

In this letter, we propose a cooperative Multi-Agent Reinforcement Learning (MARL) approach based on Distributed Model Predictive Control (DMPC). In the proposed framework, the local MPC schemes are formulated based on the dual decomposition method in the context of DMPC and will be used to derive the local state (and action) value functions required in a cooperative Q-learning algorithm. We further show that the DMPC scheme can yield a framework based on the Value Function Decomposition (VFD) principle so that the global state (and action) value functions can be decomposed into several local state (and action) value functions captured from the local MPCs. In the proposed cooperative MARL, the coordination between individual agents is then achieved based on the multiplier-sharing step, a.k.a inter-agent negotiation in the DMPC scheme.

引用

页码：2193 / 2198

页数：6

共 50 条

[31] Multi-Sensor Cooperative Tracking Using Distributed Nash Q-Learning
Cai, Jia
Huang, Changqiang
Guo, Haifeng
MANUFACTURING ENGINEERING AND AUTOMATION II, PTS 1-3, 2012, 591-593 : 1475 - 1478
[32] Cooperative Output Regulation By Q-learning For Discrete Multi-agent Systems In Finite-time
Wei, Wenjun
Tang, Jingyuan
JOURNAL OF APPLIED SCIENCE AND ENGINEERING, 2022, 26 (06): : 853 - 864
[33] Cooperative Multi-Agent Systems Using Distributed Reinforcement Learning Techniques
Zemzem, Wiem
Tagina, Moncef
KNOWLEDGE-BASED AND INTELLIGENT INFORMATION & ENGINEERING SYSTEMS (KES-2018), 2018, 126 : 517 - 526
[34] Using Fuzzy Logic and Q-Learning for Trust Modeling in Multi-agent Systems
Aref, Abdullah
Tran, Thomas
FEDERATED CONFERENCE ON COMPUTER SCIENCE AND INFORMATION SYSTEMS, 2014, 2014, 2 : 59 - 66
[35] Q-Learning based Protection Scheme for Microgrid using Multi-Agent System
Satuyeva, Botazhan
Sultankulov, Bekbol
Nunna, H. S. V. S. Kumar
Kalakova, Aidana
Doolla, Suryanarayana
2019 2ND INTERNATIONAL CONFERENCE ON SMART ENERGY SYSTEMS AND TECHNOLOGIES (SEST 2019), 2019,
[36] Distributed learning and cooperative control for multi-agent systems
Choi, Jongeun
Oh, Songhwai
Horowitz, Roberto
AUTOMATICA, 2009, 45 (12) : 2802 - 2814
[37] Distributed Multi-Agent Deep Q-Learning for Load Balancing User Association in Dense Networks
Lim, Byungju
Vu, Mai
IEEE WIRELESS COMMUNICATIONS LETTERS, 2023, 12 (07) : 1120 - 1124
[38] Multi-Agent Q-Learning with Joint State Value Approximation
Chen Gang
Cao Weihua
Chen Xin
Wu Min
2011 30TH CHINESE CONTROL CONFERENCE (CCC), 2011, : 4878 - 4882
[39] Real-Valued Q-learning in Multi-agent Cooperation
Hwang, Kao-Shing
Lo, Chia-Yue
Chen, Kim-Joan
2009 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN AND CYBERNETICS (SMC 2009), VOLS 1-9, 2009, : 395 - 400
[40] Continuous strategy replicator dynamics for multi-agent Q-learning
Galstyan, Aram
AUTONOMOUS AGENTS AND MULTI-AGENT SYSTEMS, 2013, 26 (01) : 37 - 53

← 1 2 3 4 5 →