Deep Reinforcement Learning for Multiagent Systems: A Review of Challenges, Solutions, and Applications

被引：658

作者：

Nguyen, Thanh Thi ^{[1
]}

Nguyen, Ngoc Duy ^{[2
]}

Nahavandi, Saeid ^{[2
]}

机构：

[1] Deakin Univ, Sch Informat Technol, Burwood Campus, Burwood, Vic 3125, Australia

[2] Deakin Univ, Inst Intelligent Syst Res & Innovat, Waurn Ponds Campus, Waurn Ponds, Vic 3216, Australia

来源：

IEEE TRANSACTIONS ON CYBERNETICS | 2020年 / 50卷 / 09期

关键词：

Mathematical model; Robots; Dynamic programming; Games; Reinforcement learning; Deep learning; Observability; Continuous action space; deep learning; deep reinforcement learning (RL); multiagent; nonstationary; partial observability; review; robotics; survey; DYNAMICS; ROBOTS; GAMES;

D O I：

10.1109/TCYB.2020.2977374

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Reinforcement learning (RL) algorithms have been around for decades and employed to solve various sequential decision-making problems. These algorithms, however, have faced great challenges when dealing with high-dimensional environments. The recent development of deep learning has enabled RL methods to drive optimal policies for sophisticated and capable agents, which can perform efficiently in these challenging environments. This article addresses an important aspect of deep RL related to situations that require multiple agents to communicate and cooperate to solve complex tasks. A survey of different approaches to problems related to multiagent deep RL (MADRL) is presented, including nonstationarity, partial observability, continuous state and action spaces, multiagent training schemes, and multiagent transfer learning. The merits and demerits of the reviewed methods will be analyzed and discussed with their corresponding applications explored. It is envisaged that this review provides insights about various MADRL methods and can lead to the future development of more robust and highly useful multiagent learning methods for solving real-world problems.

引用

页码：3826 / 3839

页数：14

共 50 条

[21] Applications of deep reinforcement learning in nuclear energy: A review
Liu, Yongchao
Wang, Bo
Tan, Sichao
Li, Tong
Lv, Wei
Niu, Zhenfeng
Li, Jiangkuan
Gao, Puzhen
Tian, Ruifeng
NUCLEAR ENGINEERING AND DESIGN, 2024, 429
[22] A survey on transfer learning for multiagent reinforcement learning systems
Da Silva, Felipe Leno
Reali Costa, Anna Helena
Journal of Artificial Intelligence Research, 2019, 64 : 645 - 703
[23] A Survey on Transfer Learning for Multiagent Reinforcement Learning Systems
Da Silva, Felipe Leno
Reali Costa, Anna Helena
JOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH, 2019, 64 : 645 - 703
[24] Scheduling in Multiagent Systems Using Reinforcement Learning
Minashina, I. K.
Gorbachev, R. A.
Zakharova, E. M.
DOKLADY MATHEMATICS, 2022, 106 (SUPPL 1) : S70 - S78
[25] Decentralized Reinforcement Learning Inspired by Multiagent Systems
Adjodah, Dhaval
PROCEEDINGS OF THE 17TH INTERNATIONAL CONFERENCE ON AUTONOMOUS AGENTS AND MULTIAGENT SYSTEMS (AAMAS' 18), 2018, : 1729 - 1730
[26] The dynamics of reinforcement learning in cooperative multiagent systems
Claus, C
Boutilier, C
FIFTEENTH NATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE (AAAI-98) AND TENTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICAL INTELLIGENCE (IAAI-98) - PROCEEDINGS, 1998, : 746 - 752
[27] An Advising Framework for Multiagent Reinforcement Learning Systems
da Silva, Felipe Leno
Glatt, Ruben
Reali Costa, Anna Helena
THIRTY-FIRST AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2017, : 4913 - 4914
[28] Deep reinforcement learning challenges and opportunities for urban water systems
Negm, Ahmed
Ma, Xiandong
Aggidis, George
WATER RESEARCH, 2024, 253
[29] Scheduling in Multiagent Systems Using Reinforcement Learning
I. K. Minashina
R. A. Gorbachev
E. M. Zakharova
Doklady Mathematics, 2022, 106 : S70 - S78
[30] A Review of Reinforcement Learning Evolution: Taxonomy, Challenges and Emerging Solutions
Tan, Ji Loun
Taha, Bakr Ahmed
Abd Aziz, Norazreen
Mokhtar, Mohd Hadri Hafiz
Mukhlisin, Muhammad
Arsad, Norhana
INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2025, 16 (01) : 490 - 502

← 1 2 3 4 5 →