Deep Reinforcement Learning for Multiagent Systems: A Review of Challenges, Solutions, and Applications

被引:658
|
作者
Nguyen, Thanh Thi [1 ]
Nguyen, Ngoc Duy [2 ]
Nahavandi, Saeid [2 ]
机构
[1] Deakin Univ, Sch Informat Technol, Burwood Campus, Burwood, Vic 3125, Australia
[2] Deakin Univ, Inst Intelligent Syst Res & Innovat, Waurn Ponds Campus, Waurn Ponds, Vic 3216, Australia
关键词
Mathematical model; Robots; Dynamic programming; Games; Reinforcement learning; Deep learning; Observability; Continuous action space; deep learning; deep reinforcement learning (RL); multiagent; nonstationary; partial observability; review; robotics; survey; DYNAMICS; ROBOTS; GAMES;
D O I
10.1109/TCYB.2020.2977374
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Reinforcement learning (RL) algorithms have been around for decades and employed to solve various sequential decision-making problems. These algorithms, however, have faced great challenges when dealing with high-dimensional environments. The recent development of deep learning has enabled RL methods to drive optimal policies for sophisticated and capable agents, which can perform efficiently in these challenging environments. This article addresses an important aspect of deep RL related to situations that require multiple agents to communicate and cooperate to solve complex tasks. A survey of different approaches to problems related to multiagent deep RL (MADRL) is presented, including nonstationarity, partial observability, continuous state and action spaces, multiagent training schemes, and multiagent transfer learning. The merits and demerits of the reviewed methods will be analyzed and discussed with their corresponding applications explored. It is envisaged that this review provides insights about various MADRL methods and can lead to the future development of more robust and highly useful multiagent learning methods for solving real-world problems.
引用
收藏
页码:3826 / 3839
页数:14
相关论文
共 50 条
  • [21] Applications of deep reinforcement learning in nuclear energy: A review
    Liu, Yongchao
    Wang, Bo
    Tan, Sichao
    Li, Tong
    Lv, Wei
    Niu, Zhenfeng
    Li, Jiangkuan
    Gao, Puzhen
    Tian, Ruifeng
    NUCLEAR ENGINEERING AND DESIGN, 2024, 429
  • [22] A survey on transfer learning for multiagent reinforcement learning systems
    Da Silva, Felipe Leno
    Reali Costa, Anna Helena
    Journal of Artificial Intelligence Research, 2019, 64 : 645 - 703
  • [23] A Survey on Transfer Learning for Multiagent Reinforcement Learning Systems
    Da Silva, Felipe Leno
    Reali Costa, Anna Helena
    JOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH, 2019, 64 : 645 - 703
  • [24] Scheduling in Multiagent Systems Using Reinforcement Learning
    Minashina, I. K.
    Gorbachev, R. A.
    Zakharova, E. M.
    DOKLADY MATHEMATICS, 2022, 106 (SUPPL 1) : S70 - S78
  • [25] Decentralized Reinforcement Learning Inspired by Multiagent Systems
    Adjodah, Dhaval
    PROCEEDINGS OF THE 17TH INTERNATIONAL CONFERENCE ON AUTONOMOUS AGENTS AND MULTIAGENT SYSTEMS (AAMAS' 18), 2018, : 1729 - 1730
  • [26] The dynamics of reinforcement learning in cooperative multiagent systems
    Claus, C
    Boutilier, C
    FIFTEENTH NATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE (AAAI-98) AND TENTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICAL INTELLIGENCE (IAAI-98) - PROCEEDINGS, 1998, : 746 - 752
  • [27] An Advising Framework for Multiagent Reinforcement Learning Systems
    da Silva, Felipe Leno
    Glatt, Ruben
    Reali Costa, Anna Helena
    THIRTY-FIRST AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2017, : 4913 - 4914
  • [28] Deep reinforcement learning challenges and opportunities for urban water systems
    Negm, Ahmed
    Ma, Xiandong
    Aggidis, George
    WATER RESEARCH, 2024, 253
  • [29] Scheduling in Multiagent Systems Using Reinforcement Learning
    I. K. Minashina
    R. A. Gorbachev
    E. M. Zakharova
    Doklady Mathematics, 2022, 106 : S70 - S78
  • [30] A Review of Reinforcement Learning Evolution: Taxonomy, Challenges and Emerging Solutions
    Tan, Ji Loun
    Taha, Bakr Ahmed
    Abd Aziz, Norazreen
    Mokhtar, Mohd Hadri Hafiz
    Mukhlisin, Muhammad
    Arsad, Norhana
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2025, 16 (01) : 490 - 502