Deep Reinforcement Learning for Multiagent Systems: A Review of Challenges, Solutions, and Applications

被引:658
|
作者
Nguyen, Thanh Thi [1 ]
Nguyen, Ngoc Duy [2 ]
Nahavandi, Saeid [2 ]
机构
[1] Deakin Univ, Sch Informat Technol, Burwood Campus, Burwood, Vic 3125, Australia
[2] Deakin Univ, Inst Intelligent Syst Res & Innovat, Waurn Ponds Campus, Waurn Ponds, Vic 3216, Australia
关键词
Mathematical model; Robots; Dynamic programming; Games; Reinforcement learning; Deep learning; Observability; Continuous action space; deep learning; deep reinforcement learning (RL); multiagent; nonstationary; partial observability; review; robotics; survey; DYNAMICS; ROBOTS; GAMES;
D O I
10.1109/TCYB.2020.2977374
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Reinforcement learning (RL) algorithms have been around for decades and employed to solve various sequential decision-making problems. These algorithms, however, have faced great challenges when dealing with high-dimensional environments. The recent development of deep learning has enabled RL methods to drive optimal policies for sophisticated and capable agents, which can perform efficiently in these challenging environments. This article addresses an important aspect of deep RL related to situations that require multiple agents to communicate and cooperate to solve complex tasks. A survey of different approaches to problems related to multiagent deep RL (MADRL) is presented, including nonstationarity, partial observability, continuous state and action spaces, multiagent training schemes, and multiagent transfer learning. The merits and demerits of the reviewed methods will be analyzed and discussed with their corresponding applications explored. It is envisaged that this review provides insights about various MADRL methods and can lead to the future development of more robust and highly useful multiagent learning methods for solving real-world problems.
引用
收藏
页码:3826 / 3839
页数:14
相关论文
共 50 条
  • [31] Consistent epistemic planning for multiagent deep reinforcement learning
    Wu, Peiliang
    Luo, Shicheng
    Tian, Liqiang
    Mao, Bingyi
    Chen, Wenbai
    INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2024, 15 (05) : 1663 - 1675
  • [32] Deep Multitask Multiagent Reinforcement Learning With Knowledge Transfer
    Mai, Yuxiang
    Zang, Yifan
    Yin, Qiyue
    Ni, Wancheng
    Huang, Kaiqi
    IEEE TRANSACTIONS ON GAMES, 2024, 16 (03) : 566 - 576
  • [33] Effect of reinforcement learning on coordination of multiagent systems
    Bukkapatnam, S
    Gao, G
    NETWORK INTELLIGENCE: INTERNET-BASED MANUFACTURING, 2000, 4208 : 31 - 41
  • [34] Coordination in multiagent reinforcement learning systems by virtual reinforcement signals
    Kamal, M.
    Murata, Junichi
    INTERNATIONAL JOURNAL OF KNOWLEDGE-BASED AND INTELLIGENT ENGINEERING SYSTEMS, 2007, 11 (03) : 181 - 191
  • [35] A Distributional Perspective on Multiagent Cooperation With Deep Reinforcement Learning
    Huang, Liwei
    Fu, Mingsheng
    Rao, Ananya
    Irissappane, Athirai A.
    Zhang, Jie
    Xu, Chengzhong
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, 35 (03) : 4246 - 4259
  • [36] Consistent epistemic planning for multiagent deep reinforcement learning
    Peiliang Wu
    Shicheng Luo
    Liqiang Tian
    Bingyi Mao
    Wenbai Chen
    International Journal of Machine Learning and Cybernetics, 2024, 15 : 1663 - 1675
  • [37] Scalable Reinforcement Learning for Multiagent Networked Systems
    Qu, Guannan
    Wierman, Adam
    Li, Na
    OPERATIONS RESEARCH, 2022, 70 (06) : 3601 - 3628
  • [38] Deep Graph Reinforcement Learning for Mobile Edge Computing: Challenges and Solutions
    Wang, Yixiao
    Wu, Huaming
    Li, Ruidong
    IEEE NETWORK, 2024, 38 (05): : 314 - 323
  • [39] Multiagent Reinforcement Social Learning toward Coordination in Cooperative Multiagent Systems
    Hao, Jianye
    Leung, Ho-Fung
    Ming, Zhong
    ACM TRANSACTIONS ON AUTONOMOUS AND ADAPTIVE SYSTEMS, 2015, 9 (04)
  • [40] Multi-Agent Reinforcement Learning: A Review of Challenges and Applications
    Canese, Lorenzo
    Cardarilli, Gian Carlo
    Di Nunzio, Luca
    Fazzolari, Rocco
    Giardino, Daniele
    Re, Marco
    Spano, Sergio
    APPLIED SCIENCES-BASEL, 2021, 11 (11):