Coordinated behavior of cooperative agents using deep reinforcement learning

Cited by: 11
Authors
Diallo, Elhadji Amadou Oury [1]
Sugiyama, Ayumi [1]
Sugawara, Toshiharu [1]
Affiliations
[1] Waseda Univ, Dept Comp Sci & Commun Engn, Shinjuku Ku, 3-4-1 Okubo, Tokyo 1698555, Japan
Keywords
Deep reinforcement learning; Multi-agent systems; Cooperation; Coordination; Linear multiagent systems; Intelligence; Taxonomy
DOI
10.1016/j.neucom.2018.08.094
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
In this work, we focus on an environment where multiple agents with complementary capabilities cooperate to generate non-conflicting joint actions that achieve a specific target. The central problem addressed is how several agents can collectively learn to coordinate their actions such that they complete a given task together without conflicts. However, sequential decision-making under uncertainty is one of the most challenging issues for intelligent cooperative systems. To address this, we propose a multi-agent concurrent framework where agents learn coordinated behaviors in order to divide their areas of responsibility. The proposed framework is an extension of some recent deep reinforcement learning algorithms such as DQN, double DQN, and dueling network architectures. Then, we investigate how the learned behaviors change according to the dynamics of the environment, reward scheme, and network structures. Next, we show how agents behave and choose their actions such that the resulting joint actions are optimal. We finally show that our method can lead to stable solutions in our specific environment. (C) 2019 Elsevier B.V. All rights reserved.
Pages: 230-240
Number of pages: 11
Related papers
50 in total
  • [1] Learning Cooperative Visual Dialog Agents with Deep Reinforcement Learning
    Das, Abhishek
    Kottur, Satwik
    Moura, Jose M. F.
    Lee, Stefan
    Batra, Dhruv
    2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, : 2970 - 2979
  • [2] Navigational Behavior of Humans and Deep Reinforcement Learning Agents
    Rigoli, Lillian M.
    Patil, Gaurav
    Stening, Hamish F.
    Kallen, Rachel W.
    Richardson, Michael J.
    FRONTIERS IN PSYCHOLOGY, 2021, 12
  • [3] Cooperative behavior of a heterogeneous robot team for planetary exploration using deep reinforcement learning
    Barth, Andrew
    Ma, Ou
    ACTA ASTRONAUTICA, 2024, 214 : 689 - 700
  • [4] Cooperative behavior of a heterogeneous robot team for planetary exploration using deep reinforcement learning
    Barth, Andrew
    Ma, Ou
    Acta Astronautica, 2024, 214 : 689 - 700
  • [5] Two-stage reward allocation with decay for multi-agent coordinated behavior for sequential cooperative task by using deep reinforcement learning
    Miyashita, Yuki
    Sugawara, Toshiharu
    Autonomous Intelligent Systems, 2022, 2 (01):
  • [6] Two-stage reward allocation with decay for multi-agent coordinated behavior for sequential cooperative task by using deep reinforcement learning
    Miyashita Y.
    Sugawara T.
    Autonomous Intelligent Systems, 2 (1):
  • [7] ON THE DEVELOPMENT OF AUTONOMOUS AGENTS USING DEEP REINFORCEMENT LEARNING
    Barbu, Clara
    Mocanu, Stefan Alexandru
    UNIVERSITY POLITEHNICA OF BUCHAREST SCIENTIFIC BULLETIN SERIES C-ELECTRICAL ENGINEERING AND COMPUTER SCIENCE, 2021, 83 (03): : 97 - 116
  • [8] On the development of autonomous agents using deep reinforcement learning
    Barbu, Clara
    Mocanu, Ștefan Alexandru
    UPB Scientific Bulletin, Series C: Electrical Engineering and Computer Science, 2021, 83 (03): : 97 - 116
  • [9] Behavior analysis of emergent rule discovery for cooperative automated driving using deep reinforcement learning
    Harada, Tomohiro
    Matsuoka, Johei
    Hattori, Kiyohiko
    ARTIFICIAL LIFE AND ROBOTICS, 2023, 28 (01) : 31 - 42
  • [10] Behavior analysis of emergent rule discovery for cooperative automated driving using deep reinforcement learning
    Tomohiro Harada
    Johei Matsuoka
    Kiyohiko Hattori
    Artificial Life and Robotics, 2023, 28 : 31 - 42