Collective Intrinsic Motivation of a Multi-agent System Based on Reinforcement Learning Algorithms

被引:0
|
作者
Bolshakov, Vladislav [1 ]
Sakulin, Sergey [1 ]
Alfimtsev, Alexander [1 ]
机构
[1] BMSTU, Moscow, Russia
关键词
Multi-agent reinforcement learning; Intrinsic motivation; Reward shaping; LEVEL;
D O I
10.1007/978-3-031-47718-8_42
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
One of the great challenges in reinforcement learning is learning an optimal behavior in environments with sparse rewards. Solving tasks in such setting require effective exploration methods that are often based on intrinsic rewards. Plenty of real-world problems involve sparse rewards and many of them are further complicated by multi-agent setting, where the majority of intrinsic motivation methods are ineffective. In this paper we address the problem of multi-agent environments with sparse rewards and propose to combine intrinsic rewards and multi-agent reinforcement learning (MARL) technics to create the Collective Intrinsic Motivation of Agents (CIMA) method. CIMA uses both the external reward and the intrinsic collective reward from the cooperative multi-agent system. The proposed method can be used along with any MARL method as base reinforcement learning algorithm. We compare CIMA with several state-of-the-art MARL methods within multi-agent environment with sparse rewards designed in StarCraft II.
引用
收藏
页码:655 / 670
页数:16
相关论文
共 50 条
  • [1] Learning Cooperative Intrinsic Motivation in Multi-Agent Reinforcement Learning
    Hong, Seung-Jin
    Lee, Sang-Kwang
    12TH INTERNATIONAL CONFERENCE ON ICT CONVERGENCE (ICTC 2021): BEYOND THE PANDEMIC ERA WITH ICT CONVERGENCE INNOVATION, 2021, : 1697 - 1699
  • [2] Social Influence as Intrinsic Motivation for Multi-Agent Deep Reinforcement Learning
    Jaques, Natasha
    Lazaridou, Angeliki
    Hughes, Edward
    Gulcehre, Caglar
    Ortega, Pedro A.
    Strouse, D. J.
    Leibo, Joel Z.
    de Freitas, Nando
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 97, 2019, 97
  • [3] A Review of Multi-Agent Reinforcement Learning Algorithms
    Liang, Jiaxin
    Miao, Haotian
    Li, Kai
    Tan, Jianheng
    Wang, Xi
    Luo, Rui
    Jiang, Yueqiu
    ELECTRONICS, 2025, 14 (04):
  • [4] Temporal Inconsistency-Based Intrinsic Reward for Multi-Agent Reinforcement Learning
    Sun, Shaoqi
    Xu, Kele
    2023 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, IJCNN, 2023,
  • [5] LIIR: Learning Individual Intrinsic Reward in Multi-Agent Reinforcement Learning
    Du, Yali
    Han, Lei
    Fang, Meng
    Dai, Tianhong
    Liu, Ji
    Tao, Dacheng
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019), 2019, 32
  • [6] MaCA: a Multi-agent Reinforcement Learning Platform for Collective Intelligence
    Gao, Fang
    Chen, Si
    Li, Mingqiang
    Huang, Bincheng
    PROCEEDINGS OF 2019 IEEE 10TH INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING AND SERVICE SCIENCE (ICSESS 2019), 2019, : 108 - 111
  • [7] Reinforcement learning based on multi-agent in RoboCup
    Zhang, W
    Li, JG
    Ruan, XG
    ADVANCES IN INTELLIGENT COMPUTING, PT 1, PROCEEDINGS, 2005, 3644 : 967 - 975
  • [8] Investigation of independent reinforcement learning algorithms in multi-agent environments
    Lee, Ken Ming
    Ganapathi Subramanian, Sriram
    Crowley, Mark
    FRONTIERS IN ARTIFICIAL INTELLIGENCE, 2022, 5
  • [9] Applications of Multi-Agent Deep Reinforcement Learning: Models and Algorithms
    Ibrahim, Abdikarim Mohamed
    Yau, Kok-Lim Alvin
    Chong, Yung-Wey
    Wu, Celimuge
    APPLIED SCIENCES-BASEL, 2021, 11 (22):
  • [10] A Multi-group Multi-agent System Based on Reinforcement Learning and Flocking
    Gang Wang
    Jian Xiao
    Rui Xue
    Yongting Yuan
    International Journal of Control, Automation and Systems, 2022, 20 : 2364 - 2378