Collective Intrinsic Motivation of a Multi-agent System Based on Reinforcement Learning Algorithms

被引:0
|
作者
Bolshakov, Vladislav [1 ]
Sakulin, Sergey [1 ]
Alfimtsev, Alexander [1 ]
机构
[1] BMSTU, Moscow, Russia
关键词
Multi-agent reinforcement learning; Intrinsic motivation; Reward shaping; LEVEL;
D O I
10.1007/978-3-031-47718-8_42
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
One of the great challenges in reinforcement learning is learning an optimal behavior in environments with sparse rewards. Solving tasks in such setting require effective exploration methods that are often based on intrinsic rewards. Plenty of real-world problems involve sparse rewards and many of them are further complicated by multi-agent setting, where the majority of intrinsic motivation methods are ineffective. In this paper we address the problem of multi-agent environments with sparse rewards and propose to combine intrinsic rewards and multi-agent reinforcement learning (MARL) technics to create the Collective Intrinsic Motivation of Agents (CIMA) method. CIMA uses both the external reward and the intrinsic collective reward from the cooperative multi-agent system. The proposed method can be used along with any MARL method as base reinforcement learning algorithm. We compare CIMA with several state-of-the-art MARL methods within multi-agent environment with sparse rewards designed in StarCraft II.
引用
收藏
页码:655 / 670
页数:16
相关论文
共 50 条
  • [31] Multi-agent reinforcement learning based on local communication
    Wenxu Zhang
    Lei Ma
    Xiaonan Li
    Cluster Computing, 2019, 22 : 15357 - 15366
  • [32] Cooperative multi-agent game based on reinforcement learning
    Liu, Hongbo
    HIGH-CONFIDENCE COMPUTING, 2024, 4 (01):
  • [33] Survey of Multi-Agent Strategy Based on Reinforcement Learning
    Chen, Liang
    Guo, Ting
    Liu, Yun-ting
    Yang, Jia-ming
    PROCEEDINGS OF THE 32ND 2020 CHINESE CONTROL AND DECISION CONFERENCE (CCDC 2020), 2020, : 604 - 609
  • [34] Cooperative Reinforcement Learning Algorithm to Distributed Power System Based on Multi-Agent
    Gao, La-mei
    Zeng, Jun
    Wu, Jie
    Li, Min
    2009 3RD INTERNATIONAL CONFERENCE ON POWER ELECTRONICS SYSTEMS AND APPLICATIONS: ELECTRIC VEHICLE AND GREEN ENERGY, 2009, : 53 - 53
  • [35] Multi-agent Reinforcement Learning-based Network Intrusion Detection System
    Tellache, Amine
    Mokhtari, Amdjed
    Korba, Abdelaziz Amara
    Ghamri-Doudane, Yacine
    PROCEEDINGS OF 2024 IEEE/IFIP NETWORK OPERATIONS AND MANAGEMENT SYMPOSIUM, NOMS 2024, 2024,
  • [36] MAPS: Multi-agent Reinforcement Learning-based Portfolio Management System
    Lee, Jinho
    Kim, Raehyun
    Yi, Seok-Won
    Kang, Jaewoo
    PROCEEDINGS OF THE TWENTY-NINTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2020, : 4520 - 4526
  • [37] A Heterogeneous Acceleration System for Attention-Based Multi-Agent Reinforcement Learning
    Wiggins, Samuel
    Meng, Yuan
    Iyer, Mahesh A.
    Prasanna, Viktor
    2024 34TH INTERNATIONAL CONFERENCE ON FIELD-PROGRAMMABLE LOGIC AND APPLICATIONS, FPL 2024, 2024, : 236 - 242
  • [38] Modeling collective motion for fish schooling via multi-agent reinforcement learning
    Wang, Xin
    Liu, Shuo
    Yu, Yifan
    Yue, Shengzhi
    Liu, Ying
    Zhang, Fumin
    Lin, Yuanshan
    ECOLOGICAL MODELLING, 2023, 477
  • [39] Multi-Agent Cognition Difference Reinforcement Learning for Multi-Agent Cooperation
    Wang, Huimu
    Qiu, Tenghai
    Liu, Zhen
    Pu, Zhiqiang
    Yi, Jianqiang
    Yuan, Wanmai
    2021 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2021,
  • [40] Multi-Agent Reinforcement Learning With Distributed Targeted Multi-Agent Communication
    Xu, Chi
    Zhang, Hui
    Zhang, Ya
    2023 35TH CHINESE CONTROL AND DECISION CONFERENCE, CCDC, 2023, : 2915 - 2920