Multiagent Reinforcement Learning for Swarm Confrontation Environments

被引:9
|
作者
Zhang, Guanyu [1 ]
Li, Yuan [2 ]
Xu, Xinhai [2 ]
Dai, Huadong [2 ]
机构
[1] Natl Univ Def Technol, Changsha 410073, Hunan, Peoples R China
[2] Natl Innovat Inst Def Technol, Artificial Intelligence Res Ctr, Beijing 100071, Peoples R China
来源
INTELLIGENT ROBOTICS AND APPLICATIONS, ICIRA 2019, PT III | 2019年 / 11742卷
基金
中国国家自然科学基金;
关键词
Swarm confrontation; Multiple reinforcement learning; Transfer leaning; Self-play; GAME;
D O I
10.1007/978-3-030-27535-8_48
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The swarm confrontation problem is always a hot research topic, which has attracted much attention. Previous research focuses on devising rules to improve the intelligence of the swarm, which is not suitable for complex scenarios. Multi-agent reinforcement learning has been used in some similar confrontation tasks. However, many of these works take centralized method to control all entities in a swarm, which is hard to meet the real-time requirement of practical systems. Recently, OpenAI proposes Multi-Agent Deep Deterministic Policy Gradient algorithm (MADDPG), which can be used for centralized training but decentralized execution in multi-agent environments. We examine the method in our constructed swarm confrontation environment and find that it is not easy to deal with complex scenarios. We propose two improved training methods, scenario-transfer training and self-play training, which greatly enhance the performance of MADDPG. Experimental results show that the scenario-transfer training accelerate the convergence speed by 50%, and the self-play training increases the winning rate of MADDPG from 42% to 96%.
引用
收藏
页码:533 / 543
页数:11
相关论文
共 50 条
  • [1] UAV Swarm Confrontation Using Hierarchical Multiagent Reinforcement Learning
    Wang, Baolai
    Li, Shengang
    Gao, Xianzhong
    Xie, Tao
    INTERNATIONAL JOURNAL OF AEROSPACE ENGINEERING, 2021, 2021
  • [2] Hierarchical Reinforcement Learning for Swarm Confrontation With High Uncertainty
    Wu, Qizhen
    Liu, Kexin
    Chen, Lei
    Lu, Jinhu
    IEEE TRANSACTIONS ON AUTOMATION SCIENCE AND ENGINEERING, 2024, : 8630 - 8644
  • [3] Reinforcement Learning with Symbiotic Relationships for Multiagent Environments
    Mabu, Shingo
    Obayashi, Masanao
    Kuremoto, Takashi
    PROCEEDINGS OF INTERNATIONAL CONFERENCE ON ARTIFICIAL LIFE AND ROBOTICS (ICAROB2015), 2015, : 102 - 106
  • [4] Reinforcement Learning with Symbiotic Relationships for Multiagent Environments
    Mabu, Shingo
    Obayashi, Masanao
    Kuremoto, Takashi
    JOURNAL OF ROBOTICS NETWORKING AND ARTIFICIAL LIFE, 2015, 2 (01): : 40 - 45
  • [5] Task assignment in ground-to-air confrontation based on multiagent deep reinforcement learning
    Liu, Jia-yi
    Wang, Gang
    Fu, Qiang
    Yue, Shao-hua
    Wang, Si-yuan
    DEFENCE TECHNOLOGY, 2023, 19 : 210 - 219
  • [6] Task assignment in ground-to-air confrontation based on multiagent deep reinforcement learning
    Jia-yi Liu
    Gang Wang
    Qiang Fu
    Shao-hua Yue
    Si-yuan Wang
    Defence Technology, 2023, 19 (01) : 210 - 219
  • [7] UAV Swarm Confrontation Based on Multi-agent Deep Reinforcement Learning
    Wang, Zhi
    Liu, Fan
    Guo, Jing
    Hong, Chen
    Chen, Ming
    Wang, Ershen
    Zhao, Yunbo
    2022 41ST CHINESE CONTROL CONFERENCE (CCC), 2022, : 4996 - 5001
  • [8] Collaborative decision-making for UAV swarm confrontation based on reinforcement learning
    Jiao, Yongkang
    Fu, Wenxing
    Cao, Xinying
    Su, Qiangqing
    Wang, Yusheng
    Shen, Zixiang
    Yu, Lanlin
    IET CONTROL THEORY AND APPLICATIONS, 2025, 19 (01):
  • [9] Weighted Double Deep Multiagent Reinforcement Learning in Stochastic Cooperative Environments
    Zheng, Yan
    Meng, Zhaopeng
    Hao, Jianye
    Zhang, Zongzhang
    PRICAI 2018: TRENDS IN ARTIFICIAL INTELLIGENCE, PT II, 2018, 11013 : 421 - 429
  • [10] Performance Evaluation of Multiagent Reinforcement Learning Based Training Methods for Swarm Fighting
    Gao, Huanli
    Cai, Yahui
    Cai, He
    Lu, Haolin
    Lu, Jiahui
    WIRELESS COMMUNICATIONS & MOBILE COMPUTING, 2022, 2022