Multiagent Reinforcement Learning for Swarm Confrontation Environments

被引：9

作者：

Zhang, Guanyu ^{[1
]}

Li, Yuan ^{[2
]}

Xu, Xinhai ^{[2
]}

Dai, Huadong ^{[2
]}

机构：

[1] Natl Univ Def Technol, Changsha 410073, Hunan, Peoples R China

[2] Natl Innovat Inst Def Technol, Artificial Intelligence Res Ctr, Beijing 100071, Peoples R China

来源：

INTELLIGENT ROBOTICS AND APPLICATIONS, ICIRA 2019, PT III | 2019年 / 11742卷

基金：

中国国家自然科学基金;

关键词：

Swarm confrontation; Multiple reinforcement learning; Transfer leaning; Self-play; GAME;

D O I：

10.1007/978-3-030-27535-8_48

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

The swarm confrontation problem is always a hot research topic, which has attracted much attention. Previous research focuses on devising rules to improve the intelligence of the swarm, which is not suitable for complex scenarios. Multi-agent reinforcement learning has been used in some similar confrontation tasks. However, many of these works take centralized method to control all entities in a swarm, which is hard to meet the real-time requirement of practical systems. Recently, OpenAI proposes Multi-Agent Deep Deterministic Policy Gradient algorithm (MADDPG), which can be used for centralized training but decentralized execution in multi-agent environments. We examine the method in our constructed swarm confrontation environment and find that it is not easy to deal with complex scenarios. We propose two improved training methods, scenario-transfer training and self-play training, which greatly enhance the performance of MADDPG. Experimental results show that the scenario-transfer training accelerate the convergence speed by 50%, and the self-play training increases the winning rate of MADDPG from 42% to 96%.

引用

页码：533 / 543

页数：11

共 50 条

[1] UAV Swarm Confrontation Using Hierarchical Multiagent Reinforcement Learning
Wang, Baolai
Li, Shengang
Gao, Xianzhong
Xie, Tao
INTERNATIONAL JOURNAL OF AEROSPACE ENGINEERING, 2021, 2021
[2] Hierarchical Reinforcement Learning for Swarm Confrontation With High Uncertainty
Wu, Qizhen
Liu, Kexin
Chen, Lei
Lu, Jinhu
IEEE TRANSACTIONS ON AUTOMATION SCIENCE AND ENGINEERING, 2024, : 8630 - 8644
[3] Reinforcement Learning with Symbiotic Relationships for Multiagent Environments
Mabu, Shingo
Obayashi, Masanao
Kuremoto, Takashi
PROCEEDINGS OF INTERNATIONAL CONFERENCE ON ARTIFICIAL LIFE AND ROBOTICS (ICAROB2015), 2015, : 102 - 106
[4] Reinforcement Learning with Symbiotic Relationships for Multiagent Environments
Mabu, Shingo
Obayashi, Masanao
Kuremoto, Takashi
JOURNAL OF ROBOTICS NETWORKING AND ARTIFICIAL LIFE, 2015, 2 (01): : 40 - 45
[5] Task assignment in ground-to-air confrontation based on multiagent deep reinforcement learning
Liu, Jia-yi
Wang, Gang
Fu, Qiang
Yue, Shao-hua
Wang, Si-yuan
DEFENCE TECHNOLOGY, 2023, 19 : 210 - 219
[6] Task assignment in ground-to-air confrontation based on multiagent deep reinforcement learning
Jia-yi Liu
Gang Wang
Qiang Fu
Shao-hua Yue
Si-yuan Wang
Defence Technology, 2023, 19 (01) : 210 - 219
[7] UAV Swarm Confrontation Based on Multi-agent Deep Reinforcement Learning
Wang, Zhi
Liu, Fan
Guo, Jing
Hong, Chen
Chen, Ming
Wang, Ershen
Zhao, Yunbo
2022 41ST CHINESE CONTROL CONFERENCE (CCC), 2022, : 4996 - 5001
[8] Collaborative decision-making for UAV swarm confrontation based on reinforcement learning
Jiao, Yongkang
Fu, Wenxing
Cao, Xinying
Su, Qiangqing
Wang, Yusheng
Shen, Zixiang
Yu, Lanlin
IET CONTROL THEORY AND APPLICATIONS, 2025, 19 (01):
[9] Weighted Double Deep Multiagent Reinforcement Learning in Stochastic Cooperative Environments
Zheng, Yan
Meng, Zhaopeng
Hao, Jianye
Zhang, Zongzhang
PRICAI 2018: TRENDS IN ARTIFICIAL INTELLIGENCE, PT II, 2018, 11013 : 421 - 429
[10] Performance Evaluation of Multiagent Reinforcement Learning Based Training Methods for Swarm Fighting
Gao, Huanli
Cai, Yahui
Cai, He
Lu, Haolin
Lu, Jiahui
WIRELESS COMMUNICATIONS & MOBILE COMPUTING, 2022, 2022

← 1 2 3 4 5 →