Dynamic Attention Network for Multi-UAV Reinforcement Learning

被引:0
|
作者
Xu, Dongsheng [1 ]
Wu, Shang [1 ]
机构
[1] Natl Univ Def Technol, Sci & Technol Parallel & Distributed Proc Lab, Coll Comp, Changsha, Hunan, Peoples R China
关键词
MADDPG; Transfer learning; Attention; Reinforcement learning; LEVEL;
D O I
10.1117/12.2626437
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Recent methods for multi-agent reinforcement learning problems make use of Deep Neural Networks and provide stateof-the-art performance with dedicated neural network architectures and comprehensive training tricks. However, these deep reinforcement learning methods suffer from reproducibility issues, especially in transfer learning. Since the fixed size of the network input, it is difficult for the existing network structure to transfer the strategies learned from a small scale to a large scale. We argue that proper network architecture design is crucial to the cross-scale reinforcement transfer learning. In this paper, we use transfer training with attention network to solve multi-agent combat problems from aerial unmanned aerial vehicle (UAV) combat scenarios, and extend the small-scale learning to large-scale complex scenarios. We combine the attention neural network with the MADDPG algorithm to process the agent observation. It started training from a small-scale multi-UAV combat scenario and gradually increases the number of UAV. The experimental results show that methods for multi-agent UAV combat problems trained by attention transfer learning can achieve the target performance faster and provide better performance than the method without attention transfer learning.
引用
收藏
页数:6
相关论文
共 50 条
  • [21] Multi-UAV Cooperative Target Assignment Method Based on Reinforcement Learning
    Ding, Yunlong
    Kuang, Minchi
    Shi, Heng
    Gao, Jiazhan
    DRONES, 2024, 8 (10)
  • [22] Deep Reinforcement Learning for Multi-UAV Exploration Under Energy Constraints
    Zhou, Yating
    Shi, Dianxi
    Yang, Huanhuan
    Hu, Haomeng
    Yang, Shaowu
    Zhang, Yongjun
    COLLABORATIVE COMPUTING: NETWORKING, APPLICATIONS AND WORKSHARING, COLLABORATECOM 2022, PT II, 2022, 461 : 363 - 379
  • [23] On Designing Multi-UAV Aided Wireless Powered Dynamic Communication via Hierarchical Deep Reinforcement Learning
    Zhao, Ze Yu
    Che, Yue Ling
    Luo, Sheng
    Luo, Gege
    Wu, Kaishun
    Leung, Victor C. M.
    IEEE TRANSACTIONS ON MOBILE COMPUTING, 2024, 23 (12) : 13991 - 14004
  • [24] A deep reinforcement learning based distributed multi-UAV dynamic area coverage algorithm for complex environment
    Xiao, Jian
    Yuan, Guohui
    Xue, Yuxi
    He, Jinhui
    Wang, Yaoting
    Zou, Yuanjiang
    Wang, Zhuoran
    NEUROCOMPUTING, 2024, 595
  • [25] Bayesian Optimization Enhanced Deep Reinforcement Learning for Trajectory Planning and Network Formation in Multi-UAV Networks
    Gong, Shimin
    Wang, Meng
    Gu, Bo
    Zhang, Wenjie
    Dinh Thai Hoang
    Niyato, Dusit
    IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2023, 72 (08) : 10933 - 10948
  • [26] Multi-UAV Escape Target Search: A Multi-Agent Reinforcement Learning Method
    Liao, Guang
    Wang, Jian
    Yang, Dujia
    Yang, Junan
    SENSORS, 2024, 24 (21)
  • [27] Multi-UAV Path Planning and Following Based on Multi-Agent Reinforcement Learning
    Zhao, Xiaoru
    Yang, Rennong
    Zhong, Liangsheng
    Hou, Zhiwei
    DRONES, 2024, 8 (01)
  • [28] An evolutionary multi-agent reinforcement learning algorithm for multi-UAV air combat
    Wang, Baolai
    Gao, Xianzhong
    Xie, Tao
    KNOWLEDGE-BASED SYSTEMS, 2024, 299
  • [29] Maintaining Connectivity for Multi-UAV Multi-Target Search Using Reinforcement Learning
    Guven, Islam
    Yanmaz, Evsen
    PROCEEDINGS OF THE INT'L ACM SYMPOSIUM ON DESIGN AND ANALYSIS OF INTELLIGENT VEHICULAR NETWORKS AND APPLICATIONS, DIVANET 2023, 2023, : 109 - 114
  • [30] Joint optimization of communication and mission performance for multi-UAV collaboration network: A multi-agent reinforcement learning method
    He, Yuan
    Xie, Jun
    Hu, Guyu
    Liu, Yaqun
    Luo, Xijian
    AD HOC NETWORKS, 2024, 164