Multi-UAV Cooperative Search Based on Reinforcement Learning With a Digital Twin Driven Training Framework

被引:32
|
作者
Shen, Gaoqing [1 ]
Lei, Lei [1 ]
Zhang, Xinting [1 ]
Li, Zhilin [1 ]
Cai, Shengsuo [1 ]
Zhang, Lijuan [1 ]
机构
[1] Nanjing Univ Aeronaut & Astronaut, Coll Elect & Informat Engn, Nanjing 211106, Peoples R China
基金
中国国家自然科学基金;
关键词
Cooperative target search; digital twin; multi-agent deep reinforcement learning; unmanned aerial vehicles; TARGET SEARCH; FUSION;
D O I
10.1109/TVT.2023.3245120
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
This paper considers the cooperative search for stationary targets by multiple unmanned aerial vehicles (UAVs) with limited sensing range and communication ability in a dynamic threatening environment. The main purpose is to use multiple UAVs to find more unknown targets as soon as possible, increase the coverage rate of the mission area, and more importantly, guide UAVs away from threats. However, traditional search methods are mostly unscalable and perform poorly in dynamic environments. A new multi-agent deep reinforcement learning (MADRL) method, DNQMIX, is proposed in this study to solve the multi-UAV cooperative target search (MCTS) problem. The reward function is also newly designed for the MCTS problem to guide UAVs to explore and exploit the environment information more efficiently. Moreover, this paper proposes a digital twin (DT) driven training framework "centralized training, decentralized execution, and continuous evolution" (CTDECE). It can facilitate the continuous evolution of MADRL models and solve the tradeoff between training speed and environment fidelity when MADRL is applied to real-world multi-UAV systems. Simulation results show that DNQMIX outperforms state-of-art methods in terms of search rate and coverage rate.
引用
收藏
页码:8354 / 8368
页数:15
相关论文
共 50 条
  • [1] A Method of Multi-UAV Cooperative Task Assignment Based on Reinforcement Learning
    Zhao, Xiaohu
    Jiang, Hanli
    An, Chenyang
    Wu, Ruocheng
    Guo, Yijun
    Yang, Daquan
    MOBILE INFORMATION SYSTEMS, 2022, 2022
  • [2] Multi-UAV Cooperative Target Assignment Method Based on Reinforcement Learning
    Ding, Yunlong
    Kuang, Minchi
    Shi, Heng
    Gao, Jiazhan
    DRONES, 2024, 8 (10)
  • [3] Deep Reinforcement Learning Based Computation Offloading and Trajectory Planning for Multi-UAV Cooperative Target Search
    Luo, Quyuan
    Luan, Tom H.
    Shi, Weisong
    Fan, Pingzhi
    IEEE JOURNAL ON SELECTED AREAS IN COMMUNICATIONS, 2023, 41 (02) : 504 - 520
  • [4] Reinforcement Learning based Approach for Multi-UAV Cooperative Searching in Unknown Environments
    Yue, Wei
    Guan, Xianhe
    Xi, Yun
    2019 CHINESE AUTOMATION CONGRESS (CAC2019), 2019, : 2018 - 2023
  • [5] Reinforcement-Learning-Based Multi-UAV Cooperative Search for Moving Targets in 3D Scenarios
    Liu, Yifei
    Li, Xiaoshuai
    Wang, Jian
    Wei, Feiyu
    Yang, Junan
    DRONES, 2024, 8 (08)
  • [6] Deep Reinforcement Learning for Flocking Motion of Multi-UAV Systems: Learn From a Digital Twin
    Shen, Gaoqing
    Lei, Lei
    Li, Zhilin
    Cai, Shengsuo
    Zhang, Lijuan
    Cao, Pan
    Liu, Xiaojiao
    IEEE INTERNET OF THINGS JOURNAL, 2022, 9 (13): : 11141 - 11153
  • [7] Multi-UAV cooperative search using an opportunistic learning method
    Yang, Yanli
    Polycarpou, Marios M.
    Minai, Ali A.
    JOURNAL OF DYNAMIC SYSTEMS MEASUREMENT AND CONTROL-TRANSACTIONS OF THE ASME, 2007, 129 (05): : 716 - 728
  • [8] A Multiagent Deep Reinforcement Learning Approach for Multi-UAV Cooperative Search in Multilayered Aerial Computing Networks
    Wu, Jiaqi
    Luo, Jingjing
    Jiang, Changkun
    Gao, Lin
    IEEE INTERNET OF THINGS JOURNAL, 2025, 12 (05): : 5807 - 5821
  • [9] Digital-Twin-Assisted Task Assignment in Multi-UAV Systems: A Deep Reinforcement Learning Approach
    Tang, Xin
    Li, Xiaohuan
    Yu, Rong
    Wu, Yuan
    Ye, Jin
    Tang, Fengzhu
    Chen, Qian
    IEEE INTERNET OF THINGS JOURNAL, 2023, 10 (17): : 15362 - 15375
  • [10] Multi-UAV Collaborative Detection Based on Reinforcement Learning
    Hao, Yuanhui
    Guo, Chubing
    Ke, Liangjun
    ADVANCES IN SWARM INTELLIGENCE, PT I, ICSI 2024, 2024, 14788 : 463 - 474