Multi-UAV Cooperative Search Based on Reinforcement Learning With a Digital Twin Driven Training Framework

被引:32
|
作者
Shen, Gaoqing [1 ]
Lei, Lei [1 ]
Zhang, Xinting [1 ]
Li, Zhilin [1 ]
Cai, Shengsuo [1 ]
Zhang, Lijuan [1 ]
机构
[1] Nanjing Univ Aeronaut & Astronaut, Coll Elect & Informat Engn, Nanjing 211106, Peoples R China
基金
中国国家自然科学基金;
关键词
Cooperative target search; digital twin; multi-agent deep reinforcement learning; unmanned aerial vehicles; TARGET SEARCH; FUSION;
D O I
10.1109/TVT.2023.3245120
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
This paper considers the cooperative search for stationary targets by multiple unmanned aerial vehicles (UAVs) with limited sensing range and communication ability in a dynamic threatening environment. The main purpose is to use multiple UAVs to find more unknown targets as soon as possible, increase the coverage rate of the mission area, and more importantly, guide UAVs away from threats. However, traditional search methods are mostly unscalable and perform poorly in dynamic environments. A new multi-agent deep reinforcement learning (MADRL) method, DNQMIX, is proposed in this study to solve the multi-UAV cooperative target search (MCTS) problem. The reward function is also newly designed for the MCTS problem to guide UAVs to explore and exploit the environment information more efficiently. Moreover, this paper proposes a digital twin (DT) driven training framework "centralized training, decentralized execution, and continuous evolution" (CTDECE). It can facilitate the continuous evolution of MADRL models and solve the tradeoff between training speed and environment fidelity when MADRL is applied to real-world multi-UAV systems. Simulation results show that DNQMIX outperforms state-of-art methods in terms of search rate and coverage rate.
引用
收藏
页码:8354 / 8368
页数:15
相关论文
共 50 条
  • [41] Multi-UAV Trajectory Design and Power Control Based on Deep Reinforcement Learning
    Zhang C.Y.
    Liang S.Y.
    He C.L.
    Wang K.Z.
    Journal of Communications and Information Networks, 2022, 7 (02): : 192 - 201
  • [42] Multi-UAV Cooperative Coverage Search for Various Regions Based on Differential Evolution Algorithm
    Zeng, Hui
    Tong, Lei
    Xia, Xuewen
    BIOMIMETICS, 2024, 9 (07)
  • [43] Q-Learning-based Multi-UAV Cooperative Path Planning Method
    Yin Y.
    Wang X.
    Zhou J.
    Binggong Xuebao/Acta Armamentarii, 2023, 44 (02): : 484 - 495
  • [44] Multi-UAV Cooperative Search Planning Algorithm Based on Dynamic Target Probability Model
    Ao, Zihang
    Zhang, Yulong
    Huang, Jing
    Lin, Yichen
    Zhou, Xiaoden
    Zhang, Youmin
    2023 INTERNATIONAL CONFERENCE ON UNMANNED AIRCRAFT SYSTEMS, ICUAS, 2023, : 543 - 548
  • [45] A cooperative MEC framework based on multi-UAV and AP to minimize weighted energy consumption
    Tang, Qiang
    Liu, Chuan
    Li, Linjiang
    He, Shiming
    Wang, Jin
    PERVASIVE AND MOBILE COMPUTING, 2023, 93
  • [46] A hierarchical reinforcement learning framework for multi-UAV combat using leader-follower strategy
    Pang, Jinhui
    He, Jinglin
    Mohamed, Noureldin Mohamed Abdelaal Ahmed
    Lin, Changqing
    Zhang, Zhihui
    Hao, Xiaoshuai
    KNOWLEDGE-BASED SYSTEMS, 2025, 316
  • [47] Evidential map-building approaches for multi-UAV cooperative search
    Yang, YL
    Minai, AA
    Polycarpou, MM
    ACC: PROCEEDINGS OF THE 2005 AMERICAN CONTROL CONFERENCE, VOLS 1-7, 2005, : 116 - 121
  • [48] Multi-UAV Dynamic Wireless Networking With Deep Reinforcement Learning
    Wang, Qiang
    Zhang, Wenqi
    Liu, Yuanwei
    Liu, Ying
    IEEE COMMUNICATIONS LETTERS, 2019, 23 (12) : 2243 - 2246
  • [49] Bird Flocking Inspired Methods for Multi-UAV Cooperative Target Search
    Shen, Yankai
    Wei, Chen
    Sun, Yongbin
    Duan, Haibin
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS II-EXPRESS BRIEFS, 2024, 71 (02) : 702 - 706
  • [50] Multi-UAV Conflict Resolution with Graph Convolutional Reinforcement Learning
    Isufaj, Ralvi
    Omeri, Marsel
    Piera, Miquel Angel
    APPLIED SCIENCES-BASEL, 2022, 12 (02):