Multi-UAV Cooperative Search Based on Reinforcement Learning With a Digital Twin Driven Training Framework

Cited by: 32
Authors
Shen, Gaoqing [1]
Lei, Lei [1]
Zhang, Xinting [1]
Li, Zhilin [1]
Cai, Shengsuo [1]
Zhang, Lijuan [1]
Affiliations
[1] Nanjing Univ Aeronaut & Astronaut, Coll Elect & Informat Engn, Nanjing 211106, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Cooperative target search; digital twin; multi-agent deep reinforcement learning; unmanned aerial vehicles; TARGET SEARCH; FUSION;
DOI
10.1109/TVT.2023.3245120
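Chinese Library Classification
TM [Electrical engineering]; TN [Electronics and communication technology]
Discipline classification codes
0808; 0809
Abstract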
This paper considers the cooperative search for stationary targets by multiple unmanned aerial vehicles (UAVs) with limited sensing range and communication ability in a dynamic, threatening environment. The main purpose is to use multiple UAVs to find more unknown targets as quickly as possible, increase the coverage rate of the mission area, and, more importantly, guide UAVs away from threats. However, traditional search methods mostly scale poorly and perform poorly in dynamic environments. A new multi-agent deep reinforcement learning (MADRL) method, DNQMIX, is proposed in this study to solve the multi-UAV cooperative target search (MCTS) problem. A new reward function is also designed for the MCTS problem to guide UAVs to explore and exploit environment information more efficiently. Moreover, this paper proposes a digital twin (DT) driven training framework, "centralized training, decentralized execution, and continuous evolution" (CTDECE). It facilitates the continuous evolution of MADRL models and addresses the tradeoff between training speed and environment fidelity when MADRL is applied to real-world multi-UAV systems. Simulation results show that DNQMIX outperforms state-of-the-art methods in terms of search rate and coverage rate.
Pages: 8354 - 8368
Number of pages: 15
Related papers
50 in total
  • [21] Scalable and Cooperative Deep Reinforcement Learning Approaches for Multi-UAV Systems: A Systematic Review
    Frattolillo, Francesco
    Brunori, Damiano
    Iocchi, Luca
    DRONES, 2023, 7 (04)
  • [22] Optimization Design of Multi-UAV Communication Network Based on Reinforcement Learning
    Cao, Zhengyang
    WIRELESS COMMUNICATIONS & MOBILE COMPUTING, 2022, 2022
  • [23] Reinforcement Learning Based Trajectory Planning for Multi-UAV Load Transportation
    Estevez, Julian
    Manuel Lopez-Guede, Jose
    del Valle-Echavarri, Javier
    Grana, Manuel
    IEEE ACCESS, 2024, 12 : 144009 - 144016
  • [24] Collision Detection and Avoidance for Multi-UAV based on Deep Reinforcement Learning
    Wang, Guanzheng
    Liu, Zhihong
    Xiao, Kun
    Xu, Yinbo
    Yang, Lingjie
    Wang, Xiangke
    2021 PROCEEDINGS OF THE 40TH CHINESE CONTROL CONFERENCE (CCC), 2021 : 7783 - 7789
  • [25] Distributed cooperative search methods of multi-UAV based on prediction of moving targets
    Qi, Xiao-Ming
    Wei, Rui-Xuan
    Shen, Dong
    Ru, Chang-Jian
    Zhou, Huan
    Xi Tong Gong Cheng Yu Dian Zi Ji Shu/Systems Engineering and Electronics, 2014, 36 (12) : 2417 - 2425
  • [26] An Improved Potential Game Theory Based Method for Multi-UAV Cooperative Search
    Ni, Jianjun
    Tang, Guangyi
    Mo, Zhengpei
    Cao, Weidong
    Yang, Simon X.
    IEEE ACCESS, 2020, 8 : 47787 - 47796
  • [27] Multi-UAV cooperative system for search and rescue based on YOLOv5
    Xing, Linjie
    Fan, Xiaoyan
    Dong, Yaxin
    Xiong, Zenghui
    Xing, Lin
    Yang, Yang
    Bai, Haicheng
    Zhou, Chengjiang
    INTERNATIONAL JOURNAL OF DISASTER RISK REDUCTION, 2022, 76
  • [28] A Multi-UAV Cooperative Search System Design Based on Man-in-the-loop
    Liu, Xiyang
    Lin, Zhaochen
    Niu, Yinbao
    Lyu, Zibo
    Xu, Qinzhe
    Cui, Bohan
    Deng, Tianchen
    PROCEEDINGS OF 2020 3RD INTERNATIONAL CONFERENCE ON UNMANNED SYSTEMS (ICUS), 2020 : 757 - 762
  • [29] Distributed cooperative search method for multi-UAV with unstable communications
    Zhang, Huaqing
    Ma, Hongbin
    Mersha, Bemnet Wondimagegnehu
    Zhang, Xiaofei
    Jin, Ying
    APPLIED SOFT COMPUTING, 2023, 148
  • [30] Multi-UAV cooperative search on region division and path planning
    Dai J.
    Xu F.
    Chen Q.
    Hangkong Xuebao/Acta Aeronautica et Astronautica Sinica, 2020, 41