Integrating human experience in deep reinforcement learning for multi-UAV collision detection and avoidance

被引:0
|
作者
Wang, Guanzheng [1 ]
Xu, Yinbo [1 ]
Liu, Zhihong [1 ]
Xu, Xin [1 ]
Wang, Xiangke [1 ]
Yan, Jiarun [1 ]
机构
[1] College of Intelligence Science and Technology, National University of Defense Technology, Changsha, China
来源
Industrial Robot | 2022年 / 49卷 / 02期
基金
中国国家自然科学基金;
关键词
Aircraft detection - Deep learning - Learning systems - Unmanned aerial vehicles (UAV) - Aircraft control - Collision avoidance - Antennas - Flight simulators;
D O I
暂无
中图分类号
学科分类号
摘要
Purpose: This paper aims to realize a fully distributed multi-UAV collision detection and avoidance based on deep reinforcement learning (DRL). To deal with the problem of low sample efficiency in DRL and speed up the training. To improve the applicability and reliability of the DRL-based approach in multi-UAV control problems. Design/methodology/approach: In this paper, a fully distributed collision detection and avoidance approach for multi-UAV based on DRL is proposed. A method that integrates human experience into policy training via a human experience-based adviser is proposed. The authors propose a hybrid control method which combines the learning-based policy with traditional model-based control. Extensive experiments including simulations, real flights and comparative experiments are conducted to evaluate the performance of the approach. Findings: A fully distributed multi-UAV collision detection and avoidance method based on DRL is realized. The reward curve shows that the training process when integrating human experience is significantly accelerated and the mean episode reward is higher than the pure DRL method. The experimental results show that the DRL method with human experience integration has a significant improvement than the pure DRL method for multi-UAV collision detection and avoidance. Moreover, the safer flight brought by the hybrid control method has also been validated. Originality/value: The fully distributed architecture is suitable for large-scale unmanned aerial vehicle (UAV) swarms and real applications. The DRL method with human experience integration has significantly accelerated the training compared to the pure DRL method. The proposed hybrid control strategy makes up for the shortcomings of two-dimensional light detection and ranging and other puzzles in applications. © 2021, Emerald Publishing Limited.
引用
收藏
页码:256 / 270
相关论文
共 50 条
  • [31] Trajectory Design and Resource Allocation for Multi-UAV Networks: Deep Reinforcement Learning Approaches
    Chang, Zheng
    Deng, Hengwei
    You, Li
    Min, Geyong
    Garg, Sahil
    Kaddoum, Georges
    IEEE TRANSACTIONS ON NETWORK SCIENCE AND ENGINEERING, 2023, 10 (05): : 2940 - 2951
  • [32] Minimum Throughput Maximization for Multi-UAV Enabled WPCN: A Deep Reinforcement Learning Method
    Tang, Jie
    Song, Jingru
    Ou, Junhui
    Luo, Jingci
    Zhang, Xiuyin
    Wong, Kai-Kit
    IEEE Access, 2020, 8 : 9124 - 9132
  • [33] Deep Reinforcement Learning Approach for Joint Trajectory Design in Multi-UAV IoT Networks
    Xu, Shu
    Zhan, Xiangyu
    Li, Chunguo
    Wang, Dongming
    Yang, Luxi
    IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2022, 71 (03) : 3389 - 3394
  • [34] Deep-reinforcement-learning-based UAV autonomous navigation and collision avoidance in unknown environments
    Fei WANG
    Xiaoping ZHU
    Zhou ZHOU
    Yang TANG
    Chinese Journal of Aeronautics, 2024, 37 (03) : 237 - 257
  • [35] Deep-reinforcement-learning-based UAV autonomous navigation and collision avoidance in unknown environments
    Wang, Fei
    Zhu, Xiaoping
    Zhou, Zhou
    Tang, Yang
    CHINESE JOURNAL OF AERONAUTICS, 2024, 37 (03) : 237 - 257
  • [36] Decentralized Planning-Assisted Deep Reinforcement Learning for Collision and Obstacle Avoidance in UAV Networks
    Lin, Ju-Shan
    Chiu, Hsiao-Ting
    Gau, Rung-Hung
    2021 IEEE 93RD VEHICULAR TECHNOLOGY CONFERENCE (VTC2021-SPRING), 2021,
  • [37] Formation control and collision avoidance for multi-UAV systems based on Voronoi partition
    Hu, JinWen
    Wang, Man
    Zhao, ChunHui
    Pan, Quan
    Du, Chang
    SCIENCE CHINA-TECHNOLOGICAL SCIENCES, 2020, 63 (01) : 65 - 72
  • [38] Formation control and collision avoidance for multi-UAV systems based on Voronoi partition
    JinWen Hu
    Man Wang
    ChunHui Zhao
    Quan Pan
    Chang Du
    Science China Technological Sciences, 2020, 63 : 65 - 72
  • [39] Formation control and collision avoidance for multi-UAV systems based on Voronoi partition
    HU JinWen
    WANG Man
    ZHAO ChunHui
    PAN Quan
    DU Chang
    Science China(Technological Sciences), 2020, 63 (01) : 65 - 72
  • [40] Energy-Efficient Multi-UAV Network using Multi-Agent Deep Reinforcement Learning
    Ju, Hyungyu
    Shim, Byonghyo
    2022 IEEE VTS ASIA PACIFIC WIRELESS COMMUNICATIONS SYMPOSIUM, APWCS, 2022, : 70 - 74