Integrating human experience in deep reinforcement learning for multi-UAV collision detection and avoidance

被引:0
|
作者
Wang, Guanzheng [1 ]
Xu, Yinbo [1 ]
Liu, Zhihong [1 ]
Xu, Xin [1 ]
Wang, Xiangke [1 ]
Yan, Jiarun [1 ]
机构
[1] College of Intelligence Science and Technology, National University of Defense Technology, Changsha, China
来源
Industrial Robot | 2022年 / 49卷 / 02期
基金
中国国家自然科学基金;
关键词
Aircraft detection - Deep learning - Learning systems - Unmanned aerial vehicles (UAV) - Aircraft control - Collision avoidance - Antennas - Flight simulators;
D O I
暂无
中图分类号
学科分类号
摘要
Purpose: This paper aims to realize a fully distributed multi-UAV collision detection and avoidance based on deep reinforcement learning (DRL). To deal with the problem of low sample efficiency in DRL and speed up the training. To improve the applicability and reliability of the DRL-based approach in multi-UAV control problems. Design/methodology/approach: In this paper, a fully distributed collision detection and avoidance approach for multi-UAV based on DRL is proposed. A method that integrates human experience into policy training via a human experience-based adviser is proposed. The authors propose a hybrid control method which combines the learning-based policy with traditional model-based control. Extensive experiments including simulations, real flights and comparative experiments are conducted to evaluate the performance of the approach. Findings: A fully distributed multi-UAV collision detection and avoidance method based on DRL is realized. The reward curve shows that the training process when integrating human experience is significantly accelerated and the mean episode reward is higher than the pure DRL method. The experimental results show that the DRL method with human experience integration has a significant improvement than the pure DRL method for multi-UAV collision detection and avoidance. Moreover, the safer flight brought by the hybrid control method has also been validated. Originality/value: The fully distributed architecture is suitable for large-scale unmanned aerial vehicle (UAV) swarms and real applications. The DRL method with human experience integration has significantly accelerated the training compared to the pure DRL method. The proposed hybrid control strategy makes up for the shortcomings of two-dimensional light detection and ranging and other puzzles in applications. © 2021, Emerald Publishing Limited.
引用
收藏
页码:256 / 270
相关论文
共 50 条
  • [41] Formation control of Multi-UAV with collision avoidance using artificial potential field
    Wang, Yifan
    Sun, Xingyan
    2019 11TH INTERNATIONAL CONFERENCE ON INTELLIGENT HUMAN-MACHINE SYSTEMS AND CYBERNETICS (IHMSC 2019), VOL 1, 2019, : 296 - 300
  • [42] Multi-Agent Deep Reinforcement Learning for Trajectory Design and Power Allocation in Multi-UAV Networks
    Zhao, Nan
    Liu, Zehua
    Cheng, Yiqiang
    IEEE ACCESS, 2020, 8 : 139670 - 139679
  • [43] Formation control and collision avoidance for multi-UAV systems based on Voronoi partition
    HU JinWen
    WANG Man
    ZHAO ChunHui
    PAN Quan
    DU Chang
    Science China(Technological Sciences), 2020, (01) : 65 - 72
  • [44] Experimental Validation of Cooperative Formation Control with Collision Avoidance for a Multi-UAV System
    Kuriki, Yasuhiro
    Namerikawa, Toru
    PROCEEDINGS OF THE 2015 6TH INTERNATIONAL CONFERENCE ON AUTOMATION, ROBOTICS AND APPLICATIONS (ICARA), 2015, : 531 - 536
  • [45] Optimal Reciprocal Collision Avoidance with Mobile and Static Obstacles for Multi-UAV Systems
    Alejo, D.
    Cobano, J. A.
    Heredia, G.
    Ollero, A.
    2014 INTERNATIONAL CONFERENCE ON UNMANNED AIRCRAFT SYSTEMS (ICUAS), 2014, : 1259 - 1266
  • [46] Deep Reinforcement Learning-enabled Dynamic UAV Deployment and Power Control in Multi-UAV Wireless Networks
    Bai, Yu
    Chang, Zheng
    Jantti, Riku
    ICC 2024 - IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS, 2024, : 1286 - 1291
  • [47] Pedestrian Collision Avoidance Using Deep Reinforcement Learning
    Alireza Rafiei
    Amirhossein Oliaei Fasakhodi
    Farshid Hajati
    International Journal of Automotive Technology, 2022, 23 : 613 - 622
  • [48] Deep Reinforcement Learning for Collision Avoidance of Robotic Manipulators
    Sangiovanni, Bianca
    Rendiniello, Angelo
    Incremona, Gian Paolo
    Ferrara, Antonella
    Piastra, Marco
    2018 EUROPEAN CONTROL CONFERENCE (ECC), 2018, : 2063 - 2068
  • [49] Deep Reinforcement Learning for Collision Avoidance of Autonomous Vehicle
    Tseng, Hsiao-Ting
    Hsieh, Chen-Chiung
    Lin, Wei-Ting
    Lin, Jyun-Ting
    2020 IEEE INTERNATIONAL CONFERENCE ON CONSUMER ELECTRONICS - TAIWAN (ICCE-TAIWAN), 2020,
  • [50] Pedestrian Collision Avoidance Using Deep Reinforcement Learning
    Rafiei, Alireza
    Fasakhodi, Amirhossein Oliaei
    Hajati, Farshid
    INTERNATIONAL JOURNAL OF AUTOMOTIVE TECHNOLOGY, 2022, 23 (03) : 613 - 622