Deep reinforcement learning for time-critical wilderness search and rescue using drones

被引:0
|
作者
Ewers, Jan-Hendrik [1 ]
Anderson, David [1 ]
Thomson, Douglas [1 ]
机构
[1] Univ Glasgow, Autonomous Syst & Connect, Glasgow City, Scotland
来源
基金
英国工程与自然科学研究理事会;
关键词
reinforcement learning; search planning; mission planning; autonomous systems; wilderness search and rescue; unmanned aerial vehicle; machine learning;
D O I
10.3389/frobt.2024.1527095
中图分类号
TP24 [机器人技术];
学科分类号
080202 ; 1405 ;
摘要
Traditional search and rescue methods in wilderness areas can be time-consuming and have limited coverage. Drones offer a faster and more flexible solution, but optimizing their search paths is crucial for effective operations. This paper proposes a novel algorithm using deep reinforcement learning to create efficient search paths for drones in wilderness environments. Our approach leverages a priori data about the search area and the missing person in the form of a probability distribution map. This allows the policy to learn optimal flight paths that maximize the probability of finding the missing person quickly. Experimental results show that our method achieves a significant improvement in search times compared to traditional coverage planning and search planning algorithms by over 160 % , a difference that can mean life or death in real-world search operations Additionally, unlike previous work, our approach incorporates a continuous action space enabled by cubature, allowing for more nuanced flight patterns.
引用
收藏
页数:10
相关论文
共 50 条
  • [31] Aerial filming with synchronized drones using reinforcement learning
    Goh, Kenneth C. W.
    Ng, Raymond B. C.
    Wong, Yoke-Keong
    Ho, Nicholas J. H.
    Chua, Matthew C. H.
    MULTIMEDIA TOOLS AND APPLICATIONS, 2021, 80 (12) : 18125 - 18150
  • [32] Optimal Feature Search for Vigilance Estimation Using Deep Reinforcement Learning
    Seok, Woojoon
    Yeo, Minsoo
    You, Jiwoo
    Lee, Heejun
    Cho, Taeheum
    Hwang, Bosun
    Park, Cheolsoo
    ELECTRONICS, 2020, 9 (01)
  • [33] Optimal Path Search for Robot Manipulator using Deep Reinforcement Learning
    Sunwoo Y.
    Lee W.C.
    IEIE Transactions on Smart Processing and Computing, 2021, 10 (05): : 424 - 430
  • [34] Deep Reinforcement Learning for Internet of Drones Networks: Issues and Research Directions
    Aboueleneen, Noor
    Alwarafy, Abdulmalik
    Abdallah, Mohamed
    IEEE OPEN JOURNAL OF THE COMMUNICATIONS SOCIETY, 2023, 4 : 671 - 683
  • [35] PBRL-TChain: A performance-enhanced permissioned blockchain for time-critical applications based on reinforcement learning
    Zhang, Yiguang
    Lin, Junxiong
    Lu, Zhihui
    Duan, Qiang
    Huang, Shih-Chia
    FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2024, 154 : 301 - 313
  • [36] A time-critical crowdsourced computational search for the origins of COVID-19
    Manuel Cebrian
    Nature Electronics, 2021, 4 : 450 - 451
  • [37] A time-critical crowdsourced computational search for the origins of COVID-19
    Cebrian, Manuel
    NATURE ELECTRONICS, 2021, 4 (07) : 450 - 451
  • [38] Proposal of Feature Value Selection Method for Time-Critical Learning
    Yuyama, Kanami
    Nishi, Hiroaki
    2018 IEEE 23RD INTERNATIONAL CONFERENCE ON EMERGING TECHNOLOGIES AND FACTORY AUTOMATION (ETFA), 2018, : 1365 - 1371
  • [39] First Report of Using Portable Unmanned Aircraft Systems (Drones) for Search and Rescue
    van Tilburg, Christopher
    WILDERNESS & ENVIRONMENTAL MEDICINE, 2017, 28 (02) : 116 - 118
  • [40] Satellite Image Segmentation with Deep Residual Architectures for Time-Critical Applications
    Ghassemi, Sina
    Sandu, Constantin
    Fiandrotti, Attilio
    Tonolo, Fabio Giulio
    Boccardo, Piero
    Francini, Gianluca
    Magli, Enrico
    2018 26TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2018, : 2235 - 2239