Deep reinforcement learning for time-critical wilderness search and rescue using drones

被引:0
|
作者
Ewers, Jan-Hendrik [1 ]
Anderson, David [1 ]
Thomson, Douglas [1 ]
机构
[1] Univ Glasgow, Autonomous Syst & Connect, Glasgow City, Scotland
来源
基金
英国工程与自然科学研究理事会;
关键词
reinforcement learning; search planning; mission planning; autonomous systems; wilderness search and rescue; unmanned aerial vehicle; machine learning;
D O I
10.3389/frobt.2024.1527095
中图分类号
TP24 [机器人技术];
学科分类号
080202 ; 1405 ;
摘要
Traditional search and rescue methods in wilderness areas can be time-consuming and have limited coverage. Drones offer a faster and more flexible solution, but optimizing their search paths is crucial for effective operations. This paper proposes a novel algorithm using deep reinforcement learning to create efficient search paths for drones in wilderness environments. Our approach leverages a priori data about the search area and the missing person in the form of a probability distribution map. This allows the policy to learn optimal flight paths that maximize the probability of finding the missing person quickly. Experimental results show that our method achieves a significant improvement in search times compared to traditional coverage planning and search planning algorithms by over 160 % , a difference that can mean life or death in real-world search operations Additionally, unlike previous work, our approach incorporates a continuous action space enabled by cubature, allowing for more nuanced flight patterns.
引用
收藏
页数:10
相关论文
共 50 条
  • [1] Wilderness Search and Rescue Missions using Deep Reinforcement Learning
    Peake, Ashley
    McCalmon, Joe
    Zhang, Yixin
    Raiford, Benjamin
    Alqahtani, Sarra
    2020 IEEE INTERNATIONAL SYMPOSIUM ON SAFETY, SECURITY, AND RESCUE ROBOTICS (SSRR 2020), 2020, : 102 - 107
  • [2] Autonomous UAV Navigation in Wilderness Search-and-Rescue Operations Using Deep Reinforcement Learning
    Talha, Muhammad
    Hussein, Aya
    Hossny, Mohammed
    AI 2022: ADVANCES IN ARTIFICIAL INTELLIGENCE, 2022, 13728 : 733 - 746
  • [3] Drones Chasing Drones: Reinforcement Learning and Deep Search Area Proposal
    Akhloufi, Moulay A.
    Arola, Sebastien
    Bonnet, Alexandre
    DRONES, 2019, 3 (03) : 1 - 14
  • [4] Deep Reinforcement Learning for Autonomous Search and Rescue
    Zuluaga, Juan Gonzalo Carcamo
    Leidig, Jonathan P.
    Trefftz, Christian
    Wolffe, Greg
    NAECON 2018 - IEEE NATIONAL AEROSPACE AND ELECTRONICS CONFERENCE, 2018, : 521 - 524
  • [5] Time-Critical Search
    Mishra, Nina
    White, Ryen W.
    Ieong, Samuel
    Horvitz, Eric
    SIGIR'14: PROCEEDINGS OF THE 37TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, 2014, : 747 - 756
  • [6] Search is a time-critical event: When search and rescue missions may become futile
    Adams, Annette L.
    Schmidt, Terri A.
    Newgard, Craig D.
    Federiuk, Carol S.
    Christie, Michael
    Scorvo, Sean
    DeFreest, Melissa
    WILDERNESS & ENVIRONMENTAL MEDICINE, 2007, 18 (02) : 95 - 101
  • [7] Deep Reinforcement Learning based Elasticity-compatible Heterogeneous Resource Management for Time-critical Computing
    Liu, Zixia
    Wang, Liqiang
    Quan, Gang
    PROCEEDINGS OF THE 49TH INTERNATIONAL CONFERENCE ON PARALLEL PROCESSING, ICPP 2020, 2020,
  • [8] Energy-Efficient Computation Offloading With DVFS Using Deep Reinforcement Learning for Time-Critical IoT Applications in Edge Computing
    Panda, Saroj Kumar
    Lin, Man
    Zhou, Ti
    IEEE INTERNET OF THINGS JOURNAL, 2023, 10 (08) : 6611 - 6621
  • [9] 3-D Active Sensing in Time-Critical Urban Search and Rescue Missions
    Mobedi, Babak
    Nejat, Goldie
    IEEE-ASME TRANSACTIONS ON MECHATRONICS, 2012, 17 (06) : 1111 - 1119
  • [10] Time-critical testing and search problems
    Agnetis, Alessandro
    Ben Hermans
    Leus, Roel
    Rostami, Salim
    EUROPEAN JOURNAL OF OPERATIONAL RESEARCH, 2022, 296 (02) : 440 - 452