Deep reinforcement learning for time-critical wilderness search and rescue using drones

Citations: 0
Authors
Ewers, Jan-Hendrik [1]
Anderson, David [1]
Thomson, Douglas [1]
Affiliations
[1] Univ Glasgow, Autonomous Syst & Connect, Glasgow City, Scotland
Source
Funding
UK Engineering and Physical Sciences Research Council;
Keywords
reinforcement learning; search planning; mission planning; autonomous systems; wilderness search and rescue; unmanned aerial vehicle; machine learning;
DOI
10.3389/frobt.2024.1527095
Chinese Library Classification number
TP24 [Robotics];
Discipline classification code
080202 ; 1405 ;
Abstract
Traditional search and rescue methods in wilderness areas can be time-consuming and have limited coverage. Drones offer a faster and more flexible solution, but optimizing their search paths is crucial for effective operations. This paper proposes a novel algorithm using deep reinforcement learning to create efficient search paths for drones in wilderness environments. Our approach leverages a priori data about the search area and the missing person in the form of a probability distribution map. This allows the policy to learn optimal flight paths that maximize the probability of finding the missing person quickly. Experimental results show that our method achieves a significant improvement in search times compared to traditional coverage planning and search planning algorithms by over 160%, a difference that can mean life or death in real-world search operations. Additionally, unlike previous work, our approach incorporates a continuous action space enabled by cubature, allowing for more nuanced flight patterns.
Pages: 10
Related papers
50 in total
  • [11] Explainability of Deep Reinforcement Learning Method with Drones
    Cetin, Ender
    Barrado, Cristina
    Pastor, Enric
    2023 IEEE/AIAA 42ND DIGITAL AVIONICS SYSTEMS CONFERENCE, DASC, 2023,
  • [12] Deep Learning and Statistical Models for Time-Critical Pedestrian Behaviour Prediction
    Dabrowski, Joel Janek
    de Villiers, Johan Pieter
    Rahman, Ashfaqur
    Beyers, Conrad
    NEURAL INFORMATION PROCESSING (ICONIP 2019), PT IV, 2019, 1142 : 458 - 465
  • [13] Deep Reinforcement Learning for Frontal View Person Shooting using Drones
    Passalis, Nikolaos
    Tefas, Anastasios
    PROCEEDINGS OF THE 2018 IEEE INTERNATIONAL CONFERENCE ON EVOLVING AND ADAPTIVE INTELLIGENT SYSTEMS (EAIS), 2018,
  • [14] Optimised Deep Learning for Time-Critical Load Forecasting Using LSTM and Modified Particle Swarm Optimisation
    Zulfiqar, M.
    Gamage, Kelum A. A.
    Rasheed, M. B.
    Gould, C.
    ENERGIES, 2024, 17 (22)
  • [15] Coverage path planning for maritime search and rescue using reinforcement learning
    Ai, Bo
    Jia, Maoxin
    Xu, Hanwen
    Xu, Jiangling
    Wen, Zhen
    Li, Benshuai
    Zhang, Dan
    OCEAN ENGINEERING, 2021, 241
  • [16] Towards Nano-Drones Agile Flight Using Deep Reinforcement Learning
    Mengozzi, Sebastiano
    Zanatta, Luca
    Barchi, Francesco
    Bartolini, Andrea
    Acquaviva, Andrea
    2024 IEEE INTERNATIONAL CONFERENCE ON OMNI-LAYER INTELLIGENT SYSTEMS, COINS 2024, 2024, : 297 - 302
  • [17] Power Control in Internet of Drones by Deep Reinforcement Learning
    Yao, Jingjing
    Ansari, Nirwan
    ICC 2020 - 2020 IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS (ICC), 2020,
  • [18] Deep Reinforcement Learning Robot for Search and Rescue Applications: Exploration in Unknown Cluttered Environments
    Niroui, Farzad
    Zhang, Kaicheng
    Kashino, Zendai
    Nejat, Goldie
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2019, 4 (02) : 610 - 617
  • [19] A Reinforcement Learning Based Resource Management Approach for Time-critical Workloads in Distributed Computing Environment
    Liu, Zixia
    Zhang, Hong
    Rao, Bingbing
    Wang, Liqiang
    2018 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2018, : 252 - 261
  • [20] Dynamic Data Streams for Time-Critical IoT Systems in Energy-Aware IoT Devices Using Reinforcement Learning
    Habeeb, Fawzy
    Szydlo, Tomasz
    Kowalski, Lukasz
    Noor, Ayman
    Thakker, Dhaval
    Morgan, Graham
    Ranjan, Rajiv
    SENSORS, 2022, 22 (06)