Deep reinforcement learning for time-critical wilderness search and rescue using drones

Cited by: 0

Authors:

Ewers, Jan-Hendrik [1]

Anderson, David [1]

Thomson, Douglas [1]

Affiliations:

[1] Univ Glasgow, Autonomous Syst & Connect, Glasgow City, Scotland

Source:

FRONTIERS IN ROBOTICS AND AI | 2025 / Vol. 11

Funding:

Engineering and Physical Sciences Research Council (EPSRC), UK;

Keywords:

reinforcement learning; search planning; mission planning; autonomous systems; wilderness search and rescue; unmanned aerial vehicle; machine learning;

DOI:

10.3389/frobt.2024.1527095

Chinese Library Classification (CLC) number:

TP24 [Robotics];

Discipline classification code:

080202 ; 1405 ;

Abstract:

Traditional search and rescue methods in wilderness areas can be time-consuming and have limited coverage. Drones offer a faster and more flexible solution, but optimizing their search paths is crucial for effective operations. This paper proposes a novel algorithm using deep reinforcement learning to create efficient search paths for drones in wilderness environments. Our approach leverages a priori data about the search area and the missing person in the form of a probability distribution map. This allows the policy to learn optimal flight paths that maximize the probability of finding the missing person quickly. Experimental results show that our method improves search times by over 160% compared to traditional coverage planning and search planning algorithms, a difference that can mean life or death in real-world search operations. Additionally, unlike previous work, our approach incorporates a continuous action space enabled by cubature, allowing for more nuanced flight patterns.


Pages: 10
