Reinforcement learning framework for UAV-based target localization applications

被引:14
|
作者
Shurrab, Mohammed [1 ]
Mizouni, Rabeb [1 ]
Singh, Shakti [1 ]
Otrok, Hadi [1 ]
机构
[1] Khalifa Univ, Elect Engn & Comp Sci Dept, Abu Dhabi 127788, U Arab Emirates
关键词
Target localization; Unmanned aerial vehicle (UAV); Reinforcement learning (RL); Deep Q-network (DQN); Data-driven; Deep reinforcement learning (DRL); Smart environmental monitoring (SEM); INTERNET; SYSTEM; THINGS; IOT;
D O I
10.1016/j.iot.2023.100867
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Smart environmental monitoring has gained prominence, where target localization is of utmost importance. Employing UAVs for localization tasks is appealing owing to their low-cost, light-weight, and high maneuverability. However, UAVs lack the autonomy of decision-making if met with uncertain situations. Therefore, reinforcement learning (RL) can introduce intelligence to UAVs, where they learn to act based on the presented situation. Existing works focus on UAV trajectory optimization, navigation, and target tracking. These methods are application-specific and cannot be adapted to localization tasks since they require prior knowledge of the target. Moreover, the current RL-based autonomous target localization systems are lacking since-1) they must keep track of all visited locations and their corresponding readings, 2) they require retraining when encountering new environments, and 3) they are not scalable since the agent's movement is limited to slow speeds and for specific environments. Therefore, this work proposes a data-driven UAV target localization system based on Q-learning, which employs tabular methods to learn the optimal policy. Deep Q-network (DQN) is introduced to enhance the RL model and alleviate the curse of dimensionality. The proposed models enable smart decision-making, where the sensory information gathered by the UAV is exploited to produce the best action. Moreover, the UAV movement is modeled based on motion physics, where the actions correspond to linear velocities and heading angles. The proposed approach is compared with different benchmarks, where the results indicate that a more efficient, scalable, and adaptable localization is achieved, irrespective of the environment or source characteristics, without retraining.
引用
收藏
页数:16
相关论文
共 50 条
  • [31] A UAV-based framework for quick recognition of pipeline defects
    Ma, Yinghan
    Zhao, Hong
    Miao, Xingyuan
    Gao, Boxuan
    Song, Fulin
    MEASUREMENT SCIENCE AND TECHNOLOGY, 2025, 36 (01)
  • [32] Modelling Framework for Reinforcement Learning based Scheduling Applications
    Steinbacher, Lennart M.
    Ait-Alla, Abderahim
    Rippel, Daniel
    Duee, Tim
    Freitag, Michael
    IFAC PAPERSONLINE, 2022, 55 (10): : 67 - 72
  • [33] Multiple Radio Transmitter Localization via UAV-Based Mapping
    Li, Zhuyin
    Giorgetti, Andrea
    Kandeepan, Sithamparanathan
    IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2021, 70 (09) : 8811 - 8822
  • [34] UAV-based smart rock localization for bridge scour monitoring
    Haibin Zhang
    Zhaochao Li
    Genda Chen
    Alec Reven
    Buddy Scharfenberg
    Jinping Ou
    Journal of Civil Structural Health Monitoring, 2021, 11 : 301 - 313
  • [35] An Effective Transfer Learning Based Landmark Detection Framework for UAV-Based Aerial Imagery of Urban Landscapes
    Praveen, Bishwas
    Menon, Vineetha
    Mukherjee, Tathagata
    Mesmer, Bryan
    Gholston, Sampson
    Corns, Steven
    SOUTHEASTCON 2023, 2023, : 844 - 850
  • [36] Research on Target Capturing of UAV Circumnavigation Formation Based on Deep Reinforcement Learning
    Xia, Qianxin
    Li, Peng
    Shi, Xufeng
    Li, Qian
    Cai, Weijun
    PROCEEDINGS OF 2022 INTERNATIONAL CONFERENCE ON AUTONOMOUS UNMANNED SYSTEMS, ICAUS 2022, 2023, 1010 : 3751 - 3762
  • [37] Autonomous Obstacle Avoidance and Target Tracking of UAV Based on Deep Reinforcement Learning
    Guoqiang Xu
    Weilai Jiang
    Zhaolei Wang
    Yaonan Wang
    Journal of Intelligent & Robotic Systems, 2022, 104
  • [38] Multi-UAV Cooperative Target Assignment Method Based on Reinforcement Learning
    Ding, Yunlong
    Kuang, Minchi
    Shi, Heng
    Gao, Jiazhan
    DRONES, 2024, 8 (10)
  • [39] Autonomous Obstacle Avoidance and Target Tracking of UAV Based on Deep Reinforcement Learning
    Xu, Guoqiang
    Jiang, Weilai
    Wang, Zhaolei
    Wang, Yaonan
    JOURNAL OF INTELLIGENT & ROBOTIC SYSTEMS, 2022, 104 (04)
  • [40] A reinforcement learning approach for UAV target searching and tracking
    Tian Wang
    Ruoxi Qin
    Yang Chen
    Hichem Snoussi
    Chang Choi
    Multimedia Tools and Applications, 2019, 78 : 4347 - 4364