Reinforcement learning framework for UAV-based target localization applications

被引:14
|
作者
Shurrab, Mohammed [1 ]
Mizouni, Rabeb [1 ]
Singh, Shakti [1 ]
Otrok, Hadi [1 ]
机构
[1] Khalifa Univ, Elect Engn & Comp Sci Dept, Abu Dhabi 127788, U Arab Emirates
关键词
Target localization; Unmanned aerial vehicle (UAV); Reinforcement learning (RL); Deep Q-network (DQN); Data-driven; Deep reinforcement learning (DRL); Smart environmental monitoring (SEM); INTERNET; SYSTEM; THINGS; IOT;
D O I
10.1016/j.iot.2023.100867
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Smart environmental monitoring has gained prominence, where target localization is of utmost importance. Employing UAVs for localization tasks is appealing owing to their low-cost, light-weight, and high maneuverability. However, UAVs lack the autonomy of decision-making if met with uncertain situations. Therefore, reinforcement learning (RL) can introduce intelligence to UAVs, where they learn to act based on the presented situation. Existing works focus on UAV trajectory optimization, navigation, and target tracking. These methods are application-specific and cannot be adapted to localization tasks since they require prior knowledge of the target. Moreover, the current RL-based autonomous target localization systems are lacking since-1) they must keep track of all visited locations and their corresponding readings, 2) they require retraining when encountering new environments, and 3) they are not scalable since the agent's movement is limited to slow speeds and for specific environments. Therefore, this work proposes a data-driven UAV target localization system based on Q-learning, which employs tabular methods to learn the optimal policy. Deep Q-network (DQN) is introduced to enhance the RL model and alleviate the curse of dimensionality. The proposed models enable smart decision-making, where the sensory information gathered by the UAV is exploited to produce the best action. Moreover, the UAV movement is modeled based on motion physics, where the actions correspond to linear velocities and heading angles. The proposed approach is compared with different benchmarks, where the results indicate that a more efficient, scalable, and adaptable localization is achieved, irrespective of the environment or source characteristics, without retraining.
引用
收藏
页数:16
相关论文
共 50 条
  • [41] A reinforcement learning approach for UAV target searching and tracking
    Wang, Tian
    Qin, Ruoxi
    Chen, Yang
    Snoussi, Hichem
    Choi, Chang
    MULTIMEDIA TOOLS AND APPLICATIONS, 2019, 78 (04) : 4347 - 4364
  • [42] UAV-based smart rock localization for bridge scour monitoring
    Zhang, Haibin
    Li, Zhaochao
    Chen, Genda
    Reven, Alec
    Scharfenberg, Buddy
    Ou, Jinping
    JOURNAL OF CIVIL STRUCTURAL HEALTH MONITORING, 2021, 11 (02) : 301 - 313
  • [43] Towards UAV-Based Absolute Hierarchical Localization in Confined Spaces
    Brogaard, Rune Y.
    Zajaczkowski, Marcin
    Kovac, Luka
    Ravn, Ole
    Boukas, Evangelos
    2020 IEEE INTERNATIONAL SYMPOSIUM ON SAFETY, SECURITY, AND RESCUE ROBOTICS (SSRR 2020), 2020, : 182 - 188
  • [44] Deep Reinforcement Learning for Interference Management in UAV-Based 3D Networks: Potentials and Challenges
    Vaezi, Mojtaba
    Lin, Xingqin
    Zhang, Hongliang
    Saad, Walid
    Poor, H. Vincent
    IEEE COMMUNICATIONS MAGAZINE, 2024, 62 (02) : 134 - 140
  • [45] AirEye: UAV-Based Intelligent DRL Mobile Target Visitation
    Soliman, Abdulrahman
    Bahri, Mohamad
    Izham, Daniel
    Mohamed, Amr
    2022 INTERNATIONAL WIRELESS COMMUNICATIONS AND MOBILE COMPUTING, IWCMC, 2022, : 554 - 559
  • [46] UAV-Based Bridge Inspection via Transfer Learning
    Aliyari, Mostafa
    Droguett, Enrique Lopez
    Ayele, Yonas Zewdu
    SUSTAINABILITY, 2021, 13 (20)
  • [47] An informative path planning framework for UAV-based terrain monitoring
    Marija Popović
    Teresa Vidal-Calleja
    Gregory Hitz
    Jen Jen Chung
    Inkyu Sa
    Roland Siegwart
    Juan Nieto
    Autonomous Robots, 2020, 44 : 889 - 911
  • [48] IoT Sensor Selection for Target Localization: A Reinforcement Learning based Approach
    Shurrab, Mohammed
    Singh, Shakti
    Mizouni, Rabeb
    Otrok, Hadi
    AD HOC NETWORKS, 2022, 134
  • [49] A Framework for Information Freshness Analysis in UAV-based Sensing and Communications
    Hazarika, Ananya
    Rahmati, Mehdi
    2022 WIRELESS TELECOMMUNICATIONS SYMPOSIUM (WTS), 2022,
  • [50] An informative path planning framework for UAV-based terrain monitoring
    Popovic, Marija
    Vidal-Calleja, Teresa
    Hitz, Gregory
    Chung, Jen Jen
    Sa, Inkyu
    Siegwart, Roland
    Nieto, Juan
    AUTONOMOUS ROBOTS, 2020, 44 (06) : 889 - 911