Reinforcement learning framework for UAV-based target localization applications

Cited by: 14
Authors
Shurrab, Mohammed [1]
Mizouni, Rabeb [1]
Singh, Shakti [1]
Otrok, Hadi [1]
Affiliations
[1] Khalifa Univ, Elect Engn & Comp Sci Dept, Abu Dhabi 127788, U Arab Emirates
Keywords
Target localization; Unmanned aerial vehicle (UAV); Reinforcement learning (RL); Deep Q-network (DQN); Data-driven; Deep reinforcement learning (DRL); Smart environmental monitoring (SEM); INTERNET; SYSTEM; THINGS; IOT;
DOI
10.1016/j.iot.2023.100867
Chinese Library Classification number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Smart environmental monitoring has gained prominence, and target localization is central to it. UAVs are appealing for localization tasks owing to their low cost, light weight, and high maneuverability. However, UAVs lack autonomous decision-making when faced with uncertain situations. Reinforcement learning (RL) can introduce intelligence to UAVs, letting them learn to act based on the situation presented. Existing works focus on UAV trajectory optimization, navigation, and target tracking. These methods are application-specific and cannot be adapted to localization tasks since they require prior knowledge of the target. Moreover, current RL-based autonomous target localization systems fall short because 1) they must keep track of all visited locations and their corresponding readings, 2) they require retraining when encountering new environments, and 3) they are not scalable, since the agent's movement is limited to slow speeds and to specific environments. Therefore, this work proposes a data-driven UAV target localization system based on Q-learning, which employs tabular methods to learn the optimal policy. A deep Q-network (DQN) is introduced to enhance the RL model and alleviate the curse of dimensionality. The proposed models enable smart decision-making, where the sensory information gathered by the UAV is exploited to produce the best action. Moreover, the UAV movement is modeled based on motion physics, where the actions correspond to linear velocities and heading angles. The proposed approach is compared with different benchmarks, and the results indicate that more efficient, scalable, and adaptable localization is achieved, irrespective of the environment or source characteristics, without retraining.
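As a rough illustration of the two mechanisms the abstract names (tabular Q-learning, and actions expressed as linear velocities and heading angles), here is a minimal Python sketch. It is not the authors' implementation. The speed values, the eight headings, the time step, the string state labels, and the hyperparameters alpha, gamma, and epsilon are all assumptions made for the example, since the abstract does not specify the state encoding, reward, or action set.

```python
import math
import random
from collections import defaultdict

# Hypothetical action set (assumption): every pairing of a linear
# velocity in m/s with a heading angle in radians.
SPEEDS = (1.0, 2.0)
HEADINGS = tuple(k * math.pi / 4 for k in range(8))
ACTIONS = [(v, h) for v in SPEEDS for h in HEADINGS]


def move(x, y, action, dt=1.0):
    """Advance the UAV position by one step of constant-velocity motion."""
    v, heading = action
    return x + v * math.cos(heading) * dt, y + v * math.sin(heading) * dt


def make_q_table():
    """Q-table mapping any hashable state to one value per action index."""
    return defaultdict(lambda: [0.0] * len(ACTIONS))


def choose_action(q, state, epsilon, rng=random):
    """Epsilon-greedy selection; returns an index into ACTIONS."""
    if rng.random() < epsilon:
        return rng.randrange(len(ACTIONS))
    values = q[state]
    return max(range(len(ACTIONS)), key=values.__getitem__)


def q_update(q, state, a, reward, next_state, alpha=0.1, gamma=0.9):
    """Standard tabular Q-learning update toward the one-step TD target."""
    td_target = reward + gamma * max(q[next_state])
    q[state][a] += alpha * (td_target - q[state][a])
```

In a sketch like this, the state passed to `choose_action` would be some discretization of the UAV's sensor readings, and the reward would reflect progress toward the source. Both are left open here because the abstract does not define them.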
Pages: 16
Related papers
50 in total
  • [1] Autonomous UAV-based Target Search, Tracking and Following using Reinforcement Learning and YOLOFlow
    Ajmera, Yug
    Singh, Surya Pratap
    2020 IEEE INTERNATIONAL SYMPOSIUM ON SAFETY, SECURITY, AND RESCUE ROBOTICS (SSRR 2020), 2020, : 15 - 20
  • [2] UAV-based Localization for Layered Framework of the Internet of Things
    Pandey, Saurabh K.
    Zaveri, Mukesh A.
    Choksi, Meghavi
    Kumar, J. Sathish
    8TH INTERNATIONAL CONFERENCE ON ADVANCES IN COMPUTING & COMMUNICATIONS (ICACC-2018), 2018, 143 : 728 - 735
  • [3] Autonomous UAV-based surveillance system for multi-target detection using reinforcement learning
    Salameh, Haythem Bany
    Hussienat, Ayyoub
    Alhafnawi, Mohannad
    Al-Ajlouni, Ahmad
    CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS, 2024, 27 (07): : 9381 - 9394
  • [4] A Hybrid Framework of Reinforcement Learning and Convex Optimization for UAV-Based Autonomous Metaverse Data Collection
    Si, Peiyuan
    Qian, Liangxin
    Zhao, Jun
    Lam, Kwok-Yan
    IEEE NETWORK, 2023, 37 (04): : 248 - 254
  • [5] Deep Reinforcement Learning for UAV-Based SDWSN Data Collection
    Karegar, Pejman A.
    Al-Hamid, Duaa Zuhair
    Chong, Peter Han Joo
    FUTURE INTERNET, 2024, 16 (11)
  • [6] UAV-based Localization of Mobile Phones for Search and Rescue Applications
    Dorn, Christian
    Depold, Andreas
    Lurz, Fabian
    Erhardt, Stefan
    Hagelauer, Amelie
    2022 IEEE 22ND ANNUAL WIRELESS AND MICROWAVE TECHNOLOGY CONFERENCE (WAMICON), 2022,
  • [8] Reinforcement Learning for Improved UAV-based Integrated Access and Backhaul Operation
    Tafintsev, Nikita
    Moltchanov, Dmitri
    Simsek, Meryem
    Yeh, Shu-ping
    Andreev, Sergey
    Koucheryavy, Yevgeni
    Valkama, Mikko
    2020 IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS WORKSHOPS (ICC WORKSHOPS), 2020,
  • [10] Illegal Radio Station Localization with UAV-Based Q-Learning
    Wu, Shengjun
    CHINA COMMUNICATIONS, 2018, 15 (12) : 122 - 131