UAV Dynamic Object Tracking with Lightweight Deep Vision Reinforcement Learning

Cited by: 3
Authors
Nguyen, Hy [1]
Thudumu, Srikanth [1]
Du, Hung [1]
Mouzakis, Kon [1]
Vasa, Rajesh [1]
Affiliations
[1] Deakin Univ, Appl Artificial Intelligence Inst A2I2, Geelong, Vic 3216, Australia
Keywords
deep Q-network (DQN); deep deterministic policy gradient (DDPG); deep reinforcement learning (DRL); object tracking; object detection; unmanned aerial vehicle (UAV)
DOI
10.3390/a16050227
Chinese Library Classification (CLC) number
TP18 [Artificial Intelligence Theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Several approaches have applied Deep Reinforcement Learning (DRL) to Unmanned Aerial Vehicles (UAVs) for autonomous object tracking. These methods, however, are resource-intensive and require prior knowledge of the environment, making them difficult to use in real-world applications. In this paper, we propose a Lightweight Deep Vision Reinforcement Learning (LDVRL) framework for dynamic object tracking that uses the camera as the only input source. Our framework employs several techniques, such as stacks of frames, segmentation maps from the simulation, and depth images, to reduce the overall computational cost. We conducted experiments with a non-sparse Deep Q-Network (DQN) (value-based) and a Deep Deterministic Policy Gradient (DDPG) (actor-critic) to test the adaptability of our framework to different methods and to identify which DRL method is most suitable for this task. In the end, a DQN is chosen for several reasons. First, a DQN has fewer networks than a DDPG, which reduces the computational resources required on physical UAVs. Second, although a DQN has a smaller model size than a DDPG, it still performs better in this specific task. Finally, a DQN is practical for this task because it can operate in a continuous state space. The proposed approach is verified to be effective in a high-fidelity simulation environment.
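The abstract lists "stacks of frames" among the framework's techniques without giving details. The following is a minimal Python sketch of generic frame stacking, where the last k camera frames are combined into one observation. It is not the authors' implementation: the class name `FrameStack`, the stack size `k`, and the toy 1x1 frames are all assumptions made for illustration.

```python
from collections import deque


class FrameStack:
    """Hypothetical helper: keep the k most recent frames as one observation."""

    def __init__(self, k):
        self.k = k
        # A bounded deque drops the oldest frame automatically on append.
        self.frames = deque(maxlen=k)

    def reset(self, first_frame):
        # Repeat the first frame so the observation holds k frames from step 0.
        self.frames.clear()
        for _ in range(self.k):
            self.frames.append(first_frame)
        return self.observation()

    def step(self, frame):
        # Push the newest frame; the oldest one falls out of the stack.
        self.frames.append(frame)
        return self.observation()

    def observation(self):
        # Oldest frame first, newest frame last.
        return list(self.frames)


# Toy usage with 1x1 "images" (assumed shapes, for illustration only).
stack = FrameStack(k=3)
obs_after_reset = stack.reset([[0]])
obs_after_step = stack.step([[1]])
```

In a real pipeline the frames would be image arrays, and the stacked observation would be fed to the value network; those parts are omitted here because the abstract does not specify them.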
Pages: 23