UAV Dynamic Object Tracking with Lightweight Deep Vision Reinforcement Learning

被引：3

作者：

Nguyen, Hy ^{[1
]}

Thudumu, Srikanth ^{[1
]}

Du, Hung ^{[1
]}

Mouzakis, Kon ^{[1
]}

Vasa, Rajesh ^{[1
]}

机构：

[1] Deakin Univ, Appl Artificial Intelligence Inst A2I2, Geelong, Vic 3216, Australia

来源：

ALGORITHMS | 2023年 / 16卷 / 05期

关键词：

deep Q-network (DQN); deep deterministic policy gradient (DDPG); deep reinforcement learning (DRL); object tracking; object detection; unmanned aerial vehicle (UAV);

D O I：

10.3390/a16050227

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Several approaches have applied Deep Reinforcement Learning (DRL) to Unmanned Aerial Vehicles (UAVs) to do autonomous object tracking. These methods, however, are resource intensive and require prior knowledge of the environment, making them difficult to use in real-world applications. In this paper, we propose a Lightweight Deep Vision Reinforcement Learning (LDVRL) framework for dynamic object tracking that uses the camera as the only input source. Our framework employs several techniques such as stacks of frames, segmentation maps from the simulation, and depth images to reduce the overall computational cost. We conducted the experiment with a non-sparse Deep Q-Network (DQN) (value-based) and a Deep Deterministic Policy Gradient (DDPG) (actor-critic) to test the adaptability of our framework with different methods and identify which DRL method is the most suitable for this task. In the end, a DQN is chosen for several reasons. Firstly, a DQN has fewer networks than a DDPG, hence reducing the computational resources on physical UAVs. Secondly, it is surprising that although a DQN is smaller in model size than a DDPG, it still performs better in this specific task. Finally, a DQN is very practical for this task due to the ability to operate in continuous state space. Using a high-fidelity simulation environment, our proposed approach is verified to be effective.

引用

页数：23

共 50 条

[41] A survey on security of UAV and deep reinforcement learning
Sarikaya, Burcu Sonmez
Bahtiyar, Serif
AD HOC NETWORKS, 2024, 164
[42] Lightweight and Deep Appearance Embedding for Multiple Object Tracking
Ye, Liangling
Li, Weida
Zheng, Lixin
Zeng, Yuanyue
IET COMPUTER VISION, 2022, 16 (06) : 489 - 503
[43] Vision Memory for Target Object Navigation using Deep Reinforcement Learning: An Empirical Study
Do-Van Nguyen
Tung-Long Vuong
Hai-Dang Kieu
Linh Pham
Thanh-Ha Le
2018 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS (SMC), 2018, : 3267 - 3273
[44] Vision-based Navigation of UAV with Continuous Action Space Using Deep Reinforcement Learning
Zhou, Benchun
Wang, Weihong
Liu, Zhenghua
Wang, Jia
PROCEEDINGS OF THE 2019 31ST CHINESE CONTROL AND DECISION CONFERENCE (CCDC 2019), 2019, : 5030 - 5035
[45] Vision-Based Robotic Object Grasping-A Deep Reinforcement Learning Approach
Chen, Ya-Ling
Cai, Yan-Rou
Cheng, Ming-Yang
MACHINES, 2023, 11 (02)
[46] Dynamic Object Detection and Tracking in Vision SLAM
Liu H.
Niu L.
Deng Y.
Applied Mathematics and Nonlinear Sciences, 2024, 9 (01)
[47] Object tracking: Feature selection by reinforcement learning
Deng, Jiali
Gong, Haigang
Liu, Minghui
Liu, Ming
INTERNATIONAL CONFERENCE ON COMPUTER VISION, APPLICATION, AND DESIGN (CVAD 2021), 2021, 12155
[48] A reinforcement learning approach for UAV target searching and tracking
Tian Wang
Ruoxi Qin
Yang Chen
Hichem Snoussi
Chang Choi
Multimedia Tools and Applications, 2019, 78 : 4347 - 4364
[49] A reinforcement learning approach for UAV target searching and tracking
Wang, Tian
Qin, Ruoxi
Chen, Yang
Snoussi, Hichem
Choi, Chang
MULTIMEDIA TOOLS AND APPLICATIONS, 2019, 78 (04) : 4347 - 4364
[50] Dynamic Coordination in UAV Swarm Assisted MEC via Decentralized Deep Reinforcement Learning
Ye, Yuting
Wei, Wenshu
Geng, Dongqing
He, Xiaofan
2020 12TH INTERNATIONAL CONFERENCE ON WIRELESS COMMUNICATIONS AND SIGNAL PROCESSING (WCSP), 2020, : 1064 - 1069

← 1 2 3 4 5 →