Enhancing Single Object Tracking With a Hybrid Approach: Temporal Convolutional Networks, Attention Mechanisms, and Spatial–Temporal Memory

被引:0
|
作者
Cheewaprakobkit, Pimpa [1 ,2 ]
Lin, Chih-Yang [3 ]
Shih, Timothy K. [1 ]
Enkhbat, Avirmed [1 ]
机构
[1] Natl Cent Univ, Dept Comp Sci & Informat Engn, Taoyuan 32001, Taiwan
[2] Asia Pacific Int Univ, Dept Informat Technol, Sara Buri 18180, Thailand
[3] Natl Cent Univ, Dept Mech Engn, Taoyuan 32001, Taiwan
关键词
Temporal convolutional networks; attention mechanism; spatial-temporal memory; single object tracking; TEMPLATE;
D O I
10.1109/ACCESS.2023.3330644
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Deep neural network-based tracking tasks have experienced significant advancements in recent years. However, these networks continue to face challenges in effectively adapting to appearance changes in both target and background, as well as linking objects after extended periods. The primary challenge in tracking lies in the frequent changes in a target's appearance throughout the tracking process, which can potentially reduce tracker robustness when faced with issues such as aspect ratio changes, occlusions, scale variations, and confusion from similar objects. To address this challenge, we propose a tracking architecture that combines a temporal convolutional network (TCN) and attention mechanism with spatial-temporal memory. The TCN component empowers the model to capture temporal dependencies, while the attention mechanism reduces computational complexity by focusing on crucial regions based on context. We leverage the target's historical information stored in the spatial-temporal memory network to guide the tracker in better adapting to target deformation. Our model attains a 67.5% average overlap (AO) on the GOT-10K dataset, a 72.1% success score (AUC) on OTB2015, a 65.8% success score (AUC) on UAV123, and achieves 59.0% accuracy on the VOT2018 dataset. These outcomes demonstrate the high effectiveness of our proposed tracker in tracking a single object.
引用
收藏
页码:139211 / 139222
页数:12
相关论文
共 50 条
  • [1] Relationship of spatial memory to spatial and temporal aspects of multiple object tracking
    Howard, C. J.
    Guest, D.
    PERCEPTION, 2014, 43 (01) : 99 - 99
  • [2] Object Tracking via Spatial-Temporal Memory Network
    Zhou, Zikun
    Li, Xin
    Zhang, Tianzhu
    Wang, Hongpeng
    He, Zhenyu
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, 32 (05) : 2976 - 2989
  • [3] Deep Ensemble Object Tracking Based on Temporal and Spatial Networks
    Hu, Zhaohua
    Chen, Huxin
    Li, Gaofei
    IEEE ACCESS, 2020, 8 : 7490 - 7505
  • [4] Robust Object Tracking based on Temporal and Spatial Deep Networks
    Teng, Zhu
    Xing, Junliang
    Wang, Qiang
    Lang, Congyan
    Feng, Songhe
    Jin, Yi
    2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, : 1153 - 1162
  • [5] SPATIAL-TEMPORAL CONVOLUTIONAL ATTENTION FOR MAPPING FUNCTIONAL BRAIN NETWORKS
    Liu, Yiheng
    Ge, Enjie
    Qiang, Ning
    Liu, Tianming
    Ge, Bao
    2023 IEEE 20TH INTERNATIONAL SYMPOSIUM ON BIOMEDICAL IMAGING, ISBI, 2023,
  • [6] Forecasting traffic flow with spatial–temporal convolutional graph attention networks
    Xiyue Zhang
    Yong Xu
    Yizhen Shao
    Neural Computing and Applications, 2022, 34 : 15457 - 15479
  • [7] Dynamic Fusion of Convolutional Features based on Spatial and Temporal Attention for Visual Tracking
    Zhao, Dongcheng
    Zeng, Yi
    2019 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2019,
  • [8] Spatial-Temporal Relation Networks for Multi-Object Tracking
    Xu, Jiarui
    Cao, Yue
    Zhang, Zheng
    Hu, Han
    2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 3987 - 3997
  • [9] Spatial and temporal attention embedded spatial temporal graph convolutional networks for skeleton based gait recognition with multiple IMUs
    Yan, Jianjun
    Xiong, Weixiang
    Jin, Li
    Jiang, Jinlin
    Yang, Zhihao
    Hu, Shuai
    Zhang, Qinghong
    ISCIENCE, 2024, 27 (09)
  • [10] Spatial-Temporal Attention Mechanism and Graph Convolutional Networks for Destination Prediction
    Li, Cong
    Zhang, Huyin
    Wang, Zengkai
    Wu, Yonghao
    Yang, Fei
    FRONTIERS IN NEUROROBOTICS, 2022, 16