Enhancing Single Object Tracking With a Hybrid Approach: Temporal Convolutional Networks, Attention Mechanisms, and Spatial–Temporal Memory

被引:0
|
作者
Cheewaprakobkit, Pimpa [1 ,2 ]
Lin, Chih-Yang [3 ]
Shih, Timothy K. [1 ]
Enkhbat, Avirmed [1 ]
机构
[1] Natl Cent Univ, Dept Comp Sci & Informat Engn, Taoyuan 32001, Taiwan
[2] Asia Pacific Int Univ, Dept Informat Technol, Sara Buri 18180, Thailand
[3] Natl Cent Univ, Dept Mech Engn, Taoyuan 32001, Taiwan
关键词
Temporal convolutional networks; attention mechanism; spatial-temporal memory; single object tracking; TEMPLATE;
D O I
10.1109/ACCESS.2023.3330644
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Deep neural network-based tracking tasks have experienced significant advancements in recent years. However, these networks continue to face challenges in effectively adapting to appearance changes in both target and background, as well as linking objects after extended periods. The primary challenge in tracking lies in the frequent changes in a target's appearance throughout the tracking process, which can potentially reduce tracker robustness when faced with issues such as aspect ratio changes, occlusions, scale variations, and confusion from similar objects. To address this challenge, we propose a tracking architecture that combines a temporal convolutional network (TCN) and attention mechanism with spatial-temporal memory. The TCN component empowers the model to capture temporal dependencies, while the attention mechanism reduces computational complexity by focusing on crucial regions based on context. We leverage the target's historical information stored in the spatial-temporal memory network to guide the tracker in better adapting to target deformation. Our model attains a 67.5% average overlap (AO) on the GOT-10K dataset, a 72.1% success score (AUC) on OTB2015, a 65.8% success score (AUC) on UAV123, and achieves 59.0% accuracy on the VOT2018 dataset. These outcomes demonstrate the high effectiveness of our proposed tracker in tracking a single object.
引用
收藏
页码:139211 / 139222
页数:12
相关论文
共 50 条
  • [21] Attention based spatial-temporal graph convolutional networks for boiler NOx prediction
    Zhou, Yongqing
    Hao, Dawei
    Fan, Yuchen
    Wen, Xintong
    Wei, Chang
    Liu, Xin
    Zhang, Wenzhen
    Wang, Heyang
    Meitan Xuebao/Journal of the China Coal Society, 2024, 49 (10): : 4127 - 4137
  • [22] TCN-Attention-BIGRU: Building energy modelling based on attention mechanisms and temporal convolutional networks
    Deng, Yi
    Yue, Zhanpeng
    Wu, Ziyi
    Li, Yitong
    Wang, Yifei
    ELECTRONIC RESEARCH ARCHIVE, 2024, 32 (03): : 2160 - 2179
  • [23] Extended Hierarchical Temporal Memory for Visual Object Tracking
    Krys, Sebastian
    Jankowski, Stanislaw
    PHOTONICS APPLICATIONS IN ASTRONOMY, COMMUNICATIONS, INDUSTRY, AND HIGH-ENERGY PHYSICS EXPERIMENTS 2011, 2011, 8008
  • [24] Object tracking with temporal prediction and spatial refinement (TPSR)
    Gan, Weihao
    Lee, Ming-Sui
    Wu, Chi-hao
    Kuo, C. -C. Jay
    SIGNAL PROCESSING-IMAGE COMMUNICATION, 2016, 47 : 303 - 312
  • [25] Temporal convolutional networks for musical audio beat tracking
    Davies, Matthew E. P.
    Boeck, Sebastian
    2019 27TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2019,
  • [26] Object tracking based on temporal and spatial context information
    Chen, Yan
    Lin, Tao
    Du, Jixiang
    Zhang, Hongbo
    IMAGE AND VISION COMPUTING, 2025, 157
  • [27] A spatial-temporal contexts network for object tracking
    Huang, Kai
    Xiao, Kai
    Chu, Jun
    Leng, Lu
    Dong, Xingbo
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2024, 127
  • [28] Enhancing Online UAV Multi-Object Tracking with Temporal Context and Spatial Topological Relationships
    Xiao, Changcheng
    Cao, Qiong
    Zhong, Yujie
    Lan, Long
    Zhang, Xiang
    Cai, Huayue
    Luo, Zhigang
    DRONES, 2023, 7 (06)
  • [29] Spatial-Temporal Convolutional Attention Network for Action Recognition
    Luo, Huilan
    Chen, Han
    Computer Engineering and Applications, 2023, 59 (09): : 150 - 158
  • [30] Spatial-temporal single object tracking with three-way decision theory
    Wang, Ziye
    Miao, Duoqian
    INTERNATIONAL JOURNAL OF APPROXIMATE REASONING, 2023, 154 : 38 - 47