Propagating prior information with transformer for robust visual object tracking

被引:0
|
作者
Wu, Yue [1 ]
Cai, Chengtao [1 ,2 ]
Yeo, Chai Kiat [3 ]
机构
[1] Harbin Engn Univ, Sch Intelligent Sci & Engn, Harbin 150001, Peoples R China
[2] Harbin Engn Univ, Key Lab Intelligent Technol & Applicat Marine Equi, Minist Educ, Harbin 150001, Peoples R China
[3] Nanyang Technol Univ, Sch Comp Sci & Engn, Singapore 639798, Singapore
关键词
Visual object tracking; Siamese network; Transformer; Prior information; VIDEO;
D O I
10.1007/s00530-024-01423-8
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In recent years, the domain of visual object tracking has witnessed considerable advancements with the advent of deep learning methodologies. Siamese-based trackers have been pivotal, establishing a new architecture with a weight-shared backbone. With the inclusion of the transformer, attention mechanism has been exploited to enhance the feature discriminability across successive frames. However, the limited adaptability of many existing trackers to the different tracking scenarios has led to inaccurate target localization. To effectively solve this issue, in this paper, we have integrated a siamese network with the transformer, where the former utilizes ResNet50 as the backbone network to extract the target features, while the latter consists of an encoder and a decoder, where the encoder can effectively utilize global contextual information to obtain the discriminative features. Simultaneously, we employ the decoder to propagate prior information related to the target, which enables the tracker to successfully locate the target in a variety of environments, enhancing the stability and robustness of the tracker. Extensive experiments on four major public datasets, OTB100, UAV123, GOT10k and LaSOText demonstrate the effectiveness of the proposed method. Its performance surpasses many state-of-the-art trackers. Additionally, the proposed tracker can achieve a tracking speed of 60 fps, meeting the requirements for real-time tracking.
引用
收藏
页数:14
相关论文
共 50 条
  • [1] Learning Feature Restoration Transformer for Robust Dehazing Visual Object Tracking
    Xu, Tianyang
    Pan, Yifan
    Feng, Zhenhua
    Zhu, Xuefeng
    Cheng, Chunyang
    Wu, Xiao-Jun
    Kittler, Josef
    INTERNATIONAL JOURNAL OF COMPUTER VISION, 2024, 132 (12) : 6021 - 6038
  • [2] Robust Visual Tracking with Reliable Object Information and Kalman Filter
    Chen, Hang
    Zhang, Weiguo
    Yan, Danghui
    SENSORS, 2021, 21 (03) : 1 - 18
  • [3] Multiple templates transformer for visual object tracking
    Pang, Haibo
    Su, Jie
    Ma, Rongqi
    Li, Tingting
    Liu, Chengming
    KNOWLEDGE-BASED SYSTEMS, 2023, 280
  • [4] Robust Object Modeling for Visual Tracking
    Cai, Yidong
    Liu, Jie
    Tang, Jie
    Wu, Gangshan
    2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023), 2023, : 9555 - 9566
  • [5] A Robust Framework for Visual Object Tracking
    Nguyen Dang Binh
    2009 IEEE-RIVF INTERNATIONAL CONFERENCE ON COMPUTING AND COMMUNICATION TECHNOLOGIES: RESEARCH, INNOVATION AND VISION FOR THE FUTURE, 2009, : 95 - 102
  • [6] Transferring Visual Prior for Online Object Tracking
    Wang, Qing
    Chen, Feng
    Yang, Jimei
    Xu, Wenli
    Yang, Ming-Hsuan
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2012, 21 (07) : 3296 - 3305
  • [7] Transformer Union Convolution Network for visual object tracking
    Song, Zhehan
    Chen, Yiming
    Luo, Peng
    Feng, Huajun
    Xu, Zhihai
    Li, Qi
    OPTICS COMMUNICATIONS, 2022, 524
  • [8] Hunt-inspired Transformer for visual object tracking
    Zhang, Zhibin
    Xue, Wanli
    Zhou, Yuxi
    Zhang, Kaihua
    Chen, Shengyong
    PATTERN RECOGNITION, 2024, 156
  • [9] An Implementation of a Robust Visual Object Tracking System
    Nguyen, An Hoang
    Mai, Linh
    Do, Hung Ngoc
    PROCEEDINGS OF 202013TH INTERNATIONAL CONFERENCE ON ADVANCED TECHNOLOGIES FOR COMMUNICATIONS (ATC 2020), 2020, : 166 - 171
  • [10] Robust Visual Object Tracking with Interleaved Segmentation
    Abel, Peter
    Kieritz, Hilke
    Becker, Stefan
    Arens, Michael
    COUNTERTERRORISM, CRIME FIGHTING, FORENSICS, AND SURVEILLANCE TECHNOLOGIES, 2017, 10441