Siamese transformer RGBT tracking

Cited by: 8
Authors
Wang, Futian [1, 2]
Wang, Wenqi [1]
Liu, Lei [1]
Li, Chenglong [1]
Tang, Jing [1, 2]
Affiliations
[1] Anhui Univ, Sch Comp Sci & Technol, Anhui Prov Key Lab Multimodal Cognit Computat, Hefei 230601, Anhui, Peoples R China
[2] Hefei Comprehens Natl Sci Ctr, Inst Artificial Intelligence, Hefei 230601, Anhui, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
RGBT tracking; Siamese network; Transformer; Template update strategy
DOI
10.1007/s10489-023-04741-y
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
Siamese-based RGBT trackers have attracted wide attention because of their high efficiency. However, they lack an effective multimodal fusion module and information interaction between the search area and the template area, which leads to poor tracking performance. To solve this problem, inspired by the global information modeling capability of the transformer, we construct a Siamese-based transformer RGBT tracker consisting of a single unified transformer module. Specifically, we propose a unified transformer fusion module that performs feature extraction and global information interaction in the Siamese RGBT tracker, i.e., interaction between the search area and the template area and interaction between different modalities. The module consists of self-attention and cross-attention, which are used for feature extraction and information interaction, respectively. In addition, to alleviate the impact of multimodal fusion on the efficiency of template update in the tracking stage, we propose a feature-level template update strategy, which effectively improves tracking efficiency. To verify the effectiveness of our tracker, we evaluate it on five benchmark datasets, GTOT, RGBT210, RGBT234, LasHeR and VTUAV, and the results show that our tracker achieves excellent performance compared to state-of-the-art methods.
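The two attention roles named in the abstract can be illustrated with a small, dependency-free sketch. This is not the authors' architecture: it omits learned projections, multiple heads, and normalization, and the names `fuse` and `update_template`, the toy token values, and the blending rate are all assumptions made for illustration. In particular, the abstract does not say how the feature-level template update is computed; linear blending of cached features is only one plausible stand-in.

```python
import math

def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]

def attention(queries, keys, values):
    """Scaled dot-product attention on lists of token vectors (no learned weights)."""
    scale = math.sqrt(len(keys[0]))
    fused = []
    for q in queries:
        scores = [sum(a * b for a, b in zip(q, k)) / scale for k in keys]
        weights = softmax(scores)
        fused.append([sum(w * v[d] for w, v in zip(weights, values))
                      for d in range(len(values[0]))])
    return fused

def residual(tokens, update):
    """Element-wise residual connection: tokens + update."""
    return [[t + u for t, u in zip(tr, ur)] for tr, ur in zip(tokens, update)]

def fuse(rgb, thermal):
    """Hypothetical fusion step: self-attention per modality, then cross-attention."""
    # Self-attention: each modality attends to its own tokens (feature extraction).
    rgb = residual(rgb, attention(rgb, rgb, rgb))
    thermal = residual(thermal, attention(thermal, thermal, thermal))
    # Cross-attention: each modality queries the other (modality interaction).
    rgb_out = residual(rgb, attention(rgb, thermal, thermal))
    thermal_out = residual(thermal, attention(thermal, rgb, rgb))
    return rgb_out + thermal_out  # concatenate the two token lists

def update_template(cached, fresh, rate=0.1):
    """Assumed feature-level update: blend cached template features with fresh ones."""
    return [[(1 - rate) * c + rate * f for c, f in zip(cr, fr)]
            for cr, fr in zip(cached, fresh)]

# Toy 2-dimensional tokens; the values are arbitrary.
template_rgb = [[1.0, 0.0], [0.0, 1.0]]
template_tir = [[0.5, 0.5], [1.0, 1.0]]
search_rgb = [[0.2, 0.8], [0.9, 0.1], [0.4, 0.4]]
search_tir = [[0.3, 0.3], [0.7, 0.2], [0.1, 0.9]]

template_feat = fuse(template_rgb, template_tir)  # 4 fused template tokens
search_feat = fuse(search_rgb, search_tir)        # 6 fused search tokens

# Search/template interaction: search tokens query the template tokens.
search_out = residual(search_feat,
                      attention(search_feat, template_feat, template_feat))

# Update the cached template at feature level, without re-running fusion on a
# new template image. The "fresh" features here are only a stand-in.
fresh = search_out[:len(template_feat)]
template_feat = update_template(template_feat, fresh, rate=0.1)
```

In this sketch the template is fused once and then kept as cached features, so each later update costs a single blend rather than a new fusion pass, which is the kind of saving the abstract attributes to updating at the feature level.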
Pages: 24709-24723
Page count: 15