Siamese transformer RGBT tracking

被引:8
|
作者
Wang, Futian [1 ,2 ]
Wang, Wenqi [1 ]
Liu, Lei [1 ]
Li, Chenglong [1 ]
Tang, Jing [1 ,2 ]
机构
[1] Anhui Univ, Sch Comp Sci & Technol, Anhui Prov Key Lab Multimodal Cognit Computat, Hefei 230601, Anhui, Peoples R China
[2] Hefei Comprehens Natl Sci Ctr, Inst Artificial Intelligence, Hefei 230601, Anhui, Peoples R China
基金
中国国家自然科学基金;
关键词
RGBT tracking; Siamese network; Transformer; Template update strategy;
D O I
10.1007/s10489-023-04741-y
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Siamese-based RGBT trackers have attracted wide attention because of their high efficiency. However, there is a lack of an effective multimodal fusion module and information interaction between the search area and template area, which leads to poor performance of these siamese-based RGBT trackers. To solve this problem, inspire by the global information modeling capability of the transformer, we construct a siamese-based transformer RGBT tracker consisting of a single unified transformer module. Specifically, we propose a unified transformer fusion module to achieve feature extraction and global information interaction in the siamese RGBT tracker, i.e., the interaction between the search area and template area, the interaction between different modalities. It consists of self-attention and cross-attention, which are used to extract features and information interaction respectively. In addition, to alleviate the impact of multimodal fusion on the efficiency of template update in the tracking stage, we propose a feature-level template update strategy, which effectively improves tracking efficiency. To verify the effectiveness of our tracker, we evaluate it on five benchmark datasets including GTOT, RGBT210, RGBT234, LasHeR and VTUAV, and the results show that our tracker achieves excellent performance compared to the state-of-the-art methods.
引用
收藏
页码:24709 / 24723
页数:15
相关论文
共 50 条
  • [21] Learning modality feature fusion via transformer for RGBT-tracking
    Cai, Yujue
    Sui, Xiubao
    Gu, Guohua
    Chen, Qian
    INFRARED PHYSICS & TECHNOLOGY, 2023, 133
  • [22] RGBT Tracking via Progressive Fusion Transformer With Dynamically Guided Learning
    Zhu, Yabin
    Li, Chenglong
    Wang, Xiao
    Tang, Jin
    Huang, Zhixiang
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (09) : 8722 - 8735
  • [23] MTNet: Learning Modality-aware Representation with Transformer for RGBT Tracking
    Hou, Ruichao
    Xu, Boyue
    Ren, Tongwei
    Wu, Gangshan
    2023 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, ICME, 2023, : 1163 - 1168
  • [24] Siamese hierarchical feature fusion transformer for efficient tracking
    Dai, Jiahai
    Fu, Yunhao
    Wang, Songxin
    Chang, Yuchun
    FRONTIERS IN NEUROROBOTICS, 2022, 16
  • [25] Siamese network with transformer and saliency encoder for object tracking
    Lei Liu
    Guangqian Kong
    Xun Duan
    Huiyun Long
    Yun Wu
    Applied Intelligence, 2023, 53 : 2265 - 2279
  • [26] RGBT Image Fusion Tracking via Sparse Trifurcate Transformer Aggregation Network
    Feng, Mingzheng
    Su, Jianbo
    IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2024, 73 : 1 - 10
  • [27] SiamUT: Siamese Unsymmetrical Transformer-like Tracking
    Yang, Lingyu
    Zhou, Hao
    Yuan, Guowu
    Xia, Mengen
    Chen, Dong
    Shi, Zhiliang
    Chen, Enbang
    ELECTRONICS, 2023, 12 (14)
  • [28] Siamese network with transformer and saliency encoder for object tracking
    Liu, Lei
    Kong, Guangqian
    Duan, Xun
    Long, Huiyun
    Wu, Yun
    APPLIED INTELLIGENCE, 2023, 53 (02) : 2265 - 2279
  • [29] Spatial Transformer Part-based Siamese Visual Tracking
    Zhang, Ximing
    Lei, Hao
    Ma, Yilong
    Luo, Shujuan
    Fan, Xuewu
    PROCEEDINGS OF THE 39TH CHINESE CONTROL CONFERENCE, 2020, : 7269 - 7274
  • [30] SPPT: Siamese Pyramid Pooling Transformer for Visual Object Tracking
    Fang, Yang
    Xie, Bailian
    Jiang, Bingbing
    Ke, Xuhui
    Li, Yan
    HUMAN-CENTRIC COMPUTING AND INFORMATION SCIENCES, 2023, 13