Learning attention modules for visual tracking

被引:2
|
作者
Wang, Jun [1 ]
Meng, Chenchen [1 ]
Deng, Chengzhi [1 ]
Wang, Yuanyun [1 ]
机构
[1] Nanchang Inst Technol, Sch Informat Engn, Nanchang 330029, Jiangxi, Peoples R China
基金
中国国家自然科学基金;
关键词
Visual tracking; Siamese networks; Channel attention; Spatial attention;
D O I
10.1007/s11760-022-02177-4
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Siamese networks have been widely used in visual tracking. However, it is difficult to deal with complex appearance variations when the discriminative background information is ignored and an offline training strategy is adopted. In this paper, we present a novel backbone network based on CNN model and attention mechanism in the Siamese framework. The attention mechanism is composed of a channel attention module and a spatial attention module. The channel attention module uses the learned global information to selectively focus on the convolution features, which enhances a network representation ability. Besides, the spatial attention module obtains more contextual information and semantic features of target candidates. The designed attention mechanism-based backbone is lightweight and has a real-time tracking performance. We utilize GOT-10K as a training set to offline adjust trained model parameters. The extensive experimental evaluations on OTB2015, VOT2016, VOT2018, GOT-10k and UAV123 datasets demonstrate that the proposed algorithm has excellent performances against state-of-the-art trackers.
引用
收藏
页码:2149 / 2156
页数:8
相关论文
共 50 条
  • [1] Learning attention modules for visual tracking
    Jun Wang
    Chenchen Meng
    Chengzhi Deng
    Yuanyun Wang
    Signal, Image and Video Processing, 2022, 16 : 2149 - 2156
  • [2] Paralleled attention modules and adaptive focal loss for Siamese visual tracking
    Zhao, Yuyao
    Jiang, Min
    Kong, Jun
    Li, Sha
    IET IMAGE PROCESSING, 2021, 15 (06) : 1345 - 1358
  • [3] Learning Spatial-Channel Attention for Visual Tracking
    Zeng, Yingsen
    Wang, Haiying
    Lu, Ting
    2019 IEEE/CIC INTERNATIONAL CONFERENCE ON COMMUNICATIONS IN CHINA (ICCC), 2019,
  • [4] Learning spatial self-attention information for visual tracking
    Li, Shengwu
    Zhang, Xuande
    Xiong, Jing
    Ning, Chenjing
    Zhang, Mingke
    IET IMAGE PROCESSING, 2022, 16 (01) : 49 - 60
  • [5] Learning Attention Through Hierarchical Architecture for Visual Object Tracking
    Wang, Qinghui
    Yang, Peng
    Dou, Lei
    IEEE SIGNAL PROCESSING LETTERS, 2024, 31 : 186 - 190
  • [6] Learning Multidimensional Spatial Attention for Robust Nighttime Visual Tracking
    Gao, Qi
    Yin, Mingfeng
    Ni, Yuanzhi
    Bo, Yuming
    Bei, Shaoyi
    IEEE SIGNAL PROCESSING LETTERS, 2024, 31 : 2910 - 2914
  • [7] Efficient Siamese model for visual object tracking with attention-based fusion modules
    Zhou, Wenjun
    Liu, Yao
    Wang, Nan
    Liang, Dong
    Peng, Bo
    SIGNAL IMAGE AND VIDEO PROCESSING, 2024, 18 (11) : 7801 - 7810
  • [8] Visual attention learning and antiocclusion-based correlation filter for visual object tracking
    Huang, Yuming
    Chen, Yingpin
    Lin, Chen
    Hu, Qiang
    Song, Jianhua
    JOURNAL OF ELECTRONIC IMAGING, 2023, 32 (01)
  • [9] Lateralization of modules of visual spatial attention and the perceptive learning effect in preschool children
    Voronin, N. A.
    Stroganova, T. A.
    VOPROSY PSIKHOLOGII, 2009, (06) : 138 - +
  • [10] Efficient Visual Tracking With Stacked Channel-Spatial Attention Learning
    Rahman, Md. Maklachur
    Fiaz, Mustansar
    Jung, Soon Ki
    IEEE ACCESS, 2020, 8 : 100857 - 100869