Learning attention modules for visual tracking

被引:2
|
作者
Wang, Jun [1 ]
Meng, Chenchen [1 ]
Deng, Chengzhi [1 ]
Wang, Yuanyun [1 ]
机构
[1] Nanchang Inst Technol, Sch Informat Engn, Nanchang 330029, Jiangxi, Peoples R China
基金
中国国家自然科学基金;
关键词
Visual tracking; Siamese networks; Channel attention; Spatial attention;
D O I
10.1007/s11760-022-02177-4
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Siamese networks have been widely used in visual tracking. However, it is difficult to deal with complex appearance variations when the discriminative background information is ignored and an offline training strategy is adopted. In this paper, we present a novel backbone network based on CNN model and attention mechanism in the Siamese framework. The attention mechanism is composed of a channel attention module and a spatial attention module. The channel attention module uses the learned global information to selectively focus on the convolution features, which enhances a network representation ability. Besides, the spatial attention module obtains more contextual information and semantic features of target candidates. The designed attention mechanism-based backbone is lightweight and has a real-time tracking performance. We utilize GOT-10K as a training set to offline adjust trained model parameters. The extensive experimental evaluations on OTB2015, VOT2016, VOT2018, GOT-10k and UAV123 datasets demonstrate that the proposed algorithm has excellent performances against state-of-the-art trackers.
引用
收藏
页码:2149 / 2156
页数:8
相关论文
共 50 条
  • [31] A RANKING BASED ATTENTION APPROACH FOR VISUAL TRACKING
    Peng, Shenhui
    Kamata, Sei-ichiro
    Breckon, Toby P.
    2019 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2019, : 3073 - 3077
  • [32] LATrack: Limited Attention for Visual Object Tracking
    Shi, Jian
    Chang, Zheng
    Yu, Yang
    Shi, Junze
    Luo, Haibo
    IEEE ACCESS, 2025, 13 : 4034 - 4047
  • [33] MTAtrack: Multilevel transformer attention for visual tracking
    An, Dong
    Zhang, Fan
    Zhao, Yuqian
    Luo, Biao
    Yang, Chunhua
    Chen, Baifan
    Yu, Lingli
    OPTICS AND LASER TECHNOLOGY, 2023, 166
  • [34] Visual Attention Model Based Object Tracking
    Ma, Lili
    Cheng, Jian
    Liu, Jing
    Wang, Jinqiao
    Lu, Hanging
    ADVANCES IN MULTIMEDIA INFORMATION PROCESSING-PCM 2010, PT II, 2010, 6298 : 483 - 493
  • [35] STASiamRPN: visual tracking based on spatiotemporal and attention
    Wu, Ruixu
    Wen, Xianbin
    Liu, Zhanlu
    Yuan, Liming
    Xu, Haixia
    MULTIMEDIA SYSTEMS, 2022, 28 (05) : 1543 - 1555
  • [36] Incremental focus of attention for robust visual tracking
    Toyama, K
    Hager, GD
    1996 IEEE COMPUTER SOCIETY CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, PROCEEDINGS, 1996, : 189 - 195
  • [37] SiamAtt: Siamese attention network for visual tracking
    Yang, Kai
    He, Zhenyu
    Zhou, Zikun
    Fan, Nana
    KNOWLEDGE-BASED SYSTEMS, 2020, 203
  • [38] Object detection and tracking based on visual attention
    Zhang, Huawei
    Zhang, Qiaorong
    ICIC Express Letters, 2012, 6 (10): : 2667 - 2671
  • [39] A Visual Attention Model for Robot Object Tracking
    JinKui Chu RongHua Li QingYing Li HongQing Wang School of Mechanical Engineering Dalian University of Technology Dalian PRC
    International Journal of Automation & Computing, 2010, 7 (01) : 39 - 46
  • [40] EANTrack: An Efficient Attention Network for Visual Tracking
    Gu, Fengwei
    Lu, Jun
    Cai, Chengtao
    Zhu, Qidan
    Ju, Zhaojie
    IEEE TRANSACTIONS ON AUTOMATION SCIENCE AND ENGINEERING, 2024, 21 (04) : 5911 - 5928