SiamPAT: Siamese point attention networks for robust visual tracking

被引:0
|
作者
Chen, Hang [1 ]
Zhang, Weiguo [1 ]
Yan, Danghui [1 ]
机构
[1] Northwestern Polytech Univ, Automat Coll, Xian, Peoples R China
基金
中国国家自然科学基金;
关键词
visual tracking; attention mechanism; Siamese point attention; object attention; OBJECT TRACKING;
D O I
10.1117/1.JEI.30.5.053001
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Attention mechanism originates from the study of human visual behavior, which has been widely used in various fields of artificial intelligence in recent years and has become an important part of neural network structure. Many attention mechanism-based trackers have gained improved performance in both accuracy and robustness. However, these trackers cannot suppress the influence of background information and distractors accurately and do not enhance the target object information, which limits the performance of these trackers. We propose new Siamese point attention (SPA) networks for robust visual tracking. SPA networks learn position attention and channel attention jointly on two branch information. To construct point attention, each point on the template feature is used to calculate the similarity on the search feature. The similarity calculation is based on the local information of the target object, which can reduce the influence of background, deformation, and rotation factors. We can obtain the region of interest by calculating the position attention from point attention. Position attention is integrated into the calculation of channel attention to reduce the influence of irrelevant areas. In addition, we also propose the object attention, and integrate it into the classification and regression module to further enhance the semantic information of the target object and improve the tracking accuracy. Extensive experiments are also conducted on five benchmark datasets. The experiment results show that our method achieves state-of-the-art performance. (C) 2021 SPIE and IS&T
引用
收藏
页数:17
相关论文
共 50 条
  • [31] Robust adaptive learning with Siamese network architecture for visual tracking
    Wancheng Zhang
    Yongzhao Du
    Zhi Chen
    Jianhua Deng
    Peizhong Liu
    The Visual Computer, 2021, 37 : 881 - 894
  • [32] Siamese Visual Tracking With Deep Features and Robust Feature Fusion
    Li, Daqun
    Wang, Xize
    Yu, Yi
    IEEE ACCESS, 2020, 8 : 3863 - 3874
  • [33] Robust adaptive learning with Siamese network architecture for visual tracking
    Zhang, Wancheng
    Du, Yongzhao
    Chen, Zhi
    Deng, Jianhua
    Liu, Peizhong
    VISUAL COMPUTER, 2021, 37 (05): : 881 - 894
  • [34] Robust visual tracking algorithm with coattention guided Siamese network
    Dai, Jiahai
    Jiang, Jiaqi
    Wang, Songxin
    Chang, Yuchun
    JOURNAL OF ELECTRONIC IMAGING, 2022, 31 (03)
  • [35] Robust Template Adjustment Siamese Network for Object Visual Tracking
    Tang, Chuanming
    Qin, Peng
    Zhang, Jianlin
    SENSORS, 2021, 21 (04) : 1 - 17
  • [36] REGION-BASED FULLY CONVOLUTIONAL SIAMESE NETWORKS FOR ROBUST REAL-TIME VISUAL TRACKING
    Yang, Longchao
    Jiang, Peilin
    Wang, Fei
    Wang, Xuan
    2017 24TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2017, : 2567 - 2571
  • [37] Multi-granularity Hierarchical Attention Siamese Network for Visual Tracking
    Chen, Xing
    Zhang, Xiang
    Tan, Huibin
    Lan, Long
    Luo, Zhigang
    Huang, Xuhui
    2018 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2018,
  • [38] Paralleled attention modules and adaptive focal loss for Siamese visual tracking
    Zhao, Yuyao
    Jiang, Min
    Kong, Jun
    Li, Sha
    IET IMAGE PROCESSING, 2021, 15 (06) : 1345 - 1358
  • [39] Attention-Based Siamese Region Proposals Network for Visual Tracking
    Wang, Fan
    Yang, Bo
    Li, Jingting
    Hu, Xiaopeng
    Ji, Zhihang
    IEEE ACCESS, 2020, 8 (08): : 86595 - 86607
  • [40] Siamese Implicit Region Proposal Network With Compound Attention for Visual Tracking
    Chan, Sixian
    Tao, Jian
    Zhou, Xiaolong
    Bai, Cong
    Zhang, Xiaoqin
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2022, 31 : 1882 - 1894