SiamPAT: Siamese point attention networks for robust visual tracking

被引:0
|
作者
Chen, Hang [1 ]
Zhang, Weiguo [1 ]
Yan, Danghui [1 ]
机构
[1] Northwestern Polytech Univ, Automat Coll, Xian, Peoples R China
基金
中国国家自然科学基金;
关键词
visual tracking; attention mechanism; Siamese point attention; object attention; OBJECT TRACKING;
D O I
10.1117/1.JEI.30.5.053001
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Attention mechanism originates from the study of human visual behavior, which has been widely used in various fields of artificial intelligence in recent years and has become an important part of neural network structure. Many attention mechanism-based trackers have gained improved performance in both accuracy and robustness. However, these trackers cannot suppress the influence of background information and distractors accurately and do not enhance the target object information, which limits the performance of these trackers. We propose new Siamese point attention (SPA) networks for robust visual tracking. SPA networks learn position attention and channel attention jointly on two branch information. To construct point attention, each point on the template feature is used to calculate the similarity on the search feature. The similarity calculation is based on the local information of the target object, which can reduce the influence of background, deformation, and rotation factors. We can obtain the region of interest by calculating the position attention from point attention. Position attention is integrated into the calculation of channel attention to reduce the influence of irrelevant areas. In addition, we also propose the object attention, and integrate it into the classification and regression module to further enhance the semantic information of the target object and improve the tracking accuracy. Extensive experiments are also conducted on five benchmark datasets. The experiment results show that our method achieves state-of-the-art performance. (C) 2021 SPIE and IS&T
引用
收藏
页数:17
相关论文
共 50 条
  • [21] Object tracking based on Siamese networks and attention mechanism
    Yan, Zhengbang
    Quan, Wenjun
    Yang, Congxian
    Wang, Wei
    JOURNAL OF ELECTRONIC IMAGING, 2023, 32 (05)
  • [22] Target-Aware Siamese Networks Based on Masked Attention Mechanism for Visual Object Tracking
    Su, Yao-Hui
    Shieh, Ming-Der
    Tsai, Chia-Chi
    2024 IEEE 7TH INTERNATIONAL CONFERENCE ON MULTIMEDIA INFORMATION PROCESSING AND RETRIEVAL, MIPR 2024, 2024, : 28 - 34
  • [23] Siamese Cascaded Region Proposal Networks With Channel-Interconnection-Spatial Attention for Visual Tracking
    Cui, Zhoujuan
    An, Junshe
    Ye, Qing
    Cui, Tianshu
    IEEE ACCESS, 2020, 8 : 154800 - 154815
  • [24] Siamese Local and Global Networks for Robust Face Tracking
    Qi, Yuankai
    Zhang, Shengping
    Jiang, Feng
    Zhou, Huiyu
    Tao, Dacheng
    Li, Xuelong
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2020, 29 : 9152 - 9164
  • [25] MASNet: mixed attention Siamese network for visual object tracking
    Zhang, Jianwei
    Zhang, Zhichen
    Zhang, Huanlong
    Wang, Jingchao
    Wang, He
    Zheng, Menya
    SYSTEMS SCIENCE & CONTROL ENGINEERING, 2024, 12 (01)
  • [26] Siamese-Based Twin Attention Network for Visual Tracking
    Bao, Hua
    Shu, Ping
    Zhang, Hongchao
    Liu, Xiaobai
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2023, 33 (02) : 847 - 860
  • [27] Dual attention Siamese network with anchor free for visual tracking
    Guo W.
    Liang B.-W.
    Ding X.-M.
    Kongzhi yu Juece/Control and Decision, 2024, 39 (02): : 633 - 640
  • [28] Visual Tracking With Siamese Network Based on Fast Attention Network
    Qin, Lin
    Yang, Yang
    Huang, Dandan
    Zhu, Naibo
    Yang, Han
    Xu, Zhisong
    IEEE ACCESS, 2022, 10 : 35632 - 35642
  • [29] SiamAGN: Siamese attention-guided network for visual tracking
    Wei, Bingbing
    Chen, Hongyu
    Ding, Qinghai
    Luo, Haibo
    NEUROCOMPUTING, 2022, 512 : 69 - 82
  • [30] Adaptive Feature Selection Siamese Networks for Visual Tracking
    Fiaz, Mustansar
    Rahman, Md Maklachur
    Mahmood, Arif
    Farooq, Sehar Shahzad
    Baek, Ki Yeol
    Jung, Soon Ki
    FRONTIERS OF COMPUTER VISION, 2020, 1212 : 167 - 179