Object semantic-guided graph attention feature fusion network for Siamese visual tracking

被引:3
|
作者
Zhang, Jianwei [1 ]
Miao, Mengen [1 ]
Zhang, Huanlong [2 ]
Wang, Jingchao [1 ]
Zhao, Yanchun [3 ]
Chen, Zhiwu [2 ]
Qiao, Jianwei [4 ]
机构
[1] Zhengzhou Univ Light Ind, Coll Software Engn, Zhengzhou 450001, Peoples R China
[2] Zhengzhou Univ Light Ind, Coll Elect & Informat Engn, Zhengzhou 450002, Peoples R China
[3] Univ Elect Sci & Technol China, Yangtze Delta Reg Inst Huzhou, Huzhou 313001, Peoples R China
[4] Wolong Elect Nanyang Explos Proof Motor Grp, Nanyang 473000, Peoples R China
基金
中国国家自然科学基金;
关键词
Visual tracking; Siamese network; Semantic; -guided; Graph attention; ROBUST;
D O I
10.1016/j.jvcir.2022.103705
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The similarity matching between the template and the search area plays a key role in Siamese-based trackers. Most Siamese-based trackers adopt correlation operation to perform feature fusion on the template branch and search branch for similarity matching. However, the correlation operation directly uses the template feature to slide the window on the search area feature without distinguishing the discriminant part of the target and the background noise, which blurs the spatial information of the response feature. To address this issue, this work proposes a novel object semantic-guided graph attention feature fusion network that both removes background information and focuses on the discriminative part of the object. The proposed network effectively removes background noise by utilizing an adaptive template instead of the fixed-size template used by the correlation operation. The network also models the contextual semantic relations of the target and uses the resulting se-mantic relations to guide the feature fusion process in a part-based manner, thereby accurately highlighting the discriminative parts of the target. Therefore, the problem of blurring response feature caused by correlation operation is effectively resolved. Furthermore, we propose an object-aware prediction network to learn object -aware features for classification and regression task, which effectively improves the discriminative ability of the prediction network. Experiments on many challenging benchmarks like OTB-100, LaSOT, TColor-128, GOT -10k and VOT2019, show that our methods achieves excellent performance.
引用
收藏
页数:10
相关论文
共 50 条
  • [21] CATrack: Convolution and Attention Feature Fusion for Visual Object Tracking
    Zhang, Longkun
    Wen, Jiajun
    Dai, Zichen
    Zhou, Rouyi
    Lai, Zhihui
    PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2023, PT IX, 2024, 14433 : 469 - 480
  • [22] Graph attention information fusion for Siamese adaptive attention tracking
    Wei, Lixin
    Xi, Zeyu
    Hu, Ziyu
    Sun, Hao
    APPLIED INTELLIGENCE, 2023, 53 (02) : 2068 - 2087
  • [23] Graph attention information fusion for Siamese adaptive attention tracking
    Lixin Wei
    Zeyu Xi
    Ziyu Hu
    Hao Sun
    Applied Intelligence, 2023, 53 : 2068 - 2087
  • [24] Object tracking based on siamese network with 3D attention and multiple graph attention
    Yan, Shilei
    Qi, Yujuan
    Liu, Mengxue
    Wang, Yanjiang
    Liu, Baodi
    COMPUTER VISION AND IMAGE UNDERSTANDING, 2023, 235
  • [25] Object Detection by Attention-Guided Feature Fusion Network
    Shi, Yuxuan
    Fan, Yue
    Xu, Siqi
    Gao, Yue
    Gao, Ran
    SYMMETRY-BASEL, 2022, 14 (05):
  • [26] Channel and spatial attention-based Siamese network for visual object tracking
    Tian, Shishun
    Chen, Zixi
    Chen, Bolin
    Zou, Wenbin
    Li, Xia
    JOURNAL OF ELECTRONIC IMAGING, 2021, 30 (03)
  • [27] Attention shake siamese network with auxiliary relocation branch for visual object tracking
    Wang, Jun
    Liu, Weibin
    Xing, Weiwei
    Wang, Liqiang
    Zhang, Shunli
    NEUROCOMPUTING, 2020, 400 : 53 - 72
  • [28] Siamese High-Level Feature Refine Network for Visual Object Tracking
    Rahman, Md. Maklachur
    Ahmed, Md Rishad
    Laishram, Lamyanba
    Kim, Seock Ho
    Jung, Soon Ki
    ELECTRONICS, 2020, 9 (11) : 1 - 21
  • [29] Efficient Siamese model for visual object tracking with attention-based fusion modules
    Zhou, Wenjun
    Liu, Yao
    Wang, Nan
    Liang, Dong
    Peng, Bo
    SIGNAL IMAGE AND VIDEO PROCESSING, 2024, 18 (11) : 7801 - 7810
  • [30] Deformable Siamese Attention Networks for Visual Object Tracking
    Yu, Yuechen
    Xiong, Yilei
    Huang, Weilin
    Scott, Matthew R.
    2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, : 6727 - 6736