Object semantic-guided graph attention feature fusion network for Siamese visual tracking

被引：3

作者：

Zhang, Jianwei ^{[1
]}

Miao, Mengen ^{[1
]}

Zhang, Huanlong ^{[2
]}

Wang, Jingchao ^{[1
]}

Zhao, Yanchun ^{[3
]}

Chen, Zhiwu ^{[2
]}

Qiao, Jianwei ^{[4
]}

机构：

[1] Zhengzhou Univ Light Ind, Coll Software Engn, Zhengzhou 450001, Peoples R China

[2] Zhengzhou Univ Light Ind, Coll Elect & Informat Engn, Zhengzhou 450002, Peoples R China

[3] Univ Elect Sci & Technol China, Yangtze Delta Reg Inst Huzhou, Huzhou 313001, Peoples R China

[4] Wolong Elect Nanyang Explos Proof Motor Grp, Nanyang 473000, Peoples R China

来源：

JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION | 2023年 / 90卷

基金：

中国国家自然科学基金;

关键词：

Visual tracking; Siamese network; Semantic; -guided; Graph attention; ROBUST;

D O I：

10.1016/j.jvcir.2022.103705

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

The similarity matching between the template and the search area plays a key role in Siamese-based trackers. Most Siamese-based trackers adopt correlation operation to perform feature fusion on the template branch and search branch for similarity matching. However, the correlation operation directly uses the template feature to slide the window on the search area feature without distinguishing the discriminant part of the target and the background noise, which blurs the spatial information of the response feature. To address this issue, this work proposes a novel object semantic-guided graph attention feature fusion network that both removes background information and focuses on the discriminative part of the object. The proposed network effectively removes background noise by utilizing an adaptive template instead of the fixed-size template used by the correlation operation. The network also models the contextual semantic relations of the target and uses the resulting se-mantic relations to guide the feature fusion process in a part-based manner, thereby accurately highlighting the discriminative parts of the target. Therefore, the problem of blurring response feature caused by correlation operation is effectively resolved. Furthermore, we propose an object-aware prediction network to learn object -aware features for classification and regression task, which effectively improves the discriminative ability of the prediction network. Experiments on many challenging benchmarks like OTB-100, LaSOT, TColor-128, GOT -10k and VOT2019, show that our methods achieves excellent performance.

引用

页数：10

共 50 条

[21] CATrack: Convolution and Attention Feature Fusion for Visual Object Tracking
Zhang, Longkun
Wen, Jiajun
Dai, Zichen
Zhou, Rouyi
Lai, Zhihui
PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2023, PT IX, 2024, 14433 : 469 - 480
[22] Graph attention information fusion for Siamese adaptive attention tracking
Wei, Lixin
Xi, Zeyu
Hu, Ziyu
Sun, Hao
APPLIED INTELLIGENCE, 2023, 53 (02) : 2068 - 2087
[23] Graph attention information fusion for Siamese adaptive attention tracking
Lixin Wei
Zeyu Xi
Ziyu Hu
Hao Sun
Applied Intelligence, 2023, 53 : 2068 - 2087
[24] Object tracking based on siamese network with 3D attention and multiple graph attention
Yan, Shilei
Qi, Yujuan
Liu, Mengxue
Wang, Yanjiang
Liu, Baodi
COMPUTER VISION AND IMAGE UNDERSTANDING, 2023, 235
[25] Object Detection by Attention-Guided Feature Fusion Network
Shi, Yuxuan
Fan, Yue
Xu, Siqi
Gao, Yue
Gao, Ran
SYMMETRY-BASEL, 2022, 14 (05):
[26] Channel and spatial attention-based Siamese network for visual object tracking
Tian, Shishun
Chen, Zixi
Chen, Bolin
Zou, Wenbin
Li, Xia
JOURNAL OF ELECTRONIC IMAGING, 2021, 30 (03)
[27] Attention shake siamese network with auxiliary relocation branch for visual object tracking
Wang, Jun
Liu, Weibin
Xing, Weiwei
Wang, Liqiang
Zhang, Shunli
NEUROCOMPUTING, 2020, 400 : 53 - 72
[28] Siamese High-Level Feature Refine Network for Visual Object Tracking
Rahman, Md. Maklachur
Ahmed, Md Rishad
Laishram, Lamyanba
Kim, Seock Ho
Jung, Soon Ki
ELECTRONICS, 2020, 9 (11) : 1 - 21
[29] Efficient Siamese model for visual object tracking with attention-based fusion modules
Zhou, Wenjun
Liu, Yao
Wang, Nan
Liang, Dong
Peng, Bo
SIGNAL IMAGE AND VIDEO PROCESSING, 2024, 18 (11) : 7801 - 7810
[30] Deformable Siamese Attention Networks for Visual Object Tracking
Yu, Yuechen
Xiong, Yilei
Huang, Weilin
Scott, Matthew R.
2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, : 6727 - 6736

← 1 2 3 4 5 →