Graph Attention Network for Context-Aware Visual Tracking

被引：0

作者：

Shao, Yanyan ^{[1
]}

Guo, Dongyan ^{[1
]}

Cui, Ying ^{[1
]}

Wang, Zhenhua ^{[2
]}

Zhang, Liyan ^{[3
]}

Zhang, Jianhua ^{[4
]}

机构：

[1] Zhejiang Univ Technol, Coll Comp Sci & Technol, Hangzhou 310023, Peoples R China

[2] Northwest A&F Univ, Coll Informat Engn, Xianyang 712199, Peoples R China

[3] Nanjing Univ Aeronaut & Astronaut, Coll Comp Sci & Technol, Nanjing 211106, Peoples R China

[4] Tianjin Univ Technol, Sch Comp Sci & Engn, Tianjin 300384, Peoples R China

来源：

IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS | 2024年

基金：

中国国家自然科学基金;

关键词：

Target tracking; Visualization; Shape; Search problems; Object tracking; Feature extraction; Correlation; Context-aware tracking; graph attention mechanism; Siamese network; visual tracking; ONLINE OBJECT TRACKING;

D O I：

10.1109/TNNLS.2024.3442290

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Siamese-network-based trackers convert the general object tracking as a similarity matching task between a template and a search region. Using convolutional feature cross correlation (Xcorr) for similarity matching, a large number of Siamese trackers are proposed and achieved great success. However, due to the predefined size of the target feature, these trackers suffer from either retaining much background information or losing important foreground information. Moreover, the global matching between the target and search region also largely neglects the part-level structural information and the contextual information of the target. To tackle the aforementioned obstacles, in this article, we propose a simple context-aware Siamese graph attention network, which establishes part-to-part correspondence between the Siamese branches with a complete bipartite graph. The object information from the template is propagated to the search region via a graph attention mechanism. With such a design, a target-aware template input is enabled to replace the prefixed template region, which can adaptively fit the size and aspect ratio variations in different objects. Based on it, we further construct a context-aware feature matching mechanism to embed both the target and the contextual information in the search region. Experiments on challenging benchmarks including GOT-10k, TrackingNet, LaSOT, VOT2020, and OTB-100 demonstrate that the proposed SiamGAT* outperforms many state-of-the-art trackers and achieves leading performance. Code is available at: https://git.io/SiamGAT.

引用

页数：14

共 50 条

[31] Context-Aware Graph Label Propagation Network for Saliency Detection
Ji, Wei
Li, Xi
Wei, Lina
Wu, Fei
Zhuang, Yueting
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2020, 29 : 8177 - 8186
[32] A Dynamic and Static Context-Aware Attention Network for Trajectory Prediction
Yu, Jian
Zhou, Meng
Wang, Xin
Pu, Guoliang
Cheng, Chengqi
Chen, Bo
ISPRS INTERNATIONAL JOURNAL OF GEO-INFORMATION, 2021, 10 (05)
[33] Context-Aware Attention Network for Image-Text Retrieval
Zhang, Qi
Lei, Zhen
Zhang, Zhaoxiang
Li, Stan Z.
2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, : 3533 - 3542
[34] Context-aware Attention Network for Predicting Image Aesthetic Subjectivity
Xu, Munan
Zhong, Jia-Xing
Ren, Yurui
Liu, Shan
Li, Ge
MM '20: PROCEEDINGS OF THE 28TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, 2020, : 798 - 806
[35] Context-Aware Attention Network for Human Emotion Recognition in Video
Liu, Xiaodong
Wang, Miao
ADVANCES IN MULTIMEDIA, 2020, 2020
[36] Stacked Multimodal Attention Network for Context-Aware Video Captioning
Zheng, Yi
Zhang, Yuejie
Feng, Rui
Zhang, Tao
Fan, Weiguo
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, 32 (01) : 31 - 42
[37] Relevant Visual Semantic Context-Aware Attention-Based Dialog
Hong, Eugene Tan Boon
Chong, Yung-Wey
Wan, Tat-Chee
Yau, Kok-Lim Alvin
CMC-COMPUTERS MATERIALS & CONTINUA, 2023, 76 (02): : 2337 - 2354
[38] Context-Aware and Occlusion Handling Mechanism for Online Visual Object Tracking
Mehmood, Khizer
Jalil, Abdul
Ali, Ahmad
Khan, Baber
Murad, Maria
Khan, Wasim Ullah
He, Yigang
ELECTRONICS, 2021, 10 (01) : 1 - 16
[39] Lightweight visual backbone network with enhanced comprehensive strength through context-aware dual attention mechanism
Xue, Jianxin
Hu, Yaohua
Hua, Sicheng
Chen, Minyu
Wu, Ling-, I
Chang, Xi
Li, Guoqiang
NEUROCOMPUTING, 2025, 624
[40] Context-Aware Correlation Filter for Visual Tracking with Deep Convolution Features
Zhang, Leyi
Wu, Huicong
Song, Jie
PROCEEDINGS OF THE 2018 IEEE INTERNATIONAL CONFERENCE ON PROGRESS IN INFORMATICS AND COMPUTING (PIC), 2018, : 1 - 7

← 1 2 3 4 5 →