Graph Attention Network for Context-Aware Visual Tracking

被引:0
|
作者
Shao, Yanyan [1 ]
Guo, Dongyan [1 ]
Cui, Ying [1 ]
Wang, Zhenhua [2 ]
Zhang, Liyan [3 ]
Zhang, Jianhua [4 ]
机构
[1] Zhejiang Univ Technol, Coll Comp Sci & Technol, Hangzhou 310023, Peoples R China
[2] Northwest A&F Univ, Coll Informat Engn, Xianyang 712199, Peoples R China
[3] Nanjing Univ Aeronaut & Astronaut, Coll Comp Sci & Technol, Nanjing 211106, Peoples R China
[4] Tianjin Univ Technol, Sch Comp Sci & Engn, Tianjin 300384, Peoples R China
基金
中国国家自然科学基金;
关键词
Target tracking; Visualization; Shape; Search problems; Object tracking; Feature extraction; Correlation; Context-aware tracking; graph attention mechanism; Siamese network; visual tracking; ONLINE OBJECT TRACKING;
D O I
10.1109/TNNLS.2024.3442290
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Siamese-network-based trackers convert the general object tracking as a similarity matching task between a template and a search region. Using convolutional feature cross correlation (Xcorr) for similarity matching, a large number of Siamese trackers are proposed and achieved great success. However, due to the predefined size of the target feature, these trackers suffer from either retaining much background information or losing important foreground information. Moreover, the global matching between the target and search region also largely neglects the part-level structural information and the contextual information of the target. To tackle the aforementioned obstacles, in this article, we propose a simple context-aware Siamese graph attention network, which establishes part-to-part correspondence between the Siamese branches with a complete bipartite graph. The object information from the template is propagated to the search region via a graph attention mechanism. With such a design, a target-aware template input is enabled to replace the prefixed template region, which can adaptively fit the size and aspect ratio variations in different objects. Based on it, we further construct a context-aware feature matching mechanism to embed both the target and the contextual information in the search region. Experiments on challenging benchmarks including GOT-10k, TrackingNet, LaSOT, VOT2020, and OTB-100 demonstrate that the proposed SiamGAT* outperforms many state-of-the-art trackers and achieves leading performance. Code is available at: https://git.io/SiamGAT.
引用
收藏
页数:14
相关论文
共 50 条
  • [31] Context-Aware Graph Label Propagation Network for Saliency Detection
    Ji, Wei
    Li, Xi
    Wei, Lina
    Wu, Fei
    Zhuang, Yueting
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2020, 29 : 8177 - 8186
  • [32] A Dynamic and Static Context-Aware Attention Network for Trajectory Prediction
    Yu, Jian
    Zhou, Meng
    Wang, Xin
    Pu, Guoliang
    Cheng, Chengqi
    Chen, Bo
    ISPRS INTERNATIONAL JOURNAL OF GEO-INFORMATION, 2021, 10 (05)
  • [33] Context-Aware Attention Network for Image-Text Retrieval
    Zhang, Qi
    Lei, Zhen
    Zhang, Zhaoxiang
    Li, Stan Z.
    2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, : 3533 - 3542
  • [34] Context-aware Attention Network for Predicting Image Aesthetic Subjectivity
    Xu, Munan
    Zhong, Jia-Xing
    Ren, Yurui
    Liu, Shan
    Li, Ge
    MM '20: PROCEEDINGS OF THE 28TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, 2020, : 798 - 806
  • [35] Context-Aware Attention Network for Human Emotion Recognition in Video
    Liu, Xiaodong
    Wang, Miao
    ADVANCES IN MULTIMEDIA, 2020, 2020
  • [36] Stacked Multimodal Attention Network for Context-Aware Video Captioning
    Zheng, Yi
    Zhang, Yuejie
    Feng, Rui
    Zhang, Tao
    Fan, Weiguo
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, 32 (01) : 31 - 42
  • [37] Relevant Visual Semantic Context-Aware Attention-Based Dialog
    Hong, Eugene Tan Boon
    Chong, Yung-Wey
    Wan, Tat-Chee
    Yau, Kok-Lim Alvin
    CMC-COMPUTERS MATERIALS & CONTINUA, 2023, 76 (02): : 2337 - 2354
  • [38] Context-Aware and Occlusion Handling Mechanism for Online Visual Object Tracking
    Mehmood, Khizer
    Jalil, Abdul
    Ali, Ahmad
    Khan, Baber
    Murad, Maria
    Khan, Wasim Ullah
    He, Yigang
    ELECTRONICS, 2021, 10 (01) : 1 - 16
  • [39] Lightweight visual backbone network with enhanced comprehensive strength through context-aware dual attention mechanism
    Xue, Jianxin
    Hu, Yaohua
    Hua, Sicheng
    Chen, Minyu
    Wu, Ling-, I
    Chang, Xi
    Li, Guoqiang
    NEUROCOMPUTING, 2025, 624
  • [40] Context-Aware Correlation Filter for Visual Tracking with Deep Convolution Features
    Zhang, Leyi
    Wu, Huicong
    Song, Jie
    PROCEEDINGS OF THE 2018 IEEE INTERNATIONAL CONFERENCE ON PROGRESS IN INFORMATICS AND COMPUTING (PIC), 2018, : 1 - 7