Graph attention information fusion for Siamese adaptive attention tracking

被引:4
|
作者
Wei, Lixin [1 ,2 ]
Xi, Zeyu [1 ,2 ]
Hu, Ziyu [1 ,2 ]
Sun, Hao [1 ,2 ]
机构
[1] Yanshan Univ, Engn Res Ctr, Minist Educ Intelligent Control Syst & Intelligen, Qinhuangdao, Hebei, Peoples R China
[2] Yanshan Univ, Key Lab Ind Comp Control Engn Hebei Prov, Qinhuangdao, Hebei, Peoples R China
基金
中国国家自然科学基金;
关键词
Target tracking; Siamese adaptive attention; Graph attention information fusion; Template update; Layerwise aggregation;
D O I
10.1007/s10489-022-03502-7
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
A single target tracker based on a Siamese network regards tracking as a process of similarity matching. The convolution features of the template branch and search area branch realize similarity matching and information fusion by a correlation operation. However, the correlation operation is a local linear matching, which limits the tracker to capturing the complex nonlinear relationship between the template branch and search area branch. In addition, it is easy to lose useful information. Moreover, most trackers do not update the template. The template branch and the search area branch compute convolution features independently without information exchange. To solve these existing problems, a graph attention information fusion for Siamese adaptive attention tracking network (GIFT) is proposed. The information flow between the template branch and search area branch is connected by designing a Siamese adaptive attention module (SAA), and the template information is updated indirectly. The graph attention information fusion module (GAIF) is proposed to effectively fuse the information of the template branch and search area branch and realize the similarity matching of their corresponding parts. Layerwise aggregation makes full use of the shallow and deep features of neural networks. This further improves tracking performance. Experiments on 6 challenging benchmarks, including GOT-10k, OTB100, VOT2018, VOT2019, UAV123 and LaSOT, demonstrate that GIFT has the leading performance and runs at 28.34 FPS, which surpasses the real-time level of 25 FPS.
引用
收藏
页码:2068 / 2087
页数:20
相关论文
共 50 条
  • [1] Graph attention information fusion for Siamese adaptive attention tracking
    Lixin Wei
    Zeyu Xi
    Ziyu Hu
    Hao Sun
    Applied Intelligence, 2023, 53 : 2068 - 2087
  • [2] SiamSGA: Siamese Symmetric Graph Attention Tracking
    Sun, Pengzhan
    Gao, Xiaoguang
    Zhang, Bojie
    Wang, Yangyang
    2024 9TH INTERNATIONAL CONFERENCE ON CONTROL AND ROBOTICS ENGINEERING, ICCRE 2024, 2024, : 326 - 333
  • [3] Siamese Attention Networks with Adaptive Templates for Visual Tracking
    Zhang, Bo
    Liang, Zhixue
    Dong, Wenyong
    MOBILE INFORMATION SYSTEMS, 2022, 2022
  • [4] Object semantic-guided graph attention feature fusion network for Siamese visual tracking
    Zhang, Jianwei
    Miao, Mengen
    Zhang, Huanlong
    Wang, Jingchao
    Zhao, Yanchun
    Chen, Zhiwu
    Qiao, Jianwei
    JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2023, 90
  • [5] Siamese Graph Attention Networks for robust visual object tracking
    Lu, Junjie
    Li, Shengyang
    Guo, Weilong
    Zhao, Manqi
    Yang, Jian
    Liu, Yunfei
    Zhou, Zhuang
    COMPUTER VISION AND IMAGE UNDERSTANDING, 2023, 229
  • [6] Object tracking based on siamese network with 3D attention and multiple graph attention
    Yan, Shilei
    Qi, Yujuan
    Liu, Mengxue
    Wang, Yanjiang
    Liu, Baodi
    COMPUTER VISION AND IMAGE UNDERSTANDING, 2023, 235
  • [7] Siamese tracking combing frequency channel attention with adaptive template
    Pang, Haibo
    Xie, Meiqin
    Liu, Chengming
    Ma, Rongqi
    Han, Linxuan
    IET COMMUNICATIONS, 2021, 15 (20) : 2493 - 2502
  • [8] SGAT: Shuffle and graph attention based Siamese networks for visual tracking
    Wang, Jun
    Zhang, Limin
    Zhang, Wenshuang
    Wang, Yuanyun
    Deng, Chengzhi
    PLOS ONE, 2022, 17 (11):
  • [9] Siamese Progressive Attention-Guided Fusion Network for Object Tracking
    Fan Y.
    Song X.
    Song, Xiaoning (x.song@jiangnan.edu.cn), 1600, Institute of Computing Technology (33): : 199 - 206
  • [10] Paralleled attention modules and adaptive focal loss for Siamese visual tracking
    Zhao, Yuyao
    Jiang, Min
    Kong, Jun
    Li, Sha
    IET IMAGE PROCESSING, 2021, 15 (06) : 1345 - 1358