Evota: an enhanced visual object tracking network with attention mechanism

被引:0
|
作者
An Zhao
Yi Zhang
机构
[1] Sichuan University,Department of Computer Science
来源
关键词
Attention mechanism; Visual tracking; Transformer;
D O I
暂无
中图分类号
学科分类号
摘要
Transformer architecture has made breakthrough in various downstream computer vision tasks and has shown its great potential in visual object tracking. However, existing transformer-based approaches adopt pixel-to-pixel attention strategy to integrate the domain knowledge, but fail to explore the channel and location information from object features, which limits the expressivity of the tracker. To address the above problems, we propose a novel tracking framework, where we propose 2 attention blocks that fuses with Transformer (dubbed EVOTA). It has 4 modules: the feature extraction module, the enhanced attention module, a transformer module and a model predictor. Specifically, a channel-wise attention module re-calibrates the channel-wise feature responses in an adaptive way by modelling interdependencies explicitly between channels. A local cross-channel interaction scheme learns strong channel context information. Meanwhile, an energy function is developed to analyze the importance of each neuron and infers their 3D weights. Extensive experiments have been carried out on 5 prevalent tracking benchmarks to testify the effectiveness of our model, in which EVOTA outperforms several state-of-the-art methods.
引用
收藏
页码:24939 / 24960
页数:21
相关论文
共 50 条
  • [1] Evota: an enhanced visual object tracking network with attention mechanism
    Zhao, An
    Zhang, Yi
    MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 83 (8) : 24939 - 24960
  • [2] Visual Object Tracking by Hierarchical Attention Siamese Network
    Shen, Jianbing
    Tang, Xin
    Dong, Xingping
    Shao, Ling
    IEEE TRANSACTIONS ON CYBERNETICS, 2020, 50 (07) : 3068 - 3080
  • [3] Object-Aware Adaptive Convolution Kernel Attention Mechanism in Siamese Network for Visual Tracking
    Yuan, Dongliang
    Li, Qingdang
    Yang, Xiaohui
    Zhang, Mingyue
    Sun, Zhen
    APPLIED SCIENCES-BASEL, 2022, 12 (02):
  • [4] MASNet: mixed attention Siamese network for visual object tracking
    Zhang, Jianwei
    Zhang, Zhichen
    Zhang, Huanlong
    Wang, Jingchao
    Wang, He
    Zheng, Menya
    SYSTEMS SCIENCE & CONTROL ENGINEERING, 2024, 12 (01)
  • [5] SiamDTA: Dual-Template Siamese Network Visual Object Tracking Algorithm Based on Attention Mechanism
    Wan, Zhen
    Ma, Sugang
    Zhang, Zixian
    Sun, Siwei
    2024 INTERNATIONAL CONFERENCE ON NETWORKING AND NETWORK APPLICATIONS, NANA 2024, 2024, : 418 - 423
  • [6] A novel Siamese Attention Network for visual object tracking of autonomous vehicles
    Chen, Jia
    Ai, Yibo
    Qian, Yuhan
    Zhang, Weidong
    PROCEEDINGS OF THE INSTITUTION OF MECHANICAL ENGINEERS PART D-JOURNAL OF AUTOMOBILE ENGINEERING, 2021, 235 (10-11) : 2764 - 2775
  • [7] A robust attention-enhanced network with transformer for visual tracking
    Fengwei Gu
    Jun Lu
    Chengtao Cai
    Multimedia Tools and Applications, 2023, 82 : 40761 - 40782
  • [8] A robust attention-enhanced network with transformer for visual tracking
    Gu, Fengwei
    Lu, Jun
    Cai, Chengtao
    MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 82 (26) : 40761 - 40782
  • [9] Object Tracking Based on Visual Attention
    Lin, Mingqiang
    Dai, Houde
    2016 IEEE INTERNATIONAL CONFERENCE ON INFORMATION AND AUTOMATION (ICIA), 2016, : 1846 - 1849
  • [10] Small object detection based on attention mechanism and enhanced network
    Wang, Bingbing
    Zhang, Fengxiang
    Li, Kaipeng
    Shi, Kuijie
    Wang, Lei
    Liu, Gang
    INTELLIGENT DATA ANALYSIS, 2023, 27 (06) : 1725 - 1739