EVOTA: an enhanced visual object tracking network with attention mechanism

Authors
An Zhao
Yi Zhang
Affiliation
[1] Sichuan University, Department of Computer Science
Keywords
Attention mechanism; Visual tracking; Transformer
DOI
Not available
Abstract
Transformer architectures have achieved breakthroughs in various downstream computer vision tasks and have shown great potential in visual object tracking. However, existing transformer-based approaches adopt a pixel-to-pixel attention strategy to integrate domain knowledge but fail to exploit the channel and location information in object features, which limits the expressiveness of the tracker. To address this problem, we propose a novel tracking framework, dubbed EVOTA, in which two attention blocks are fused with a Transformer. It has four modules: a feature extraction module, an enhanced attention module, a transformer module, and a model predictor. Specifically, a channel-wise attention module adaptively recalibrates channel-wise feature responses by explicitly modelling interdependencies between channels. A local cross-channel interaction scheme learns strong channel context information. Meanwhile, an energy function is developed to analyze the importance of each neuron and infer its 3D weight. Extensive experiments on five prevalent tracking benchmarks verify the effectiveness of our model, with EVOTA outperforming several state-of-the-art methods.
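The abstract names two attention ideas without giving formulas: a local cross-channel interaction over channel descriptors, and an energy function that assigns a weight to every neuron. The sketch below is only an illustration of what such blocks can look like, not the authors' implementation. It assumes a feature map stored as nested lists of shape C x H x W, a fixed (not learned) averaging kernel for the cross-channel step, and an energy form borrowed from parameter-free attention in the style of SimAM. The names `channel_attention`, `energy_attention`, `kernel`, and `lam` are hypothetical.

```python
import math


def sigmoid(z):
    """Logistic function used to squash scores into (0, 1)."""
    return 1.0 / (1.0 + math.exp(-z))


def channel_attention(x, kernel=(1 / 3, 1 / 3, 1 / 3)):
    """Rescale each channel by a weight computed from its neighbours.

    x is a list of C channels, each an H x W list of lists.
    The kernel is a fixed stand-in for weights that would normally be learned.
    """
    # Squeeze: global average pooling gives one descriptor per channel.
    pooled = [sum(sum(row) for row in ch) / (len(ch) * len(ch[0])) for ch in x]
    k = len(kernel)
    pad = k // 2
    padded = [0.0] * pad + pooled + [0.0] * pad
    # Local cross-channel interaction: 1D convolution over adjacent channels.
    weights = [
        sigmoid(sum(kernel[j] * padded[c + j] for j in range(k)))
        for c in range(len(pooled))
    ]
    # Excite: multiply every value in a channel by that channel's weight.
    return [[[v * w for v in row] for row in ch] for ch, w in zip(x, weights)]


def energy_attention(x, lam=1e-4):
    """Weight every neuron by how much it deviates from its channel mean.

    Assumed form (SimAM-style): score = d / (4 * (var + lam)) + 0.5,
    where d is the squared deviation from the channel mean.
    """
    out = []
    for ch in x:
        flat = [v for row in ch for v in row]
        mu = sum(flat) / len(flat)
        n = max(len(flat) - 1, 1)  # guard against a 1 x 1 channel
        var = sum((v - mu) ** 2 for v in flat) / n
        out.append([
            [v * sigmoid((v - mu) ** 2 / (4 * (var + lam)) + 0.5) for v in row]
            for row in ch
        ])
    return out


# Toy feature map: 2 channels of size 2 x 2.
features = [[[1.0, 2.0], [3.0, 4.0]], [[0.0, 0.0], [0.0, 0.0]]]
recalibrated = channel_attention(features)
refined = energy_attention(features)
```

In this sketch the channel step yields one weight per channel, while the energy step yields one weight per position and channel, which is one reading of the "3D weights" mentioned in the abstract.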
Pages: 24939 - 24960
Page count: 21