Siamese Tracking Network with Multi-attention Mechanism

被引:0
|
作者
Xu, Yuzhuo [1 ]
Li, Ting [1 ]
Zhu, Bing [2 ]
Wang, Fasheng [1 ]
Sun, Fuming [1 ]
机构
[1] Dalian Minzu Univ, Sch Informat & Commun Engn, Liaohexi Rd, Dalian 116600, Liaoning, Peoples R China
[2] Harbin Inst Technol, Dept Informat Engn, Xidazhi St, Harbin 150006, Heilongjiang, Peoples R China
基金
中国国家自然科学基金;
关键词
Object tracking; Feature representation; Multi-scale feature fusion; Transformer; Multi-attention mechanism; VISUAL TRACKING;
D O I
10.1007/s11063-024-11670-5
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Object trackers based on Siamese networks view tracking as a similarity-matching process. However, the correlation operation operates as a local linear matching process, limiting the tracker's ability to capture the intricate nonlinear relationship between the template and search region branches. Moreover, most trackers don't update the template and often use the first frame of an image as the initial template, which will easily lead to poor tracking performance of the algorithm when facing instances of deformation, scale variation, and occlusion of the tracking target. To this end, we propose a Simases tracking network with a multi-attention mechanism, including a template branch and a search branch. To adapt to changes in target appearance, we integrate dynamic templates and multi-attention mechanisms in the template branch to obtain more effective feature representation by fusing the features of initial templates and dynamic templates. To enhance the robustness of the tracking model, we utilize a multi-attention mechanism in the search branch that shares weights with the template branch to obtain multi-scale feature representation by fusing search region features at different scales. In addition, we design a lightweight and simple feature fusion mechanism, in which the Transformer encoder structure is utilized to fuse the information of the template area and search area, and the dynamic template is updated online based on confidence. Experimental results on publicly tracking datasets show that the proposed method achieves competitive results compared to several state-of-the-art trackers.
引用
收藏
页数:23
相关论文
共 50 条
  • [31] Learning Multi-Layer Attention Aggregation Siamese Network for Robust RGBT Tracking
    Feng, Mingzheng
    Su, Jianbo
    IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 26 : 3378 - 3391
  • [32] Visual Object Tracking by Hierarchical Attention Siamese Network
    Shen, Jianbing
    Tang, Xin
    Dong, Xingping
    Shao, Ling
    IEEE TRANSACTIONS ON CYBERNETICS, 2020, 50 (07) : 3068 - 3080
  • [33] SRMANet: Toward an Interpretable Neural Network with Multi-Attention Mechanism for Gearbox Fault Diagnosis
    Liu, Siyuan
    Huang, Jinying
    Ma, Jiancheng
    Luo, Jia
    APPLIED SCIENCES-BASEL, 2022, 12 (16):
  • [34] Stereo Matching Algorithm Based on Multi-Attention Mechanism
    Chen Qibo
    Ge Baozhen
    Li Yunpeng
    Quan Jianing
    LASER & OPTOELECTRONICS PROGRESS, 2022, 59 (16)
  • [35] Bearing Fault Diagnosis Using Convolutional Neural Network Based on a Multi-Attention Mechanism
    Kang T.
    Duan R.
    Yang L.
    Xue J.
    Liao Y.
    Hsi-An Chiao Tung Ta Hsueh/Journal of Xi'an Jiaotong University, 2022, 56 (12): : 68 - 77
  • [36] Multi-attention embedded network for salient object detection
    Wei He
    Chen Pan
    Wenlong Xu
    Ning Zhang
    Soft Computing, 2021, 25 : 13053 - 13067
  • [37] Multi-Attention Generative Adversarial Network for image captioning
    Wei, Yiwei
    Wang, Leiquan
    Cao, Haiwen
    Shao, Mingwen
    Wu, Chunlei
    NEUROCOMPUTING, 2020, 387 : 91 - 99
  • [38] Multi-Attention Network for Sewage Treatment Plant Detection
    Shuai, Yue
    Xie, Jun
    Lu, Kaixuan
    Chen, Zhengchao
    SUSTAINABILITY, 2023, 15 (07)
  • [39] Multi-Attention Network for Unsupervised Video Object Segmentation
    Zhang, Guifang
    Wong, Hon-Cheng
    Lo, Sio-Long
    IEEE SIGNAL PROCESSING LETTERS, 2021, 28 : 71 - 75
  • [40] Adaptive Multi-Feature Fusion Visual Target Tracking Based on Siamese Neural Network with Cross-Attention Mechanism
    Zhou, Qian
    Xia, Haoran
    Yan, Hongzheng
    Yang, Ming
    Chen, Shidong
    2022 22ND IEEE/ACM INTERNATIONAL SYMPOSIUM ON CLUSTER, CLOUD AND INTERNET COMPUTING (CCGRID 2022), 2022, : 307 - 316