Siamese Network Algorithm Based on Multi-Scale Channel Attention Fusion and Multi-Scale Depth-Wise Cross Correlation

被引:2
|
作者
Chen, Qingjun [1 ]
Zheng, Hua [1 ,2 ,3 ,4 ]
Pan, Hao [1 ]
Liao, Xiaoqi [1 ]
Wang, Hongkai [1 ]
机构
[1] Fujian Normal Univ, Coll Photon & Elect Engn, Fuzhou 350108, Peoples R China
[2] Fujian Normal Univ, Key Lab Optoelect Sci & Technol Med, Minist Educ, Fuzhou 350108, Peoples R China
[3] Fujian Normal Univ, Fujian Prov Key Lab Photon Technol, Fuzhou 350108, Peoples R China
[4] Fujian Prov Engn Res Ctr Optoelect Sensors & Inte, Fuzhou 350108, Peoples R China
关键词
Siamese network; visual object tracking; anchor-free regression strategy; multi-scale channel attention fusion; multi-scale depth-wise cross correlation; TRACKING;
D O I
10.1117/12.2680160
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The research takes the feature extraction network and depth-wise cross correlation learning method of the Siamese network as the starting point. Firstly, the regression strategy of the proposed framework is anchor-free, and the residual network ResNet50 is chosen as the backbone network, and add the channel attention mechanism SENet. The SE-MSCAM multi-scale channel attention model is proposed to make up for the lack of local feature extraction ability of the feature extraction network on the basis of SENet. On this basis, the attention fusion module AFFN is added to enhance the soft selection of attention. Combined with the SE-MSCAM multi-scale attention model and the attention fusion module AFFN, the ResNet50-AFFN multi-scale channel attention fusion network is proposed. Secondly, regarding the limitation of single-scale learning of SiamRPN++ depth-wise cross correlation, the MS-DWXCorr multi-scale depth-wise cross correlation is proposed which increases the diversity of learning feature scales to improve the efficiency of tracking network similarity learning. The experimental results show that, on the VOT2018 benchmark, the EAO of our method outperforms 4.0% of the mainstream algorithm SiamCAR, the tracking accuracy is improved by 3.4% and the tracking speed of our method maintains 40 FPS; the tracking success rate is improved by 2.0% and the tracking accuracy rate is improved by 3.2% compared to the mainstream algorithm SiamCAR. It has higher accuracy and robustness in dealing with occlusion, deformation, illumination variation, deformation, and other scenarios of visual tracking, and has better tracking performance.
引用
收藏
页数:12
相关论文
共 50 条
  • [41] Multi-scale attention network for image inpainting
    Qin, Jia
    Bai, Huihui
    Zhao, Yao
    COMPUTER VISION AND IMAGE UNDERSTANDING, 2021, 204
  • [42] QRS multi-scale fusion detection algorithm
    Sun, Tao
    Zhang, Hong-Jian
    Zhou, Li
    Zhejiang Daxue Xuebao (Gongxue Ban)/Journal of Zhejiang University (Engineering Science), 2002, 36 (01): : 26 - 28
  • [43] JAMFN: Joint Attention Multi-Scale Fusion Network for Depression Detection
    Zhou, Li
    Liu, Zhenyu
    Shangguan, Zixuan
    Yuan, Xiaoyan
    Li, Yutong
    Hu, Bin
    Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, 2023, 2023-August : 3417 - 3421
  • [44] Multi-scale feature fusion network with local attention for lung segmentation
    Xie, Yinghua
    Zhou, Yuntong
    Wang, Chen
    Ma, Yanshan
    Yang, Ming
    SIGNAL PROCESSING-IMAGE COMMUNICATION, 2023, 119
  • [45] Multi-Scale Cost Attention and Adaptive Fusion Stereo Matching Network
    Liu, Zhenguo
    Li, Zhao
    Ao, Wengang
    Zhang, Shaoshuang
    Liu, Wenlong
    He, Yizhi
    ELECTRONICS, 2023, 12 (07)
  • [46] JAMFN: Joint Attention Multi-Scale Fusion Network for Depression Detection
    Zhou, Li
    Liu, Zhenyu
    Shangguan, Zixuan
    Yuan, Xiaoyan
    Li, Yutong
    Hu, Bin
    INTERSPEECH 2023, 2023, : 3417 - 3421
  • [47] MAFormer: A transformer network with multi-scale attention fusion for visual recognition
    Sun, Huixin
    Wang, Yunhao
    Wang, Xiaodi
    Zhang, Bin
    Xin, Ying
    Zhang, Baochang
    Cao, Xianbin
    Ding, Errui
    Han, Shumin
    NEUROCOMPUTING, 2024, 595
  • [48] Pyramid attention object detection network with multi-scale feature fusion
    Chen, Xiu
    Li, Yujie
    Nakatoh, Yoshihisa
    COMPUTERS & ELECTRICAL ENGINEERING, 2022, 104
  • [49] Heart Sound Classification Based on Multi-Scale Feature Fusion and Channel Attention Module
    Li, Mingzhe
    He, Zhaoming
    Wang, Hao
    BIOENGINEERING-BASEL, 2025, 12 (03):
  • [50] Multi-Scale Feature Fusion Network with Attention for Single Image Dehazing
    Hu, Bin
    PATTERN RECOGNITION AND IMAGE ANALYSIS, 2021, 31 (04) : 608 - 615