Siamese Network Algorithm Based on Multi-Scale Channel Attention Fusion and Multi-Scale Depth-Wise Cross Correlation

Cited by: 2

Authors:

Chen, Qingjun [1]

Zheng, Hua [1,2,3,4]

Pan, Hao [1]

Liao, Xiaoqi [1]

Wang, Hongkai [1]

Affiliations:

[1] Fujian Normal Univ, Coll Photon & Elect Engn, Fuzhou 350108, Peoples R China

[2] Fujian Normal Univ, Key Lab Optoelect Sci & Technol Med, Minist Educ, Fuzhou 350108, Peoples R China

[3] Fujian Normal Univ, Fujian Prov Key Lab Photon Technol, Fuzhou 350108, Peoples R China

[4] Fujian Prov Engn Res Ctr Optoelect Sensors & Inte, Fuzhou 350108, Peoples R China

Source:

FOURTEENTH INTERNATIONAL CONFERENCE ON GRAPHICS AND IMAGE PROCESSING, ICGIP 2022 | 2022 / Vol. 12705

Keywords:

Siamese network; visual object tracking; anchor-free regression strategy; multi-scale channel attention fusion; multi-scale depth-wise cross correlation; TRACKING;

DOI:

10.1117/12.2680160

Chinese Library Classification number:

TP18 [Artificial intelligence theory];

Subject classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

This research takes the feature extraction network and the depth-wise cross-correlation learning method of the Siamese network as its starting point. First, the proposed framework uses an anchor-free regression strategy, with the residual network ResNet50 as the backbone and the channel attention mechanism SENet added to it. Building on SENet, the SE-MSCAM multi-scale channel attention model is proposed to compensate for the feature extraction network's weak local feature extraction. The attention fusion module AFFN is then added to enhance the soft selection of attention. Combining the SE-MSCAM multi-scale attention model with the AFFN attention fusion module yields the proposed ResNet50-AFFN multi-scale channel attention fusion network. Second, to address the single-scale learning limitation of the SiamRPN++ depth-wise cross correlation, the MS-DWXCorr multi-scale depth-wise cross correlation is proposed; it increases the diversity of learned feature scales to improve the efficiency of the tracking network's similarity learning. Experimental results show that, on the VOT2018 benchmark, the EAO of our method exceeds that of the mainstream algorithm SiamCAR by 4.0%, the tracking accuracy improves by 3.4%, and the tracking speed remains at 40 FPS; the tracking success rate improves by 2.0% and the tracking accuracy rate by 3.2% compared with SiamCAR. The method shows higher accuracy and robustness under occlusion, deformation, illumination variation, and other visual tracking scenarios, and achieves better tracking performance overall.
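As a rough illustration of the single-scale depth-wise cross correlation that the abstract takes as its baseline, the sketch below correlates each channel of a search feature map with the matching channel of a template feature map. It is a minimal pure-Python sketch under stated assumptions, not the authors' implementation: the function name `depthwise_xcorr`, the nested-list feature layout, and the toy values are all hypothetical, and the multi-scale MS-DWXCorr variant is not reproduced because the abstract does not specify it.

```python
from typing import List

# Assumed layout for this sketch: feature[channel][row][col]
Feature = List[List[List[float]]]


def depthwise_xcorr(search: Feature, template: Feature) -> Feature:
    """Slide each template channel over the matching search channel.

    Channels are never mixed, so the response keeps one map per channel.
    """
    if len(search) != len(template):
        raise ValueError("search and template need the same channel count")
    response: Feature = []
    for s_ch, t_ch in zip(search, template):
        sh, sw = len(s_ch), len(s_ch[0])
        th, tw = len(t_ch), len(t_ch[0])
        out_h, out_w = sh - th + 1, sw - tw + 1
        if out_h < 1 or out_w < 1:
            raise ValueError("template must not be larger than search")
        ch_map = []
        for i in range(out_h):
            row = []
            for j in range(out_w):
                # Inner product of the template with one search window
                acc = 0.0
                for u in range(th):
                    for v in range(tw):
                        acc += s_ch[i + u][j + v] * t_ch[u][v]
                row.append(acc)
            ch_map.append(row)
        response.append(ch_map)
    return response


# Toy inputs (hypothetical values): 2 channels, 3x3 search, 2x2 template
search = [
    [[1, 2, 3], [4, 5, 6], [7, 8, 9]],
    [[1, 1, 1], [1, 1, 1], [1, 1, 1]],
]
template = [
    [[1, 0], [0, 1]],
    [[1, 1], [1, 1]],
]
response = depthwise_xcorr(search, template)
```

Each channel of `response` is a similarity map whose size is the search size minus the template size plus one. In a deep-learning framework the same operation is usually expressed as a grouped convolution with one group per channel.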

Pages: 12
