Multi-View Fusion Network-Based Gesture Recognition Using sEMG Data

Cited by: 14

Authors:

Li, Gongfa ^{[1]}

Zou, Cejing ^{[1]}

Jiang, Guozhang ^{[2]}

Jiang, Du ^{[3]}

Yun, Juntong ^{[2]}

Zhao, Guojun ^{[4]}

Cheng, Yangwei ^{[5]}

Affiliations:

[1] Wuhan Univ Sci & Technol, Key Lab Met Equipment & Control Technol, Minist Educ, Wuhan 430081, Peoples R China

[2] Wuhan Univ Sci & Technol, Hubei Key Lab Mech Transmiss & Mfg Engn, Wuhan 430081, Peoples R China

[3] Wuhan Univ Sci & Technol, Res Ctr Biomimet Robot & Intelligent Measurement &, Wuhan 430081, Peoples R China

[4] Hubei Longzhong Lab, Xiangyang 441000, Peoples R China

[5] Wuhan Univ Sci & Technol, Precis Mfg Res Inst, Wuhan 430081, Peoples R China

Source:

IEEE JOURNAL OF BIOMEDICAL AND HEALTH INFORMATICS | 2024 / Vol. 28 / Issue 08

Funding:

National Natural Science Foundation of China;

Keywords:

Feature extraction; Transfer learning; Data mining; Deep learning; Electromyography; Convolution; Neural networks; Electromyographic feature pictures; multi-view fusion network; multi-view learning; sparse sEMG; SwT;

DOI:

10.1109/JBHI.2023.3287979

Chinese Library Classification (CLC) number:

TP [Automation Technology, Computer Technology];

Discipline classification code:

0812;

Abstract:

Surface electromyography (sEMG) signals have been widely used in rehabilitation medicine over the past decades because they are non-invasive, convenient, and informative, especially in human action recognition, which has developed rapidly. However, research on sparse EMG in multi-view fusion has made less progress than research on high-density EMG signals, and enriching sparse EMG feature information requires a method that effectively reduces the loss of feature-signal information in the channel dimension. In this article, a novel IMSE (Inception-MaxPooling-Squeeze-Excitation) network module is proposed to reduce the loss of feature information during deep learning. Multiple feature encoders are then constructed to enrich the information of sparse sEMG feature maps, based on the multi-core parallel processing method in multi-view fusion networks, while SwT (Swin Transformer) is used as the classification backbone network. A comparison of feature fusion at different decision layers of the multi-view fusion network shows experimentally that decision-layer fusion better improves the classification performance of the network. On NinaPro DB1, the proposed network achieves 93.96% average accuracy in gesture classification with feature maps obtained from a 300 ms time window, and the maximum variation in recognition rate across individuals is less than 11.2%. The results show that the proposed multi-view learning framework helps reduce individual differences and augment channel feature information, which provides a reference for non-dense biosignal pattern recognition.
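
The decision-layer fusion mentioned in the abstract can be illustrated with a minimal sketch. The abstract does not state the fusion rule, so this example assumes one common choice: each view's classifier emits class scores, the scores are converted to probabilities with softmax, and the probabilities are combined by a weighted average. The names `softmax` and `fuse_decisions`, the optional `weights` parameter, and the sample scores are hypothetical and are not taken from the paper.

```python
import math


def softmax(logits):
    """Convert raw class scores to probabilities (numerically stable)."""
    peak = max(logits)
    exps = [math.exp(x - peak) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def fuse_decisions(view_logits, weights=None):
    """Weighted average of per-view class probabilities.

    Assumed fusion rule for illustration only, not the paper's method.
    Returns (fused_probabilities, predicted_class_index).
    """
    if not view_logits:
        raise ValueError("at least one view is required")
    n_classes = len(view_logits[0])
    if any(len(v) != n_classes for v in view_logits):
        raise ValueError("all views must score the same number of classes")
    if weights is None:
        weights = [1.0] * len(view_logits)  # equal trust in every view
    if len(weights) != len(view_logits):
        raise ValueError("one weight per view is required")

    total_weight = sum(weights)
    fused = [0.0] * n_classes
    for weight, logits in zip(weights, view_logits):
        for i, prob in enumerate(softmax(logits)):
            fused[i] += weight * prob / total_weight

    predicted = max(range(n_classes), key=fused.__getitem__)
    return fused, predicted


# Three hypothetical views scoring three gesture classes.
views = [[2.0, 1.0, 0.1], [0.5, 2.5, 0.3], [0.2, 2.0, 0.1]]
fused_probs, gesture = fuse_decisions(views)
print(gesture, [round(p, 3) for p in fused_probs])
```

In this sketch the first view favors class 0 while the other two favor class 1, so the averaged decision selects class 1. Fusing at the decision level lets each view keep its own encoder and combines them only at the output.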


Pages: 4432 - 4443

Page count: 12
