Multi-View Fusion Network-Based Gesture Recognition Using sEMG Data

Cited by: 14
Authors
Li, Gongfa [1]
Zou, Cejing [1]
Jiang, Guozhang [2]
Jiang, Du [3]
Yun, Juntong [2]
Zhao, Guojun [4]
Cheng, Yangwei [5]
Affiliations
[1] Wuhan Univ Sci & Technol, Key Lab Met Equipment & Control Technol, Minist Educ, Wuhan 430081, Peoples R China
[2] Wuhan Univ Sci & Technol, Hubei Key Lab Mech Transmiss & Mfg Engn, Wuhan 430081, Peoples R China
[3] Wuhan Univ Sci & Technol, Res Ctr Biomimet Robot & Intelligent Measurement &, Wuhan 430081, Peoples R China
[4] Hubei Longzhong Lab, Xiangyang 441000, Peoples R China
[5] Wuhan Univ Sci & Technol, Precis Mfg Res Inst, Wuhan 430081, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
Feature extraction; Transfer learning; Data mining; Deep learning; Electromyography; Convolution; Neural networks; Electromyographic feature pictures; multi-view fusion network; multi-view learning; sparse sEMG; SwT;
DOI
10.1109/JBHI.2023.3287979
Chinese Library Classification
TP [Automation technology, computer technology];
Discipline classification code
0812;
Abstract
Surface electromyography (sEMG) signals have been widely used in rehabilitation medicine over the past decades because they are non-invasive, convenient to acquire, and informative, particularly in human action recognition, a field that has developed rapidly. However, research on sparse EMG in multi-view fusion has made less progress than research on high-density EMG signals. Enriching sparse EMG feature information requires a method that effectively reduces the loss of feature information along the channel dimension. In this article, a novel IMSE (Inception-MaxPooling-Squeeze-Excitation) network module is proposed to reduce the loss of feature information during deep learning. Multiple feature encoders are then constructed to enrich the information in sparse sEMG feature maps, based on the multi-core parallel processing method used in multi-view fusion networks, while SwT (Swin Transformer) serves as the classification backbone network. Comparing the feature fusion effects of different decision layers of the multi-view fusion network shows experimentally that fusing the decision layers better improves the classification performance of the network. On NinaPro DB1, the proposed network achieves 93.96% average accuracy in gesture classification using feature maps obtained with a 300 ms time window, and the maximum variation in recognition rate across individuals is less than 11.2%. The results show that the proposed multi-view learning framework helps reduce inter-individual differences and augment channel feature information, which provides a reference for non-dense biosignal pattern recognition.
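As a rough illustration of two ideas named in the abstract, the sketch below cuts a sparse multi-channel recording into 300 ms windows and applies a generic squeeze-and-excitation gate that reweights channels. It is not the authors' IMSE module, whose internal layout is not given in this record. The function names, the 100 ms step, the reduction ratio, and the random weights are all hypothetical, and the 100 Hz sampling rate and 10 channels are assumptions about NinaPro DB1 rather than values stated here.

```python
import numpy as np


def sliding_windows(signal, fs_hz, window_ms, step_ms):
    """Split a (samples, channels) recording into overlapping windows.

    Returns an array of shape (n_windows, window_len, channels).
    """
    win = int(round(fs_hz * window_ms / 1000))
    step = int(round(fs_hz * step_ms / 1000))
    if win <= 0 or step <= 0:
        raise ValueError("window and step must cover at least one sample")
    n_samples, n_channels = signal.shape
    if n_samples < win:
        return np.empty((0, win, n_channels))
    starts = range(0, n_samples - win + 1, step)
    return np.stack([signal[s:s + win] for s in starts])


def squeeze_excite(x, w1, w2):
    """Generic squeeze-and-excitation gate on a (channels, length) map.

    Squeeze: average each channel over time.
    Excite: bottleneck dense layer with ReLU, then a dense layer with sigmoid.
    Scale: multiply every channel by its gate value in (0, 1).
    """
    squeezed = x.mean(axis=1)                    # (channels,)
    hidden = np.maximum(w1 @ squeezed, 0.0)      # (channels // r,)
    gate = 1.0 / (1.0 + np.exp(-(w2 @ hidden)))  # (channels,)
    return x * gate[:, None], gate


# Assumed setup: 10 s of 10-channel data at 100 Hz (synthetic, not real sEMG).
rng = np.random.default_rng(0)
recording = rng.standard_normal((1000, 10))

windows = sliding_windows(recording, fs_hz=100, window_ms=300, step_ms=100)

# Hypothetical untrained weights with reduction ratio r = 2.
w1 = rng.standard_normal((5, 10))
w2 = rng.standard_normal((10, 5))
reweighted, gate = squeeze_excite(windows[0].T, w1, w2)
```

Under these assumptions, a 300 ms window at 100 Hz covers only 30 samples per channel, which is why preserving channel-wise information matters for sparse recordings. In a trained network the gate weights would be learned rather than drawn at random.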
Pages: 4432-4443
Number of pages: 12