A simplified adversarial architecture for cross-subject silent speech recognition using electromyography

被引:0
|
作者
Cui, Qiang [1 ,2 ,3 ]
Zhang, Xingyu [1 ,2 ,3 ]
Zhang, Yakun [1 ,2 ,3 ]
Zheng, Changyan [1 ,4 ]
Xie, Liang [1 ,2 ,3 ]
Yan, Ye [1 ,2 ,3 ]
Wu, Edmond Q. [5 ,6 ,7 ]
Yin, Erwei [1 ,2 ,3 ]
机构
[1] Acad Mil Sci AMS, Def Innovat Inst, Beijing 100071, Peoples R China
[2] Intelligent Game & Decis Lab, Beijing 100071, Peoples R China
[3] Tianjin Artificial Intelligence Innovat Ctr TAI, Tianjin 300450, Peoples R China
[4] High Tech Inst, Weifang 261000, Peoples R China
[5] Shanghai Jiao Tong Univ, Dept Automat, Shanghai 200240, Peoples R China
[6] Shanghai Jiao Tong Univ, Key Lab Syst Control & Informat Proc, Minist Educ China, Shanghai 200240, Peoples R China
[7] Shanghai Jiao Tong Univ, Shanghai Engn Res Ctr Intelligent Control & Manage, Shanghai 200240, Peoples R China
基金
中国国家自然科学基金;
关键词
electromyography; silent speech recognition; cross-subject; nuclear-norm wasserstein discrepancy; domain adversarial learning; COMMUNICATION; ADAPTATION;
D O I
10.1088/1741-2552/ad7321
中图分类号
R318 [生物医学工程];
学科分类号
0831 ;
摘要
Objective. The decline in the performance of electromyography (EMG)-based silent speech recognition is widely attributed to disparities in speech patterns, articulation habits, and individual physiology among speakers. Feature alignment by learning a discriminative network that resolves domain offsets across speakers is an effective method to address this problem. The prevailing adversarial network with a branching discriminator specializing in domain discrimination renders insufficiently direct contribution to categorical predictions of the classifier. Approach. To this end, we propose a simplified discrepancy-based adversarial network with a streamlined end-to-end structure for EMG-based cross-subject silent speech recognition. Highly aligned features across subjects are obtained by introducing a Nuclear-norm Wasserstein discrepancy metric on the back end of the classification network, which could be utilized for both classification and domain discrimination. Given the low-level and implicitly noisy nature of myoelectric signals, we devise a cascaded adaptive rectification network as the front-end feature extraction network, adaptively reshaping the intermediate feature map with automatically learnable channel-wise thresholds. The resulting features effectively filter out domain-specific information between subjects while retaining domain-invariant features critical for cross-subject recognition. Main results. A series of sentence-level classification experiments with 100 Chinese sentences demonstrate the efficacy of our method, achieving an average accuracy of 89.46% tested on 40 new subjects by training with data from 60 subjects. Especially, our method achieves a remarkable 10.07% improvement compared to the state-of-the-art model when tested on 10 new subjects with 20 subjects employed for training, surpassing its result even with three times training subjects. Significance. Our study demonstrates an improved classification performance of the proposed adversarial architecture using cross-subject myoelectric signals, providing a promising prospect for EMG-based speech interactive application.
引用
收藏
页数:18
相关论文
共 50 条
  • [21] Cross-subject aesthetic preference recognition of Chinese dance posture using EEG
    Li, Jing
    Wu, Shen-rui
    Zhang, Xiang
    Luo, Tian-jian
    Li, Rui
    Zhao, Ying
    Liu, Bo
    Peng, Hua
    COGNITIVE NEURODYNAMICS, 2023, 17 (02) : 311 - 329
  • [22] Cross-subject aesthetic preference recognition of Chinese dance posture using EEG
    Jing Li
    Shen-rui Wu
    Xiang Zhang
    Tian-jian Luo
    Rui Li
    Ying Zhao
    Bo Liu
    Hua Peng
    Cognitive Neurodynamics, 2023, 17 : 311 - 329
  • [23] Cross-Subject emotion recognition from EEG using Convolutional Neural Networks
    Zhong, Xiaolong
    Yin, Zhong
    Zhang, Jianhua
    PROCEEDINGS OF THE 39TH CHINESE CONTROL CONFERENCE, 2020, : 7516 - 7521
  • [24] A Facial Electromyography Activity Detection Method in Silent Speech Recognition
    Cai, Huihui
    Zhang, Yakun
    Xie, Liang
    Yan, Huijiong
    Qin, Wei
    Yan, Ye
    Yin, Erwei
    Xu, Minpeng
    Ming, Dong
    2021 INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE BIG DATA AND INTELLIGENT SYSTEMS (HPBD&IS), 2021, : 246 - 251
  • [25] Domain Adversarial Neural Network with Reliable Pseudo-labels Iteration for cross-subject EEG emotion recognition
    Ju, Xiangyu
    Su, Jianpo
    Dai, Sheng
    Wu, Xu
    Li, Ming
    Hu, Dewen
    KNOWLEDGE-BASED SYSTEMS, 2025, 316
  • [26] Cross-Subject Multimodal Emotion Recognition Based on Hybrid Fusion
    Cimtay, Yucel
    Ekmekcioglu, Erhan
    Caglar-Ozhan, Seyma
    IEEE ACCESS, 2020, 8 : 168865 - 168878
  • [27] EEG-based Cross-subject Mental Fatigue Recognition
    Liu, Yisi
    Lan, Zirui
    Cui, Jian
    Sourina, Olga
    Muller-Wittig, Wolfgang
    2019 INTERNATIONAL CONFERENCE ON CYBERWORLDS (CW), 2019, : 247 - 252
  • [28] Multisource Transfer Learning for Cross-Subject EEG Emotion Recognition
    Li, Jinpeng
    Qiu, Shuang
    Shen, Yuan-Yuan
    Liu, Cheng-Lin
    He, Huiguang
    IEEE TRANSACTIONS ON CYBERNETICS, 2020, 50 (07) : 3281 - 3293
  • [29] An extended variational autoencoder for cross-subject electromyograph gesture recognition
    Zhang, Zhen
    Ming, Yuewei
    Shen, Quming
    Wang, Yanyu
    Zhang, Yuhui
    BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2025, 99
  • [30] On the Benefit of FMG and EMG Sensor Fusion for Gesture Recognition Using Cross-Subject Validation
    Rohr, Maurice
    Haidamous, Jad
    Schaefer, Niklas
    Schaumann, Stephan
    Latsch, Bastian
    Kupnik, Mario
    Antink, Christoph Hoog
    IEEE TRANSACTIONS ON NEURAL SYSTEMS AND REHABILITATION ENGINEERING, 2025, 33 : 935 - 944