A simplified adversarial architecture for cross-subject silent speech recognition using electromyography

Authors
Cui, Qiang [1, 2, 3]
Zhang, Xingyu [1, 2, 3]
Zhang, Yakun [1, 2, 3]
Zheng, Changyan [1, 4]
Xie, Liang [1, 2, 3]
Yan, Ye [1, 2, 3]
Wu, Edmond Q. [5, 6, 7]
Yin, Erwei [1, 2, 3]
Affiliations
[1] Acad Mil Sci AMS, Def Innovat Inst, Beijing 100071, Peoples R China
[2] Intelligent Game & Decis Lab, Beijing 100071, Peoples R China
[3] Tianjin Artificial Intelligence Innovat Ctr TAI, Tianjin 300450, Peoples R China
[4] High Tech Inst, Weifang 261000, Peoples R China
[5] Shanghai Jiao Tong Univ, Dept Automat, Shanghai 200240, Peoples R China
[6] Shanghai Jiao Tong Univ, Key Lab Syst Control & Informat Proc, Minist Educ China, Shanghai 200240, Peoples R China
[7] Shanghai Jiao Tong Univ, Shanghai Engn Res Ctr Intelligent Control & Manage, Shanghai 200240, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
electromyography; silent speech recognition; cross-subject; nuclear-norm Wasserstein discrepancy; domain adversarial learning; COMMUNICATION; ADAPTATION
DOI
10.1088/1741-2552/ad7321
Chinese Library Classification number
R318 [Biomedical Engineering]
Discipline classification code
0831
Abstract
Objective. The decline in the performance of electromyography (EMG)-based silent speech recognition is widely attributed to differences in speech patterns, articulation habits, and individual physiology among speakers. Aligning features by learning a discriminative network that resolves domain offsets across speakers is an effective way to address this problem. However, the prevailing adversarial network, whose branching discriminator specializes in domain discrimination, contributes only indirectly to the categorical predictions of the classifier. Approach. To this end, we propose a simplified discrepancy-based adversarial network with a streamlined end-to-end structure for EMG-based cross-subject silent speech recognition. Highly aligned features across subjects are obtained by introducing a Nuclear-norm Wasserstein discrepancy metric at the back end of the classification network, which can be used for both classification and domain discrimination. Given the low-level and implicitly noisy nature of myoelectric signals, we devise a cascaded adaptive rectification network as the front-end feature extraction network, which adaptively reshapes the intermediate feature map with automatically learnable channel-wise thresholds. The resulting features effectively filter out domain-specific information between subjects while retaining the domain-invariant features critical for cross-subject recognition. Main results. A series of sentence-level classification experiments with 100 Chinese sentences demonstrates the efficacy of our method, which achieves an average accuracy of 89.46% when tested on 40 new subjects after training with data from 60 subjects. In particular, our method achieves a 10.07% improvement over the state-of-the-art model when tested on 10 new subjects with 20 subjects used for training, surpassing its result even with three times as many training subjects. Significance. Our study demonstrates improved classification performance of the proposed adversarial architecture on cross-subject myoelectric signals, which is promising for EMG-based interactive speech applications.
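The two mechanisms named in the abstract can be illustrated with a minimal NumPy sketch. The abstract gives no formulas, so everything below is an assumption rather than the authors' implementation: the discrepancy is written in one common form (the batch-normalized nuclear norm of the source predictions minus that of the target predictions), and the channel-wise rectification is shown as plain soft thresholding with fixed thresholds, whereas the paper learns them. The function names `nuclear_norm_discrepancy` and `channel_soft_threshold` are hypothetical.

```python
import numpy as np


def softmax(logits):
    """Row-wise softmax with the usual max-subtraction for stability."""
    shifted = logits - logits.max(axis=1, keepdims=True)
    exp = np.exp(shifted)
    return exp / exp.sum(axis=1, keepdims=True)


def nuclear_norm_discrepancy(source_logits, target_logits):
    """Assumed form of a nuclear-norm discrepancy between two batches.

    Each input has shape (batch, classes). The nuclear norm (sum of
    singular values) of a prediction matrix is large when predictions are
    confident and spread over many classes, so the difference of the two
    normalized norms acts as a critic score computed from the classifier
    outputs themselves, with no separate discriminator branch.
    """
    p_source = softmax(source_logits)
    p_target = softmax(target_logits)
    nuc_source = np.linalg.norm(p_source, ord="nuc") / p_source.shape[0]
    nuc_target = np.linalg.norm(p_target, ord="nuc") / p_target.shape[0]
    return nuc_source - nuc_target


def channel_soft_threshold(features, thresholds):
    """Soft-threshold a (batch, channels, time) feature map per channel.

    thresholds has shape (channels,). Values whose magnitude is below the
    channel threshold are zeroed; the rest are shrunk toward zero. In the
    paper the thresholds are learned; here they are fixed inputs.
    """
    tau = thresholds.reshape(1, -1, 1)
    return np.sign(features) * np.maximum(np.abs(features) - tau, 0.0)


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    src = rng.normal(size=(8, 5))   # hypothetical source-subject logits
    tgt = rng.normal(size=(8, 5))   # hypothetical target-subject logits
    print("discrepancy:", nuclear_norm_discrepancy(src, tgt))

    feats = rng.normal(size=(2, 3, 4))
    print(channel_soft_threshold(feats, np.array([0.5, 1.0, 0.1])))
```

In an adversarial setup of this kind, the classifier would typically be trained to increase such a discrepancy while the feature extractor is trained to reduce it; how the paper schedules or weights these objectives is not stated in the abstract.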
Pages: 18