EMG-Based Cross-Subject Silent Speech Recognition Using Conditional Domain Adversarial Network

被引:7
|
作者
Zhang, Yakun [1 ,2 ]
Cai, Huihui [1 ,2 ,3 ]
Wu, Jinghan [2 ,3 ]
Xie, Liang [1 ,2 ]
Xu, Minpeng [3 ]
Ming, Dong [3 ]
Yan, Ye [1 ,2 ,3 ]
Yin, Erwei [1 ,2 ,3 ]
机构
[1] Acad Mil Sci, Natl Innovat Inst Def Technol, Beijing 100071, Peoples R China
[2] Tianjin Artificial Intelligence Innovat Ctr, Tianjin 300457, Peoples R China
[3] Tianjin Univ, Acad Med Engn & Translat Med, Tianjin 300072, Peoples R China
基金
中国国家自然科学基金;
关键词
Convolutional neural network; cross-subject; electromyography (EMG); machine learning; silent speech recognition (SSR); MYOELECTRIC SIGNALS; CLASSIFICATION; INTERFACES; MODEL;
D O I
10.1109/TCDS.2023.3316701
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Machine learning techniques have achieved great success in electromyography (EMG) decoding, but EMG-based cross-subject silent speech recognition (SSR) received less attention because of its high individual variability. Therefore, this article explores the field of cross-subject SSR to improve the recognition performance of EMG data collected from new subjects. First, this article reports on applying time-series features and 1-D convolutional neural networks (1D-CNNs) for cross-subject SSR. Second, this article proposes using a conditional domain adversarial network (CDAN) to solve the problem of reduced cross-subject SSR accuracy in the few samples' data sets. It innovatively integrates the maximum mean difference (MMD) loss to get an improved CDAN (ICDAN). While 1D-CNN is a feature extraction network that can meet the needs of cross-subject SSR in large data sets, the recognition effect will be weakened in small data sets. Adding an ICDAN network after the feature extraction network can improve the problem of data distribution differences between the two domains, and further enhance recognition performance. The results show that the 1D-CNN model based on time-series features yields better results in the SSR of new subjects, and the ICDAN model can further improve the classification accuracy of cross-subjects in a few sample data sets by 14.88%.
引用
收藏
页码:2282 / 2290
页数:9
相关论文
共 50 条
  • [41] Cross-subject transfer learning in human activity recognition systems using generative adversarial networks
    Soleimani, Elnaz
    Nazerfard, Ehsan
    NEUROCOMPUTING, 2021, 426 : 26 - 34
  • [42] Standardization-refinement domain adaptation method for cross-subject EEG-based classification in imagined speech recognition
    Jimenez-Guarneros, Magdiel
    Gomez-Gil, Pilar
    PATTERN RECOGNITION LETTERS, 2021, 141 : 54 - 60
  • [43] Dynamic Threshold Distribution Domain Adaptation Network: A Cross-Subject Fatigue Recognition Method Based on EEG Signals
    Ma, Chao
    Zhang, Meng
    Sun, Xinlin
    Wang, He
    Gao, Zhongke
    IEEE TRANSACTIONS ON COGNITIVE AND DEVELOPMENTAL SYSTEMS, 2024, 16 (01) : 190 - 201
  • [44] Unsupervised Cross-Lingual Speech Emotion Recognition Using Domain Adversarial Neural Network
    Cai, Xiong
    Wu, Zhiyong
    Zhong, Kuo
    Su, Bin
    Dai, Dongyang
    Meng, Helen
    2021 12TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING (ISCSLP), 2021,
  • [45] Cross-subject EEG Channel Optimization by Domain Adversarial Sparse Learning Model
    Wu, Zhenhua
    Zeng, Hong
    Zhao, Yue
    Li, Xiufeng
    Zhang, Jiaming
    Hattori, Motonobu
    2020 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE, 2020, : 1176 - 1179
  • [46] EMG-based speech recognition using hidden Markov models with global control variables
    Lee, Ki-Seung
    IEEE TRANSACTIONS ON BIOMEDICAL ENGINEERING, 2008, 55 (03) : 930 - 940
  • [47] Cross-Subject Emotion Recognition Based on Domain Similarity of EEG Signal Transfer Learning
    Ma, Yuliang
    Zhao, Weicheng
    Meng, Ming
    Zhang, Qizhong
    She, Qingshan
    Zhang, Jianhai
    IEEE TRANSACTIONS ON NEURAL SYSTEMS AND REHABILITATION ENGINEERING, 2023, 31 : 936 - 943
  • [48] Pattern Learning with Deep Neural Networks in EMG-based Speech Recognition
    Wand, Michael
    Schultz, Tanja
    2014 36TH ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY (EMBC), 2014, : 4200 - 4203
  • [49] EEG-based cross-subject emotion recognition using multi-source domain transfer learning
    Quan, Jie
    Li, Ying
    Wang, Lingyue
    He, Renjie
    Yang, Shuo
    Guo, Lei
    BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2023, 84
  • [50] Towards Real-life Application of EMG-based Speech Recognition by using Unsupervised Adaptation
    Wand, Michael
    Schultz, Tanja
    15TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2014), VOLS 1-4, 2014, : 1189 - 1193