Speaker-Adaptive Speech Recognition Based on Surface Electromyography

被引:0
|
作者
Wand, Michael [1 ]
Schultz, Tanja [1 ]
机构
[1] Univ Karlsruhe TH, Karlsruhe, Germany
关键词
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
We present our recent advances in silent speech interfaces using electromyographic signals that capture the movements of the human articulatory muscles at the skin surface for recognizing continuously spoken speech. Previous systems were limited to speaker- and session-dependent recognition tasks on small amounts of training and test data. In this article we present speaker-independent and speaker-adaptive training methods which allow us to use a large corpus of data from many speakers to train acoustic models more reliably. We use the speaker-dependent system as baseline, carefully tuning the data preprocessing and acoustic modeling. Then on our corpus we compare the performance of speaker-dependent and speaker-independent acoustic models and carry out model adaptation experiments.
引用
收藏
页码:271 / 285
页数:15
相关论文
共 50 条
  • [21] HMM-based distributed text-to-speech synthesis incorporating speaker-adaptive training
    Jeon, Kwang Myung
    Choi, Seung Ho
    International Journal of Multimedia and Ubiquitous Engineering, 2014, 9 (05): : 107 - 119
  • [22] Speaker-adaptive learning of resonance targets in a hidden trajectory model of speech coarticulation
    Yu, Dong
    Deng, Li
    Acero, Alex
    COMPUTER SPEECH AND LANGUAGE, 2007, 21 (01): : 72 - 87
  • [23] DNN-BASED SPEAKER-ADAPTIVE POSTFILTERING WITH LIMITED ADAPTATION DATA FOR STATISTICAL SPEECH SYNTHESIS SYSTEMS
    Ozturk, Mirac Goksu
    Ulusoy, Okan
    Demiroglu, Cenk
    2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 7030 - 7034
  • [24] Adaptive Speaker Normalization for CTC-Based Speech Recognition
    Ding, Penguin
    Guo, Wu
    Gu, Bin
    Ling, Zhenhua
    Du, Jun
    INTERSPEECH 2020, 2020, : 1266 - 1270
  • [25] DNN-HMM-Based Speaker-Adaptive Emotion Recognition Using MFCC and Epoch-Based Features
    Md. Shah Fahad
    Akshay Deepak
    Gayadhar Pradhan
    Jainath Yadav
    Circuits, Systems, and Signal Processing, 2021, 40 : 466 - 489
  • [26] Channel Reduction in Speech Recognition System based on Surface Electromyography
    Jong, Nida Sae
    Kiatweerasakul, Monthep
    Phukpattaranont, Pornchai
    2018 15TH INTERNATIONAL CONFERENCE ON ELECTRICAL ENGINEERING/ELECTRONICS, COMPUTER, TELECOMMUNICATIONS AND INFORMATION TECHNOLOGY (ECTI-CON), 2018, : 188 - 191
  • [27] DNN-HMM-Based Speaker-Adaptive Emotion Recognition Using MFCC and Epoch-Based Features
    Fahad, Md. Shah
    Deepak, Akshay
    Pradhan, Gayadhar
    Yadav, Jainath
    CIRCUITS SYSTEMS AND SIGNAL PROCESSING, 2021, 40 (01) : 466 - 489
  • [28] Fast Speaker Adaptive Training for Speech Recognition
    Povey, Daniel
    Kuo, Hong-Kwang J.
    Soltau, Hagen
    INTERSPEECH 2008: 9TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2008, VOLS 1-5, 2008, : 1245 - 1248
  • [29] A Robust Speaker-Adaptive and Text-Prompted Speaker Verification System
    Hong, Qingyang
    Wang, Sheng
    Liu, Zhijian
    BIOMETRIC RECOGNITION (CCBR 2014), 2014, 8833 : 385 - 393
  • [30] A robust speaker-adaptive and text-prompted speaker verification system
    Hong, Qingyang, 1600, Springer Verlag (8833):