Speaker-Adaptive Speech Recognition Based on Surface Electromyography

被引：0

作者：

Wand, Michael ^{[1
]}

Schultz, Tanja ^{[1
]}

机构：

[1] Univ Karlsruhe TH, Karlsruhe, Germany

来源：

BIOMEDICAL ENGINEERING SYSTEMS AND TECHNOLOGIES | 2010年 / 52卷

关键词：

D O I：

暂无

中图分类号：

TP301 [理论、方法];

学科分类号：

081202 ;

摘要：

We present our recent advances in silent speech interfaces using electromyographic signals that capture the movements of the human articulatory muscles at the skin surface for recognizing continuously spoken speech. Previous systems were limited to speaker- and session-dependent recognition tasks on small amounts of training and test data. In this article we present speaker-independent and speaker-adaptive training methods which allow us to use a large corpus of data from many speakers to train acoustic models more reliably. We use the speaker-dependent system as baseline, carefully tuning the data preprocessing and acoustic modeling. Then on our corpus we compare the performance of speaker-dependent and speaker-independent acoustic models and carry out model adaptation experiments.

引用

页码：271 / 285

页数：15

共 50 条

[21] HMM-based distributed text-to-speech synthesis incorporating speaker-adaptive training
Jeon, Kwang Myung
Choi, Seung Ho
International Journal of Multimedia and Ubiquitous Engineering, 2014, 9 (05): : 107 - 119
[22] Speaker-adaptive learning of resonance targets in a hidden trajectory model of speech coarticulation
Yu, Dong
Deng, Li
Acero, Alex
COMPUTER SPEECH AND LANGUAGE, 2007, 21 (01): : 72 - 87
[23] DNN-BASED SPEAKER-ADAPTIVE POSTFILTERING WITH LIMITED ADAPTATION DATA FOR STATISTICAL SPEECH SYNTHESIS SYSTEMS
Ozturk, Mirac Goksu
Ulusoy, Okan
Demiroglu, Cenk
2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 7030 - 7034
[24] Adaptive Speaker Normalization for CTC-Based Speech Recognition
Ding, Penguin
Guo, Wu
Gu, Bin
Ling, Zhenhua
Du, Jun
INTERSPEECH 2020, 2020, : 1266 - 1270
[25] DNN-HMM-Based Speaker-Adaptive Emotion Recognition Using MFCC and Epoch-Based Features
Md. Shah Fahad
Akshay Deepak
Gayadhar Pradhan
Jainath Yadav
Circuits, Systems, and Signal Processing, 2021, 40 : 466 - 489
[26] Channel Reduction in Speech Recognition System based on Surface Electromyography
Jong, Nida Sae
Kiatweerasakul, Monthep
Phukpattaranont, Pornchai
2018 15TH INTERNATIONAL CONFERENCE ON ELECTRICAL ENGINEERING/ELECTRONICS, COMPUTER, TELECOMMUNICATIONS AND INFORMATION TECHNOLOGY (ECTI-CON), 2018, : 188 - 191
[27] DNN-HMM-Based Speaker-Adaptive Emotion Recognition Using MFCC and Epoch-Based Features
Fahad, Md. Shah
Deepak, Akshay
Pradhan, Gayadhar
Yadav, Jainath
CIRCUITS SYSTEMS AND SIGNAL PROCESSING, 2021, 40 (01) : 466 - 489
[28] Fast Speaker Adaptive Training for Speech Recognition
Povey, Daniel
Kuo, Hong-Kwang J.
Soltau, Hagen
INTERSPEECH 2008: 9TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2008, VOLS 1-5, 2008, : 1245 - 1248
[29] A Robust Speaker-Adaptive and Text-Prompted Speaker Verification System
Hong, Qingyang
Wang, Sheng
Liu, Zhijian
BIOMETRIC RECOGNITION (CCBR 2014), 2014, 8833 : 385 - 393
[30] A robust speaker-adaptive and text-prompted speaker verification system
Hong, Qingyang, 1600, Springer Verlag (8833):

← 1 2 3 4 5 →