Integration of context-dependent durational knowledge into HMM-based speech recognition

被引：0

作者：

Wang, X

tenBosch, LFM

Pols, LCW

机构：

来源：

ICSLP 96 - FOURTH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, VOLS 1-4 | 1996年

关键词：

D O I：

暂无

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

This paper presents research on integrating context-dependent durational knowledge into HMM-based speech recognition. The first part of the paper presents work on obtaining relations between the parameters of the context-free HMMs and their durational behaviour, in preparation for the context-dependent durational modelling presented in the second part. Duration integration is realised via rescoring in the post-processing step of our N-best monophone recogniser. We use the multi-speaker TIMIT database for our analyses.

引用

页码：1073 / 1076

页数：4

共 50 条

[31] Incorporating the voicing information into HMM-based automatic speech recognition
Jancovic, Peter
Koekueer, Muenevver
2007 IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING, VOLS 1 AND 2, 2007, : 42 - 46
[32] A maximum model distance approach for HMM-based speech recognition
Kwong, S
He, QH
Man, KF
Tang, KS
PATTERN RECOGNITION, 1998, 31 (03) : 219 - 229
[33] On estimating robust probability distribution in HMM-based speech recognition
Samsung Advanced Inst of Technology
IEEE Trans Speech Audio Process, 4 (279-285):
[34] The use of acoustic contextual information in HMM-Based speech recognition
Choi, IJ
Lee, SY
IEEE SIGNAL PROCESSING LETTERS, 1998, 5 (05) : 108 - 110
[35] Prediction method of speech recognition performance based on HMM-based speech synthesis technique
Terashima R.
Yoshimura T.
Wakita T.
Tokuda K.
Kitamura T.
IEEJ Transactions on Electronics, Information and Systems, 2010, 130 (04) : 557 - 564+3
[36] Improving Eye Motion Sequence Recognition Using Electrooculography Based on Context-Dependent HMM
Fang, Fuming
Shinozaki, Takahiro
Horiuchi, Yasuo
Kuroiwa, Shingo
Furui, Sadaoki
Musha, Toshimitsu
COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE, 2016, 2016
[37] MDL-based context-dependent subword modeling for speech recognition
Shinoda, Koichi
Watanabe, Takao
Journal of the Acoustical Society of Japan (E) (English translation of Nippon Onkyo Gakkaishi), 2000, 21 (02): : 79 - 86
[38] A frame-based context-dependent acoustic modeling for speech recognition
Terashima R.
Zen H.
Nankaku Y.
Tokuda K.
IEEJ Transactions on Electronics, Information and Systems, 2010, 130 (10) : 1856 - 1864+24
[39] On the Use of Extended Context for HMM-based Spontaneous Conversational Speech Synthesis
Koriyama, Tomoki
Nose, Takashi
Kobayashi, Takao
12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5, 2011, : 2668 - 2671
[40] HMM-Based Emphatic Speech Synthesis Using Unsupervised Context Labeling
Maeno, Yu
Nose, Takashi
Kobayashi, Takao
Ijima, Yusuke
Nakajima, Hideharu
Mizuno, Hideyuki
Yoshioka, Osamu
12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5, 2011, : 1860 - +

← 1 2 3 4 5 →