Integration of context-dependent durational knowledge into HMM-based speech recognition

Authors
Wang, X
tenBosch, LFM
Pols, LCW
Affiliations
Keywords
DOI
Not available
Chinese Library Classification number
O42 [Acoustics];
Discipline classification code
070206 ; 082403 ;
Abstract
This paper presents research on integrating context-dependent durational knowledge into HMM-based speech recognition. The first part of the paper presents work on obtaining relations between the parameters of the context-free HMMs and their durational behaviour, in preparation for the context-dependent durational modelling presented in the second part. Duration integration is realised via rescoring in the post-processing step of our N-best monophone recogniser. We use the multi-speaker TIMIT database for our analyses.
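The two ideas named in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation; it only assumes two standard facts: a single HMM state with self-transition probability a has a geometric duration distribution with mean 1/(1 - a), and an N-best list can be re-ranked by adding a weighted duration log-score to each hypothesis score. The Gaussian duration model, the `duration_stats` table, the `weight` parameter, and all function names are hypothetical choices made for illustration.

```python
import math


def geometric_duration_logprob(d, self_loop):
    """Log-probability that one HMM state is occupied for exactly d frames.

    Assumes 0 < self_loop < 1; follows from P(d) = a^(d-1) * (1 - a).
    """
    if d < 1:
        return float("-inf")
    return (d - 1) * math.log(self_loop) + math.log(1.0 - self_loop)


def expected_duration(self_loop):
    """Mean state duration in frames implied by the self-loop probability."""
    return 1.0 / (1.0 - self_loop)


def gaussian_duration_logprob(d, mean, std):
    """Hypothetical explicit duration model: Gaussian log-density in frames."""
    z = (d - mean) / std
    return -0.5 * z * z - math.log(std * math.sqrt(2.0 * math.pi))


def rescore_nbest(hypotheses, duration_stats, weight=1.0):
    """Re-rank N-best hypotheses with an added duration score.

    hypotheses: list of (recogniser_logscore, [(phone, duration_frames), ...])
    duration_stats: dict mapping a phone label (or a phone-in-context key,
        an assumption of this sketch) to (mean, std) in frames.
    Returns (combined_score, segments) pairs sorted best first.
    """
    rescored = []
    for score, segments in hypotheses:
        dur_score = sum(
            gaussian_duration_logprob(d, *duration_stats[phone])
            for phone, d in segments
        )
        rescored.append((score + weight * dur_score, segments))
    rescored.sort(key=lambda item: item[0], reverse=True)
    return rescored


# Usage: the second hypothesis has a slightly worse recogniser score but a
# far more plausible duration, so it ranks first after rescoring.
stats = {"a": (10.0, 2.0)}
nbest = [(-100.0, [("a", 30)]), (-101.0, [("a", 10)])]
ranked = rescore_nbest(nbest, stats)
```

In a context-dependent variant, the keys of `duration_stats` would encode the phone together with whatever contextual factors are modelled; which factors those are is not stated in the abstract.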
Pages: 1073-1076
Number of pages: 4