On the Exploitation of Hidden Markov Models and Linear Dynamic Models in a Hybrid Decoder Architecture for Continuous Speech Recognition

被引:0
|
作者
Leutnant, Volker [1 ]
Haeb-Umbach, Reinhold [1 ]
机构
[1] Univ Paderborn, Dept Commun Engn, Paderborn, Germany
关键词
speech recognition; hybrid decoder architecture; acoustic modeling; linear dynamic models; ALGORITHM;
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Linear dynamic models (LDMs) have been shown to be a viable alternative to hidden MARKOV models (HMMs) on small-vocabulary recognition tasks, such as phone classification. In this paper we investigate various statistical model combination approaches for a hybrid HMM-LDM recognizer, resulting in a phone classification performance that outperforms the best individual classifier. Further, we report on continuous speech recognition experiments on the AURORA4 corpus, where the model combination is carried out on wordgraph rescoring. While the hybrid system improves the HMM system in the case of monophone HMMs, the performance of the triphone HMM model could not be improved by monophone LDMs, asking for the need to introduce context-dependency also in the LDM model inventory.
引用
收藏
页码:2946 / 2949
页数:4
相关论文
共 50 条
  • [21] Automatic speech recognition using hidden Markov models
    Botros, N.M.
    Teh, C.K.
    Microcomputer Applications, 1994, 13 (01): : 6 - 12
  • [22] Large margin hidden Markov models for speech recognition
    Jiang, Hui
    Li, Xinwei
    Liu, Chaojun
    IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2006, 14 (05): : 1584 - 1595
  • [23] Hidden-articulator Markov models for speech recognition
    Richardson, M
    Bilmes, J
    Diorio, C
    SPEECH COMMUNICATION, 2003, 41 (2-3) : 511 - 529
  • [24] Group Sparse Hidden Markov Models for Speech Recognition
    Chien, Jen-Tzung
    Chiang, Cheng-Chun
    13TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2012 (INTERSPEECH 2012), VOLS 1-3, 2012, : 2645 - 2648
  • [25] Fuzzy hidden Markov models for speech and speaker recognition
    Tran, D
    Wagner, M
    18TH INTERNATIONAL CONFERENCE OF THE NORTH AMERICAN FUZZY INFORMATION PROCESSING SOCIETY - NAFIPS, 1999, : 426 - 430
  • [26] Factor analysed hidden Markov models for speech recognition
    Rosti, AVI
    Gales, MJF
    COMPUTER SPEECH AND LANGUAGE, 2004, 18 (02): : 181 - 200
  • [27] Speech emotion recognition using hidden Markov models
    Nwe, TL
    Foo, SW
    De Silva, LC
    SPEECH COMMUNICATION, 2003, 41 (04) : 603 - 623
  • [28] BAYESIAN SENSING HIDDEN MARKOV MODELS FOR SPEECH RECOGNITION
    Saon, George
    Chien, Jen-Tzung
    2011 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2011, : 5056 - 5059
  • [29] IMPROVED HIDDEN MARKOV-MODELS FOR SPEECH RECOGNITION
    AUBERT, X
    BOURLARD, H
    KAMP, Y
    WELLEKENS, CJ
    PHILIPS JOURNAL OF RESEARCH, 1988, 43 (3-4) : 224 - 245
  • [30] REVISITING HIDDEN MARKOV MODELS FOR SPEECH EMOTION RECOGNITION
    Mao, Shuiyang
    Tao, Dehua
    Zhang, Guangyan
    Ching, P. C.
    Lee, Tan
    2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 6715 - 6719