Tone recognition for continuous Mandarin speech with limited training data using selected context-dependent hidden Markov models

被引:2
|
作者
Wang, Hsin-Min [1 ]
Lee, Lin-Shan [1 ]
机构
[1] Natl Taiwan Univ, Taipei, Taiwan
关键词
Markov processes - Mathematical models - Selection - Speech;
D O I
10.1080/02533839.1994.9677646
中图分类号
学科分类号
摘要
Mandarin Chinese is a tonal language, in which every syllable is assigned a tone that has a lexical meaning. Therefore tone recognition is very important for Mandarin speech. This paper presents a method for continuous speech tone recognition. Context-dependent discrete hidden Markov models (HMM's) are used taking into account the tones of the syllables on both sides, and special efforts were made in selecting the minimum number of key context-dependent models considering the characteristics of the tones. The results indicate that a total of 23 context-dependent models have very good potential to describe the complicated tone behavior for all 175 possible tone concatenation conditions in continuous speech, such that the required training data can be reduced to a minimum and the recognition process can be simplified significantly. The best achievable recognition rate is 83.55%.
引用
收藏
页码:775 / 784
相关论文
共 50 条
  • [31] MODELING HETEROGENEOUS DATA SOURCES FOR SPEECH RECOGNITION USING SYNCHRONOUS HIDDEN MARKOV MODELS
    Zhao, Yong
    Juang, Biing-Hwang
    2013 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2013, : 7403 - 7407
  • [32] Performing speech recognition on multiple parallel files using continuous hidden Markov models on an FPGA
    Melnikoff, SJ
    Quigley, SF
    Russell, MJ
    2002 IEEE INTERNATIONAL CONFERENCE ON FIELD-PROGRAMMABLE TECHNOLOGY (FPT), PROCEEDINGS, 2002, : 399 - 402
  • [33] Environment-independent continuous speech recognition using neural networks and hidden Markov models
    Yuk, DS
    Che, CW
    Jin, LM
    Lin, QG
    1996 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, CONFERENCE PROCEEDINGS, VOLS 1-6, 1996, : 3358 - 3361
  • [34] LARGE VOCABULARY CONTINUOUS SPEECH RECOGNITION WITH CONTEXT-DEPENDENT DBN-HMMS
    Dahl, George E.
    Yu, Dong
    Deng, Li
    Acero, Alex
    2011 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2011, : 4688 - 4691
  • [35] A speaker-adaptation technique for context-dependent models represented by hidden Markov networks
    Takami, JI
    Sagayama, S
    SYSTEMS AND COMPUTERS IN JAPAN, 1996, 27 (02) : 75 - 86
  • [36] Application of continuous state Hidden Markov Models to a classical problem in speech recognition
    Champion, Colin
    Houghton, S. M.
    COMPUTER SPEECH AND LANGUAGE, 2016, 36 : 347 - 364
  • [37] CONTINUOUS SPEECH RECOGNITION - HIDDEN MARKOV-MODELS VS THE CONNECTIONIST HOPE
    BOURLARD, HA
    TWENTY-THIRD ASILOMAR CONFERENCE ON SIGNALS, SYSTEMS & COMPUTERS, VOLS 1 AND 2: CONFERENCE RECORD, 1989, : 331 - 335
  • [38] Tone recognition in continuous Cantonese speech using supratone models
    Qian, Yao
    Lee, Tan
    Soong, Frank K.
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2007, 121 (05): : 2936 - 2945
  • [39] Discriminative training of hidden Markov models by multiobjective optimization for visual speech recognition
    Lee, JS
    Park, CH
    PROCEEDINGS OF THE INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), VOLS 1-5, 2005, : 2053 - 2058
  • [40] Large-margin discriminative training of hidden Markov models for speech recognition
    Yu, Dong
    Deng, Li
    ICSC 2007: INTERNATIONAL CONFERENCE ON SEMANTIC COMPUTING, PROCEEDINGS, 2007, : 429 - +