Tone recognition for continuous Mandarin speech with limited training data using selected context-dependent hidden Markov models

被引：2

作者：

Wang, Hsin-Min ^{[1
]}

Lee, Lin-Shan ^{[1
]}

机构：

[1] Natl Taiwan Univ, Taipei, Taiwan

来源：

Journal of the Chinese Institute of Engineers, Transactions of the Chinese Institute of Engineers,Series A/Chung-kuo Kung Ch'eng Hsuch K'an | 1994年 / 17卷 / 06期

关键词：

Markov processes - Mathematical models - Selection - Speech;

D O I：

10.1080/02533839.1994.9677646

中图分类号：

学科分类号：

摘要：

Mandarin Chinese is a tonal language, in which every syllable is assigned a tone that has a lexical meaning. Therefore tone recognition is very important for Mandarin speech. This paper presents a method for continuous speech tone recognition. Context-dependent discrete hidden Markov models (HMM's) are used taking into account the tones of the syllables on both sides, and special efforts were made in selecting the minimum number of key context-dependent models considering the characteristics of the tones. The results indicate that a total of 23 context-dependent models have very good potential to describe the complicated tone behavior for all 175 possible tone concatenation conditions in continuous speech, such that the required training data can be reduced to a minimum and the recognition process can be simplified significantly. The best achievable recognition rate is 83.55%.

引用

页码：775 / 784

共 50 条

[31] MODELING HETEROGENEOUS DATA SOURCES FOR SPEECH RECOGNITION USING SYNCHRONOUS HIDDEN MARKOV MODELS
Zhao, Yong
Juang, Biing-Hwang
2013 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2013, : 7403 - 7407
[32] Performing speech recognition on multiple parallel files using continuous hidden Markov models on an FPGA
Melnikoff, SJ
Quigley, SF
Russell, MJ
2002 IEEE INTERNATIONAL CONFERENCE ON FIELD-PROGRAMMABLE TECHNOLOGY (FPT), PROCEEDINGS, 2002, : 399 - 402
[33] Environment-independent continuous speech recognition using neural networks and hidden Markov models
Yuk, DS
Che, CW
Jin, LM
Lin, QG
1996 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, CONFERENCE PROCEEDINGS, VOLS 1-6, 1996, : 3358 - 3361
[34] LARGE VOCABULARY CONTINUOUS SPEECH RECOGNITION WITH CONTEXT-DEPENDENT DBN-HMMS
Dahl, George E.
Yu, Dong
Deng, Li
Acero, Alex
2011 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2011, : 4688 - 4691
[35] A speaker-adaptation technique for context-dependent models represented by hidden Markov networks
Takami, JI
Sagayama, S
SYSTEMS AND COMPUTERS IN JAPAN, 1996, 27 (02) : 75 - 86
[36] Application of continuous state Hidden Markov Models to a classical problem in speech recognition
Champion, Colin
Houghton, S. M.
COMPUTER SPEECH AND LANGUAGE, 2016, 36 : 347 - 364
[37] CONTINUOUS SPEECH RECOGNITION - HIDDEN MARKOV-MODELS VS THE CONNECTIONIST HOPE
BOURLARD, HA
TWENTY-THIRD ASILOMAR CONFERENCE ON SIGNALS, SYSTEMS & COMPUTERS, VOLS 1 AND 2: CONFERENCE RECORD, 1989, : 331 - 335
[38] Tone recognition in continuous Cantonese speech using supratone models
Qian, Yao
Lee, Tan
Soong, Frank K.
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2007, 121 (05): : 2936 - 2945
[39] Discriminative training of hidden Markov models by multiobjective optimization for visual speech recognition
Lee, JS
Park, CH
PROCEEDINGS OF THE INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), VOLS 1-5, 2005, : 2053 - 2058
[40] Large-margin discriminative training of hidden Markov models for speech recognition
Yu, Dong
Deng, Li
ICSC 2007: INTERNATIONAL CONFERENCE ON SEMANTIC COMPUTING, PROCEEDINGS, 2007, : 429 - +

← 1 2 3 4 5 →