Mandarin digit string speech recognition using linear discriminant analysis and tone discrimination model

Cited by: 0

Authors:

Shi, YP [1]

Liu, J [1]

Liu, RS [1]

Affiliation:

[1] Tsing Hua Univ, Dept Elect Engn, Beijing 100084, Peoples R China

Source:

2002 IEEE REGION 10 CONFERENCE ON COMPUTERS, COMMUNICATIONS, CONTROL AND POWER ENGINEERING, VOLS I-III, PROCEEDINGS | 2002

Keywords:

DOI:

10.1109/TENCON.2002.1181313

Chinese Library Classification number:

TP [Automation Technology, Computer Technology]

Discipline classification code:

0812

Abstract:

Acoustic models based on the conventional hidden Markov model do not achieve high recognition performance on connected Mandarin digit strings, because highly confusable syllables exist. State-specific linear discriminant analysis is adopted to reduce the substitution errors among confusable digits. The recognition rate for isolated digits increases from 97.16% to 99.32%, and the rate for digit strings of unknown length increases from 86.5% to 88.18%. Furthermore, because experiments show that most of the typical confusions can be discriminated by pitch contour patterns, tone discrimination models are trained, and a two-pass recognition algorithm is developed to combine the acoustic model likelihood with the tone discrimination model likelihood. Tone discrimination reduces the digit string error rate by a relative 37.4%. The recognition rate for digit strings of unknown length and the corresponding digit recognition rate increase from 88.18% and 97.54% to 92.6% and 98.21%, respectively.
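The relative error-rate reduction quoted in the abstract can be checked from the reported recognition rates. The following is a minimal sketch, assuming the usual definition (error rate = 100% minus recognition rate, relative reduction = drop in error divided by the original error); the function name `relative_error_reduction` is hypothetical, and the digit-level figure is derived here rather than stated in the abstract.

```python
def relative_error_reduction(rate_before: float, rate_after: float) -> float:
    """Relative error-rate reduction in percent, given two recognition rates in percent."""
    err_before = 100.0 - rate_before  # error rate before tone discrimination
    err_after = 100.0 - rate_after    # error rate after tone discrimination
    return (err_before - err_after) / err_before * 100.0


# String-level rates reported in the abstract: 88.18% -> 92.6%
string_rer = relative_error_reduction(88.18, 92.6)

# Digit-level rates reported in the abstract: 97.54% -> 98.21%
# (the resulting reduction is derived here, not reported in the abstract)
digit_rer = relative_error_reduction(97.54, 98.21)

print(f"string error reduction: {string_rer:.1f}%")  # 37.4%, matching the abstract
print(f"digit error reduction:  {digit_rer:.1f}%")
```

Under this definition the string-level error falls from 11.82% to 7.4%, which reproduces the 37.4% relative reduction given in the abstract.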

Pages: 461-464

Number of pages: 4
