Mandarin digit string speech recognition using linear discriminant analysis and tone discrimination model

被引:0
|
作者
Shi, YP [1 ]
Liu, J [1 ]
Liu, RS [1 ]
机构
[1] Tsing Hua Univ, Dept Elect Engn, Beijing 100084, Peoples R China
关键词
D O I
10.1109/TENCON.2002.1181313
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The acoustic models based oil the conventional Hidden Markov Model have not high recognition performance for the connected Mandarin digit string, because highly confusable syllables exist. The state-specific linear discriminant analysis is adopted to reduce the substitution errors of confusable digits. The recognition rate for the isolated digit is increased from 97.16% to 99.32%; and the unknown length digit string from 86.5% to 88.18%. Furthermore experiments showing that most of the typical confusions call be discriminated by the pitch contour patterns the tone discrimination models are trained and the two-pass recognition algorithm to combine the acoustic model likelihood and the tone discrimination model likelihood is developed. By tone discrimination the relative digit string error rate is reduced by 37.4%. The unknown length digit string recognition rate and its digit recognition rate are increased from 88.18% and 97.54% to 92.6% and 9821%, respectively.
引用
收藏
页码:461 / 464
页数:4
相关论文
共 50 条
  • [31] Comparison of Linear Discriminant Analysis Approaches in Automatic Speech Recognition
    Jakovljevic, N.
    Miskovic, D.
    Janev, M.
    Secujski, M.
    Delic, V.
    ELEKTRONIKA IR ELEKTROTECHNIKA, 2013, 19 (07) : 76 - 79
  • [32] ROBUST AUDIOVISUAL SPEECH RECOGNITION USING NOISE-ADAPTIVE LINEAR DISCRIMINANT ANALYSIS
    Zeiler, Steffen
    Nickel, Robert
    Ma, Ning
    Brown, Guy J.
    Kolossa, Dorothea
    2016 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING PROCEEDINGS, 2016, : 2797 - 2801
  • [33] MAXIMUM ENTROPY BASED TONE MODELING FOR MANDARIN SPEECH RECOGNITION
    Wang, Xinhao
    Yu, Yansuo
    Wu, Xihong
    Chi, Huisheng
    2010 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2010, : 4850 - 4853
  • [34] Speech and tone recognition for a Mandarin e-learning system
    Su, Wei
    Miao, Zhenjiang
    TENCON 2006 - 2006 IEEE REGION 10 CONFERENCE, VOLS 1-4, 2006, : 1248 - +
  • [35] Improved Tone Modeling for Mandarin Broadcast News Speech Recognition
    Lei, Xin
    Siu, Manhung
    Hwang, Mei-Yuh
    Ostendorf, Mari
    Lee, Tan
    INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5, 2006, : 1237 - +
  • [36] WORD-LEVEL TONE MODELING FOR MANDARIN SPEECH RECOGNITION
    Lei, Xin
    Ostendorf, Mari
    2007 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL IV, PTS 1-3, 2007, : 665 - +
  • [37] Tone recognition for Chinese speech: A comparative study of Mandarin and Cantonese
    Peng, G
    Zheng, HY
    Wang, WSY
    2004 INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, 2004, : 233 - 236
  • [38] Tone recognition of continuous Mandarin speech assisted with prosodic information
    1600, American Inst of Physics, Woodbury, NY, USA (96):
  • [39] Use formant trajectory to improve the performance of mandarin digit speech recognition
    Tsinghua Univ, Beijing, China
    Qinghua Daxue Xuebao, 9 (69-71):
  • [40] New neural network architecture with application in mandarin digit speech recognition
    Zhong, Lin
    Liu, Runsheng
    2000, (40):