Mandarin digit string speech recognition using linear discriminant analysis and tone discrimination model

被引:0
|
作者
Shi, YP [1 ]
Liu, J [1 ]
Liu, RS [1 ]
机构
[1] Tsing Hua Univ, Dept Elect Engn, Beijing 100084, Peoples R China
关键词
D O I
10.1109/TENCON.2002.1181313
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The acoustic models based oil the conventional Hidden Markov Model have not high recognition performance for the connected Mandarin digit string, because highly confusable syllables exist. The state-specific linear discriminant analysis is adopted to reduce the substitution errors of confusable digits. The recognition rate for the isolated digit is increased from 97.16% to 99.32%; and the unknown length digit string from 86.5% to 88.18%. Furthermore experiments showing that most of the typical confusions call be discriminated by the pitch contour patterns the tone discrimination models are trained and the two-pass recognition algorithm to combine the acoustic model likelihood and the tone discrimination model likelihood is developed. By tone discrimination the relative digit string error rate is reduced by 37.4%. The unknown length digit string recognition rate and its digit recognition rate are increased from 88.18% and 97.54% to 92.6% and 9821%, respectively.
引用
收藏
页码:461 / 464
页数:4
相关论文
共 50 条
  • [22] Tone articulation modeling for mandarin spontaneous speech recognition
    Zhou, JL
    Tian, Y
    Shi, Y
    Huang, C
    Chang, E
    2004 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL I, PROCEEDINGS: SPEECH PROCESSING, 2004, : 997 - 1000
  • [23] Pitch tracking and tone features for Mandarin speech recognition
    Huang, HCH
    Seide, F
    2000 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, PROCEEDINGS, VOLS I-VI, 2000, : 1523 - 1526
  • [24] A quantitative assessment of the importance of tone in mandarin speech recognition
    Ng, T
    Siu, MH
    Ostendorf, M
    IEEE SIGNAL PROCESSING LETTERS, 2005, 12 (12) : 867 - 870
  • [25] An Efficient Method for Removing Deletion Errors in Quickly-spoken Connected Mandarin Digit String Speech Recognition
    Guo, Chunyi
    Li, Runzhi
    Fan, Ming
    Liu, Kejun
    THIRD INTERNATIONAL SYMPOSIUM ON ELECTRONIC COMMERCE AND SECURITY WORKSHOPS (ISECS 2010), 2010, : 376 - 379
  • [26] Mandarin Speech Recognition Using Convolution Neural Network with Augmented Tone Features
    Hu, Xinhui
    Lu, Xugang
    Hori, Chiori
    2014 9TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING (ISCSLP), 2014, : 15 - 18
  • [27] Efficient decoding algorithms for Mandarin Connected Digit Speech Recognition
    Zhu, X
    Li, HS
    Lu, J
    Liu, RS
    PROCEEDINGS OF 2001 INTERNATIONAL SYMPOSIUM ON INTELLIGENT MULTIMEDIA, VIDEO AND SPEECH PROCESSING, 2001, : 555 - 558
  • [28] Improve the Implementation of Pitch Features for Mandarin Digit String Recognition Task
    Ding, Pei
    He, Liqiang
    13TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2012 (INTERSPEECH 2012), VOLS 1-3, 2012, : 2617 - 2620
  • [29] Probabilistic Linear Discriminant Analysis with Bottleneck Features for Speech Recognition
    Lu, Liang
    Renals, Steve
    15TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2014), VOLS 1-4, 2014, : 910 - 914
  • [30] Comparison of Linear Discriminant Analysis Approaches in Automatic Speech Recognition
    Jakovljevic, N.
    Miskovic, D.
    Janev, M.
    Secujski, M.
    Delic, V.
    ELEKTRONIKA IR ELEKTROTECHNIKA, 2013, 19 (07) : 76 - 79