Mandarin digit string speech recognition using linear discriminant analysis and tone discrimination model

被引:0
|
作者
Shi, YP [1 ]
Liu, J [1 ]
Liu, RS [1 ]
机构
[1] Tsing Hua Univ, Dept Elect Engn, Beijing 100084, Peoples R China
关键词
D O I
10.1109/TENCON.2002.1181313
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The acoustic models based oil the conventional Hidden Markov Model have not high recognition performance for the connected Mandarin digit string, because highly confusable syllables exist. The state-specific linear discriminant analysis is adopted to reduce the substitution errors of confusable digits. The recognition rate for the isolated digit is increased from 97.16% to 99.32%; and the unknown length digit string from 86.5% to 88.18%. Furthermore experiments showing that most of the typical confusions call be discriminated by the pitch contour patterns the tone discrimination models are trained and the two-pass recognition algorithm to combine the acoustic model likelihood and the tone discrimination model likelihood is developed. By tone discrimination the relative digit string error rate is reduced by 37.4%. The unknown length digit string recognition rate and its digit recognition rate are increased from 88.18% and 97.54% to 92.6% and 9821%, respectively.
引用
收藏
页码:461 / 464
页数:4
相关论文
共 50 条
  • [1] Discriminative HMM stream model for Mandarin digit string speech recognition
    Shi, YY
    Liu, J
    Liu, RS
    2002 6TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING PROCEEDINGS, VOLS I AND II, 2002, : 528 - 531
  • [2] Use tone detection to improve performance of mandarin digit speech recognition
    Tsinghua Univ, Beijing, China
    Qinghua Daxue Xuebao/Journal of Tsinghua University, 1998, 38 (09): : 36 - 39
  • [3] USING DURATION AND PITCH FOR MANDARIN DIGIT STRING RECOGNITION
    Zhao, Rui
    Kida, Yusuke
    Yan, Xiang
    Ding, Pei
    He, Lei
    2010 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2010, : 4846 - 4849
  • [4] Noisy Chinese digit string speech recognition based on tone modeling
    School of Computer Science and Technology, Harbin Institute of Technology, Harbin 150001, China
    不详
    Shengxue Xuebao, 2007, 5 (454-460):
  • [5] Mandarin Digit Recognition Assisted by Selective Tone Distinction
    Wang, Xiao-Dong
    Owa, Kunihiko
    Shozakai, Makoto
    11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 1-2, 2010, : 857 - 860
  • [6] High performance digit mandarin speech recognition
    Li, Husheng
    Liu, Jia
    Liu, Runsheng
    Qinghua Daxue Xuebao/Journal of Tsinghua University, 2000, 40 (01): : 32 - 34
  • [7] Tone Recognition of Mandarin Speech using BP Neural Network
    Xie, Zhaoqiang
    Miao, Zhenjiang
    Geng, Jie
    PROCEEDINGS OF 2010 INTERNATIONAL SYMPOSIUM ON IMAGE ANALYSIS AND SIGNAL PROCESSING, 2010, : 199 - 203
  • [8] Modified Linear Discriminant Analysis for speech recognition
    Li, Xiao-Bing
    O'Shaughnessy, Douglas
    2007 CANADIAN CONFERENCE ON ELECTRICAL AND COMPUTER ENGINEERING, VOLS 1-3, 2007, : 1598 - 1601
  • [9] Tone recognition of continuous Mandarin speech based on tone nucleus model and neural network
    Wang, Xiao-Dong
    Hirose, Keikichi
    Zhang, Jin-Song
    Minematsu, Nobuaki
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2008, E91D (06) : 1748 - 1755
  • [10] A TONE RECOGNITION FRAMEWORK FOR CONTINUOUS MANDARIN SPEECH
    He, Lei
    Hao, Jie
    INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5, 2006, : 1575 - 1578