High performance digit mandarin speech recognition

被引:0
|
作者
Li, Husheng
Liu, Jia
Liu, Runsheng
机构
关键词
Algorithms - Digital signal processing - Feature extraction;
D O I
暂无
中图分类号
学科分类号
摘要
High-performance mandarin digit speech recognition (MDSR) system is developed using MFCC (Mel frequency cepstrum coefficient) as the main parameter identifying the speech patterns. The formant trajectory and the nasal feature are extracted to identify confused words. A feature-based, real-time endpoint detection algorithm is proposed to reduce the system resource requirements and to improve the disturbance-proof ability. A two-stage recognition frame enhances discrimination by identifying candidate words in the first stage and confused word pairs in the second stage. These improvements result in a correct recognition rate of 98.8%.
引用
收藏
页码:32 / 34
相关论文
共 50 条
  • [31] Feature selection for emotion recognition of mandarin speech
    College of Computer Science and Technology, Zhejiang University, Hangzhou 310027, China
    不详
    Zhejiang Daxue Xuebao (Gongxue Ban), 2007, 11 (1816-1822):
  • [32] A simple statistical speech recognition of mandarin monosyllables
    Li, Tze Fen
    Chang, Shui-Ching
    Lee, Chung-Bow
    APPLIED MATHEMATICS AND COMPUTATION, 2006, 177 (02) : 644 - 651
  • [33] Distributed speech recognition of mandarin digits string
    Wang, Yih-Ru
    Lu, Bo-Xuan
    Liao, Yuan-Fu
    Chen, Sin-Horng
    CHINESE SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, 2006, 4274 : 370 - +
  • [34] A study on Mandarin broadcast news speech recognition
    Chen, CL
    Wang, YR
    Chen, SH
    2004 International Symposium on Chinese Spoken Language Processing, Proceedings, 2004, : 257 - 260
  • [35] Tone Modeling for Continuous Mandarin Speech Recognition
    Cao, Yang
    Zhang, Shuwu
    Huang, Taiyi
    Xu, Bo
    International Journal of Speech Technology, 2004, 7 (2-3) : 115 - 128
  • [36] Smoothed unit HMM in mandarin speech recognition
    He, Q
    Mao, SY
    Zhang, YW
    2000 5TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING PROCEEDINGS, VOLS I-III, 2000, : 792 - 795
  • [37] Pronunciation Modeling for Spontaneous Mandarin Speech Recognition
    Yi Liu
    Pascale Fung
    International Journal of Speech Technology, 2004, 7 (2-3) : 155 - 172
  • [38] Investigation on Mandarin Broadcast News Speech Recognition
    Hwang, Mei-Yuh
    Lei, Xin
    Wang, Wen
    Shinozaki, Takahiro
    INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5, 2006, : 1233 - +
  • [39] A TONE RECOGNITION FRAMEWORK FOR CONTINUOUS MANDARIN SPEECH
    He, Lei
    Hao, Jie
    INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5, 2006, : 1575 - 1578
  • [40] Progress on Mandarin conversational telephone speech recognition
    Hwang, MY
    Lei, X
    Ng, T
    Bulyko, I
    Ostendorf, M
    Stolcke, A
    Wang, W
    Zheng, J
    Gadde, VRR
    Graciarena, M
    Siu, MH
    Huang, Y
    2004 International Symposium on Chinese Spoken Language Processing, Proceedings, 2004, : 1 - 4