High performance digit mandarin speech recognition

被引：0

作者：

Li, Husheng

Liu, Jia

Liu, Runsheng

机构：

来源：

Qinghua Daxue Xuebao/Journal of Tsinghua University | 2000年 / 40卷 / 01期

关键词：

Algorithms - Digital signal processing - Feature extraction;

D O I：

暂无

中图分类号：

学科分类号：

摘要：

High-performance mandarin digit speech recognition (MDSR) system is developed using MFCC (Mel frequency cepstrum coefficient) as the main parameter identifying the speech patterns. The formant trajectory and the nasal feature are extracted to identify confused words. A feature-based, real-time endpoint detection algorithm is proposed to reduce the system resource requirements and to improve the disturbance-proof ability. A two-stage recognition frame enhances discrimination by identifying candidate words in the first stage and confused word pairs in the second stage. These improvements result in a correct recognition rate of 98.8%.

引用

页码：32 / 34

共 50 条

[31] Feature selection for emotion recognition of mandarin speech
College of Computer Science and Technology, Zhejiang University, Hangzhou 310027, China
不详
Zhejiang Daxue Xuebao (Gongxue Ban), 2007, 11 (1816-1822):
[32] A simple statistical speech recognition of mandarin monosyllables
Li, Tze Fen
Chang, Shui-Ching
Lee, Chung-Bow
APPLIED MATHEMATICS AND COMPUTATION, 2006, 177 (02) : 644 - 651
[33] Distributed speech recognition of mandarin digits string
Wang, Yih-Ru
Lu, Bo-Xuan
Liao, Yuan-Fu
Chen, Sin-Horng
CHINESE SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, 2006, 4274 : 370 - +
[34] A study on Mandarin broadcast news speech recognition
Chen, CL
Wang, YR
Chen, SH
2004 International Symposium on Chinese Spoken Language Processing, Proceedings, 2004, : 257 - 260
[35] Tone Modeling for Continuous Mandarin Speech Recognition
Cao, Yang
Zhang, Shuwu
Huang, Taiyi
Xu, Bo
International Journal of Speech Technology, 2004, 7 (2-3) : 115 - 128
[36] Smoothed unit HMM in mandarin speech recognition
He, Q
Mao, SY
Zhang, YW
2000 5TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING PROCEEDINGS, VOLS I-III, 2000, : 792 - 795
[37] Pronunciation Modeling for Spontaneous Mandarin Speech Recognition
Yi Liu
Pascale Fung
International Journal of Speech Technology, 2004, 7 (2-3) : 155 - 172
[38] Investigation on Mandarin Broadcast News Speech Recognition
Hwang, Mei-Yuh
Lei, Xin
Wang, Wen
Shinozaki, Takahiro
INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5, 2006, : 1233 - +
[39] A TONE RECOGNITION FRAMEWORK FOR CONTINUOUS MANDARIN SPEECH
He, Lei
Hao, Jie
INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5, 2006, : 1575 - 1578
[40] Progress on Mandarin conversational telephone speech recognition
Hwang, MY
Lei, X
Ng, T
Bulyko, I
Ostendorf, M
Stolcke, A
Wang, W
Zheng, J
Gadde, VRR
Graciarena, M
Siu, MH
Huang, Y
2004 International Symposium on Chinese Spoken Language Processing, Proceedings, 2004, : 1 - 4

← 1 2 3 4 5 →