Real-time speaker-dependent syllable recognition system of complete vocabulary of Chinese

被引：0

作者：

Chen, Tao ^{[1
]}

Li, Changli ^{[1
]}

Mo, Fuyuan ^{[1
]}

机构：

[1] Inst of Acoustics, Acad Sinica, Beijing, China

来源：

Shengxue Xuebao/Acta Acustica | 1993年 / 18卷 / 03期

关键词：

Real time systems;

D O I：

暂无

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

Based on a large number of speech experiments, Mandarin speech recognition approaches were thoroughly studied, and a real-time speaker-dependent all-syllable recognition system of Mandarin was developed on an IBM PC/AT microcomputer with a high-speed digital signal processing board TMS320C25-E. In accordance with the phonetic characteristics of Mandarin, the three-stage recognition strategy was adopted in this system. Experiments for the speech data of 4 times 1240 syllables show that, average correct rate of four tone recognition is about 99%, correct rates of the first 5 candidates of syllable recognition are 82%, 91%, 94%, 96%, and 97% respectively, and the whole system response time is less than 0.2 second. In addition, the Mandarin initials and finals confusion matrices, and the corresponding hierarchical clustering diagram of the similarity were obtained from the experiment results, and they were analyzed in comparison with the references so as to further improve the system performance.

引用

页码：161 / 171

共 50 条

[31] Large vocabulary isolated word recognition: A real-time implementation
Vicenzi, C.
Favareto, C.
Sciarra, D.
Carossino, A.
Colla, A.M.
Scagliola, C.
Pedrazzi, P.
IEE Proceedings I: Solid State and Electron Devices, 1989, 136 (02): : 127 - 132
[32] UNSUPERVISED VOCABULARY SELECTION FOR REAL-TIME SPEECH RECOGNITION OF LECTURES
Maergner, Paul
Waibel, Alex
Lane, Ian
2012 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2012, : 4417 - 4420
[33] LARGE VOCABULARY ISOLATED WORD RECOGNITION - A REAL-TIME IMPLEMENTATION
VICENZI, C
FAVARETO, C
SCIARRA, D
CAROSSINO, A
COLLA, AM
SCAGLIOLA, C
PEDRAZZI, P
IEE PROCEEDINGS-I COMMUNICATIONS SPEECH AND VISION, 1989, 136 (02): : 127 - 132
[34] Speaker-Dependent Live Quranic Verses Recitation Recognition System Using Sphinx-4 Framework
Hafeez, Aurish Hammad
Mohiuddin, Khawaja
Ahmed, Sohaib
17TH IEEE INTERNATIONAL MULTI TOPIC CONFERENCE 2014, 2014, : 333 - 337
[35] Complete automatic target recognition system for real-time human face images
Liu, HS
Wu, MX
Jin, GF
Cheng, G
He, QS
APPLICATIONS OF DIGITAL IMAGE PROCESSING XXI, 1998, 3460 : 803 - 810
[36] EVALUATION OF A SPEAKER-DEPENDENT RECOGNITION METRIC AS A SUBSTITUTE FOR HUMAN JUDGMENTS OF SPEECH QUALITY
WATSON, CS
KEWLEYPORT, D
MAKI, D
REED, D
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1987, 81 : S95 - S95
[37] Open vocabulary Chinese name recognition with the help of character description and syllable spelling recognition
Tsai, CH
Wang, NJC
Huang, P
Shen, JL
2005 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1-5: SPEECH PROCESSING, 2005, : 1037 - 1040
[38] CONNECTED-DIGIT SPEAKER-DEPENDENT SPEECH RECOGNITION USING A NEURAL NETWORK WITH TIME-DELAYED CONNECTIONS
UNNIKRISHNAN, KP
HOPFIELD, JJ
TANK, DW
IEEE TRANSACTIONS ON SIGNAL PROCESSING, 1991, 39 (03) : 698 - 713
[39] An Automatic Real Time Speech-Speaker Recognition System: A Real Time Approach
Kakade, Mandar Nitin
Salunke, D. B.
ICCCE 2019: PROCEEDINGS OF THE 2ND INTERNATIONAL CONFERENCE ON COMMUNICATIONS AND CYBER-PHYSICAL ENGINEERING, 2020, 570 : 151 - 158
[40] Real-time Emotions Recognition System
Silva, Vinicius
Soares, Filomena
Esteves, Joao S.
Figueiredo, Joana
Leao, Celina P.
Santos, Cristina
Pereira, Ana Paula
2016 8TH INTERNATIONAL CONGRESS ON ULTRA MODERN TELECOMMUNICATIONS AND CONTROL SYSTEMS AND WORKSHOPS (ICUMT), 2016, : 201 - 206

← 1 2 3 4 5 →