Real-time speaker-dependent syllable recognition system of complete vocabulary of Chinese

被引:0
|
作者
Chen, Tao [1 ]
Li, Changli [1 ]
Mo, Fuyuan [1 ]
机构
[1] Inst of Acoustics, Acad Sinica, Beijing, China
来源
Shengxue Xuebao/Acta Acustica | 1993年 / 18卷 / 03期
关键词
Real time systems;
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Based on a large number of speech experiments, Mandarin speech recognition approaches were thoroughly studied, and a real-time speaker-dependent all-syllable recognition system of Mandarin was developed on an IBM PC/AT microcomputer with a high-speed digital signal processing board TMS320C25-E. In accordance with the phonetic characteristics of Mandarin, the three-stage recognition strategy was adopted in this system. Experiments for the speech data of 4 times 1240 syllables show that, average correct rate of four tone recognition is about 99%, correct rates of the first 5 candidates of syllable recognition are 82%, 91%, 94%, 96%, and 97% respectively, and the whole system response time is less than 0.2 second. In addition, the Mandarin initials and finals confusion matrices, and the corresponding hierarchical clustering diagram of the similarity were obtained from the experiment results, and they were analyzed in comparison with the references so as to further improve the system performance.
引用
收藏
页码:161 / 171
相关论文
共 50 条
  • [31] Large vocabulary isolated word recognition: A real-time implementation
    Vicenzi, C.
    Favareto, C.
    Sciarra, D.
    Carossino, A.
    Colla, A.M.
    Scagliola, C.
    Pedrazzi, P.
    IEE Proceedings I: Solid State and Electron Devices, 1989, 136 (02): : 127 - 132
  • [32] UNSUPERVISED VOCABULARY SELECTION FOR REAL-TIME SPEECH RECOGNITION OF LECTURES
    Maergner, Paul
    Waibel, Alex
    Lane, Ian
    2012 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2012, : 4417 - 4420
  • [33] LARGE VOCABULARY ISOLATED WORD RECOGNITION - A REAL-TIME IMPLEMENTATION
    VICENZI, C
    FAVARETO, C
    SCIARRA, D
    CAROSSINO, A
    COLLA, AM
    SCAGLIOLA, C
    PEDRAZZI, P
    IEE PROCEEDINGS-I COMMUNICATIONS SPEECH AND VISION, 1989, 136 (02): : 127 - 132
  • [34] Speaker-Dependent Live Quranic Verses Recitation Recognition System Using Sphinx-4 Framework
    Hafeez, Aurish Hammad
    Mohiuddin, Khawaja
    Ahmed, Sohaib
    17TH IEEE INTERNATIONAL MULTI TOPIC CONFERENCE 2014, 2014, : 333 - 337
  • [35] Complete automatic target recognition system for real-time human face images
    Liu, HS
    Wu, MX
    Jin, GF
    Cheng, G
    He, QS
    APPLICATIONS OF DIGITAL IMAGE PROCESSING XXI, 1998, 3460 : 803 - 810
  • [36] EVALUATION OF A SPEAKER-DEPENDENT RECOGNITION METRIC AS A SUBSTITUTE FOR HUMAN JUDGMENTS OF SPEECH QUALITY
    WATSON, CS
    KEWLEYPORT, D
    MAKI, D
    REED, D
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1987, 81 : S95 - S95
  • [37] Open vocabulary Chinese name recognition with the help of character description and syllable spelling recognition
    Tsai, CH
    Wang, NJC
    Huang, P
    Shen, JL
    2005 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1-5: SPEECH PROCESSING, 2005, : 1037 - 1040
  • [38] CONNECTED-DIGIT SPEAKER-DEPENDENT SPEECH RECOGNITION USING A NEURAL NETWORK WITH TIME-DELAYED CONNECTIONS
    UNNIKRISHNAN, KP
    HOPFIELD, JJ
    TANK, DW
    IEEE TRANSACTIONS ON SIGNAL PROCESSING, 1991, 39 (03) : 698 - 713
  • [39] An Automatic Real Time Speech-Speaker Recognition System: A Real Time Approach
    Kakade, Mandar Nitin
    Salunke, D. B.
    ICCCE 2019: PROCEEDINGS OF THE 2ND INTERNATIONAL CONFERENCE ON COMMUNICATIONS AND CYBER-PHYSICAL ENGINEERING, 2020, 570 : 151 - 158
  • [40] Real-time Emotions Recognition System
    Silva, Vinicius
    Soares, Filomena
    Esteves, Joao S.
    Figueiredo, Joana
    Leao, Celina P.
    Santos, Cristina
    Pereira, Ana Paula
    2016 8TH INTERNATIONAL CONGRESS ON ULTRA MODERN TELECOMMUNICATIONS AND CONTROL SYSTEMS AND WORKSHOPS (ICUMT), 2016, : 201 - 206