Algorithms and Methods for the Automatic Speech Recognition in Spanish Language using Syllables

被引:0
|
作者
Oropeza Rodriguez, Jose Luis [1 ]
Suarez Guerra, Sergio [1 ]
机构
[1] IPN, Ctr Invest Comp, Av Juan de Dios Batiz S-N Esq, Mexico City 07738, DF, Mexico
来源
COMPUTACION Y SISTEMAS | 2006年 / 9卷 / 03期
关键词
Speech recognition; Syllables recognition; Expert System; Speech processing;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This work examines the results of incorporating into Automatic Speech Recognition the syllable units for the Spanish language. Because of the boundaries between phonemes-like units its often difficult to elicit them; the use of these has not reached a good performance in Automatic Speech Recognition. In the course of the developing the experiments three approaches for the segmentation task were examined: a) the using of the Short Term Total Energy Function, b) the Energy Function of the Cepstral High Frequency (named ERO parameter), and c) a Knowledge Based System. They represent the most important contributions of this work; they showed good results for the Continuous and Discontinuous speech corpus developed in laboratory. The Knowledge Based System and Short Term Total Energy Function were used in a digit corpus where the results achieved using Short Term Total Energy Function alone reached 90.58% recognition rate. When Short Term Total Energy Function and RO parameters were used a 94.70% recognition rate was achieved. Otherwise, in the continuous speech corpus created in the laboratory the results achieved a 78.5% recognition rate using Short Term Total Energy Function and Knowledge Based System, and 80.5% recognition rate using the three approaches mentioned above. The bigram model language and Continuous Density Hidden Markov Models with three and five states incorporating three Gaussian Mixtures for state were implemented. By further including a major number of digital filters and Artificial Intelligent techniques in the training and recognition stages respectively the results can be improved even more. This research showed the potential of the syllabic unit paradigm for the Automatic Speech Recognition for the Spanish language. Finally, the inference rules in the Knowledge Based System associated with rules for splitting words in syllables in the cited language were created.
引用
收藏
页码:270 / 286
页数:17
相关论文
共 50 条
  • [21] GEOGRAPHIC LANGUAGE MODELS FOR AUTOMATIC SPEECH RECOGNITION
    Xiao, Xiaoqiang
    Chen, Hong
    Zylak, Mark
    Sosa, Daniela
    Desu, Suma
    Krishnamoorthy, Mahesh
    Liu, Daben
    Paulik, Matthias
    Zhang, Yuchen
    2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2018, : 6124 - 6128
  • [22] LANGUAGE MODEL VERBALIZATION FOR AUTOMATIC SPEECH RECOGNITION
    Sak, Hasim
    Beaufays, Francoise
    Nakajima, Kaisuke
    Allauzen, Cyril
    2013 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2013, : 8262 - 8266
  • [23] Automatic emotional speech recognition in Serbian language
    Bojanic, Milana
    Delic, Vlado
    2013 21ST TELECOMMUNICATIONS FORUM (TELFOR), 2013, : 459 - 465
  • [24] Diagnostic assessment of childhood apraxia of speech using automatic speech recognition (ASR) methods
    Hosom, JP
    Shriberg, L
    Green, JR
    JOURNAL OF MEDICAL SPEECH-LANGUAGE PATHOLOGY, 2004, 12 (04) : 167 - 171
  • [25] Comparative Experiments to Evaluate the Use of Syllables for Large-Vocabulary Automatic Speech Recognition
    Tolba, Hesham
    Azmi, Mohamed
    2009 2ND IEEE INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND INFORMATION TECHNOLOGY, VOL 1, 2009, : 250 - +
  • [26] Automatic Language Identification Using Speech Rhythm Features for Multi-Lingual Speech Recognition
    Kim, Hwamin
    Park, Jeong-Sik
    APPLIED SCIENCES-BASEL, 2020, 10 (07):
  • [27] INFORMATION RETRIEVAL METHODS FOR AUTOMATIC SPEECH RECOGNITION
    Xiao, Xiaoqiang
    Droppo, Jasha
    Acero, Alex
    2010 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2010, : 5550 - 5553
  • [28] DYNAMIC ADJUSTMENT OF LANGUAGE MODELS FOR AUTOMATIC SPEECH RECOGNITION USING WORD SIMILARITY
    Currey, Anna
    Illina, Irina
    Fohr, Dominique
    2016 IEEE WORKSHOP ON SPOKEN LANGUAGE TECHNOLOGY (SLT 2016), 2016, : 426 - 432
  • [29] Automatic speech recognition for the Nepali language using CNN, bidirectional LSTM and ResNet
    Dhakal, Manish
    Chhetri, Arman
    Gupta, Aman Kumar
    Lamichhane, Prabin
    Pandey, Suraj
    Shakya, Subarna
    2022 INTERNATIONAL CONFERENCE ON INVENTIVE COMPUTATION TECHNOLOGIES, ICICT 2022, 2022, : 515 - 521
  • [30] Applications of automatic speech recognition to speech and language development in young children
    Russell, M
    Brown, C
    Skilling, A
    Series, R
    Wallace, J
    Bonham, B
    Barker, P
    ICSLP 96 - FOURTH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, VOLS 1-4, 1996, : 176 - 179