Algorithms and Methods for the Automatic Speech Recognition in Spanish Language using Syllables

被引:0
|
作者
Oropeza Rodriguez, Jose Luis [1 ]
Suarez Guerra, Sergio [1 ]
机构
[1] IPN, Ctr Invest Comp, Av Juan de Dios Batiz S-N Esq, Mexico City 07738, DF, Mexico
来源
COMPUTACION Y SISTEMAS | 2006年 / 9卷 / 03期
关键词
Speech recognition; Syllables recognition; Expert System; Speech processing;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This work examines the results of incorporating into Automatic Speech Recognition the syllable units for the Spanish language. Because of the boundaries between phonemes-like units its often difficult to elicit them; the use of these has not reached a good performance in Automatic Speech Recognition. In the course of the developing the experiments three approaches for the segmentation task were examined: a) the using of the Short Term Total Energy Function, b) the Energy Function of the Cepstral High Frequency (named ERO parameter), and c) a Knowledge Based System. They represent the most important contributions of this work; they showed good results for the Continuous and Discontinuous speech corpus developed in laboratory. The Knowledge Based System and Short Term Total Energy Function were used in a digit corpus where the results achieved using Short Term Total Energy Function alone reached 90.58% recognition rate. When Short Term Total Energy Function and RO parameters were used a 94.70% recognition rate was achieved. Otherwise, in the continuous speech corpus created in the laboratory the results achieved a 78.5% recognition rate using Short Term Total Energy Function and Knowledge Based System, and 80.5% recognition rate using the three approaches mentioned above. The bigram model language and Continuous Density Hidden Markov Models with three and five states incorporating three Gaussian Mixtures for state were implemented. By further including a major number of digital filters and Artificial Intelligent techniques in the training and recognition stages respectively the results can be improved even more. This research showed the potential of the syllabic unit paradigm for the Automatic Speech Recognition for the Spanish language. Finally, the inference rules in the Knowledge Based System associated with rules for splitting words in syllables in the cited language were created.
引用
收藏
页码:270 / 286
页数:17
相关论文
共 50 条
  • [31] Automatic speech recognition: A primer for speech-language pathology researchers
    Keshet, Joseph
    INTERNATIONAL JOURNAL OF SPEECH-LANGUAGE PATHOLOGY, 2018, 20 (06) : 599 - 609
  • [32] Automatic Speech Recognition for Supporting Endangered Language Documentation
    Prud'hommeaux, Emily
    Jimerson, Robbie
    Hatcher, Richard
    Michelson, Karin
    LANGUAGE DOCUMENTATION & CONSERVATION, 2021, 15 : 491 - 513
  • [33] A Decade of Discriminative Language Modeling for Automatic Speech Recognition
    Saraclar, Murat
    Dikici, Erinc
    Arisoy, Ebru
    SPEECH AND COMPUTER (SPECOM 2015), 2015, 9319 : 11 - 22
  • [34] An Evaluation of Structured Language Modeling for Automatic Speech Recognition
    Bjorklund, Johanna
    Cleophas, Loek
    Karlsson, My
    JOURNAL OF UNIVERSAL COMPUTER SCIENCE, 2017, 23 (11) : 1019 - 1034
  • [35] Skrybot - A System for Automatic Speech Recognition of Polish Language
    Pawlaczyk, Leslaw
    Bosky, Pawel
    MAN-MACHINE INTERACTIONS, 2009, 59 : 381 - +
  • [36] IMPROVING AUTOMATIC SPEECH RECOGNITION ROBUSTNESS FOR THE ROMANIAN LANGUAGE
    Buzo, Andi
    Cucu, Horia
    Burileanu, Corneliu
    19TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO-2011), 2011, : 2119 - 2122
  • [37] Automatic Speech Recognition of Isolated Words in Hindi Language
    Wani, Priyanka
    Bormane, D. S.
    Patil, U. G.
    Shirbahadurkar, S. D.
    2016 INTERNATIONAL CONFERENCE ON COMPUTING COMMUNICATION CONTROL AND AUTOMATION (ICCUBEA), 2016,
  • [38] Automatic recognition of second language speech-in-noise
    Kim, Seung-Eun
    Chernyak, Bronya R.
    Seleznova, Olga
    Keshet, Joseph
    Goldrick, Matthew
    Bradlow, Ann R.
    JASA EXPRESS LETTERS, 2024, 4 (02):
  • [39] On the Design of an Automatic Speech Recognition System for Romanian Language
    Caranica, Alexandru
    Cucu, Horia
    Buzo, Audi
    Burileanu, Corneliu
    CONTROL ENGINEERING AND APPLIED INFORMATICS, 2016, 18 (02): : 65 - 76
  • [40] Effect of Language Resources on Automatic Speech Recognition for Amharic
    Tachbelie, Martha Yifiru
    Abate, Solomon Teferra
    PROCEEDINGS OF THE 2015 12TH IEEE AFRICON INTERNATIONAL CONFERENCE - GREEN INNOVATION FOR AFRICAN RENAISSANCE (AFRICON), 2015,