Development of Language Models for Continuous Uzbek Speech Recognition System

被引:6
|
作者
Mukhamadiyev, Abdinabi [1 ]
Mukhiddinov, Mukhriddin [1 ]
Khujayarov, Ilyos [2 ]
Ochilov, Mannon [3 ]
Cho, Jinsoo [1 ]
机构
[1] Gachon Univ, Dept Comp Engn, Seongnam Si 13120, South Korea
[2] Tashkent Univ Informat Technol, Dept Informat Technol, Samarkand Branch, Tashkent 140100, Uzbekistan
[3] Tashkent Univ Informat Technol, Dept Artificial Intelligence, Tashkent 100200, Uzbekistan
基金
新加坡国家研究基金会;
关键词
language model; Uzbek speech; recurrent neural networks; automatic speech recognition; neural networks; character-based language models; word-based language models;
D O I
10.3390/s23031145
中图分类号
O65 [分析化学];
学科分类号
070302 ; 081704 ;
摘要
Automatic speech recognition systems with a large vocabulary and other natural language processing applications cannot operate without a language model. Most studies on pre-trained language models have focused on more popular languages such as English, Chinese, and various European languages, but there is no publicly available Uzbek speech dataset. Therefore, language models of low-resource languages need to be studied and created. The objective of this study is to address this limitation by developing a low-resource language model for the Uzbek language and understanding linguistic occurrences. We proposed the Uzbek language model named UzLM by examining the performance of statistical and neural-network-based language models that account for the unique features of the Uzbek language. Our Uzbek-specific linguistic representation allows us to construct more robust UzLM, utilizing 80 million words from various sources while using the same or fewer training words, as applied in previous studies. Roughly sixty-eight thousand different words and 15 million sentences were collected for the creation of this corpus. The experimental results of our tests on the continuous recognition of Uzbek speech show that, compared with manual encoding, the use of neural-network-based language models reduced the character error rate to 5.26%.
引用
收藏
页数:22
相关论文
共 50 条
  • [1] Development of integral model of speech recognition system for Uzbek language
    Musaev, Muhammadjon
    Khujayorov, Ilyos
    Ochilov, Mannon
    2020 IEEE 14TH INTERNATIONAL CONFERENCE ON APPLICATION OF INFORMATION AND COMMUNICATION TECHNOLOGIES (AICT2020), 2020,
  • [2] First Automatic Fongbe Continuous Speech Recognition System: Development of Acoustic Models and Language Models
    LAleye, Frejus A. A.
    Besacier, Laurent
    Ezin, Eugene C.
    Motamed, Cina
    PROCEEDINGS OF THE 2016 FEDERATED CONFERENCE ON COMPUTER SCIENCE AND INFORMATION SYSTEMS (FEDCSIS), 2016, 8 : 477 - 482
  • [3] Language Models for Tamil Speech Recognition System
    Saraswathi, S.
    Geetha, T. V.
    IETE TECHNICAL REVIEW, 2007, 24 (05) : 375 - 383
  • [4] SySRA: A System of a Continuous Speech Recognition in Arab Language
    Abdelhamid, Samir
    Bouguechal, Noureddine
    PROCEEDINGS OF WORLD ACADEMY OF SCIENCE, ENGINEERING AND TECHNOLOGY, VOL 11, 2006, 11 : 207 - +
  • [5] Building language models for Tamil speech recognition system
    Saraswathi, S
    Geetha, TV
    APPLIED COMPUTING, PROCEEDINGS, 2004, 3285 : 161 - 168
  • [6] A large vocabulary continuous speech recognition system for Persian language
    Hossein Sameti
    Hadi Veisi
    Mohammad Bahrani
    Bagher Babaali
    Khosro Hosseinzadeh
    EURASIP Journal on Audio, Speech, and Music Processing, 2011
  • [7] A large vocabulary continuous speech recognition system for Persian language
    Sameti, Hossein
    Veisi, Hadi
    Bahrani, Mohammad
    Babaali, Bagher
    Hosseinzadeh, Khosro
    EURASIP JOURNAL ON AUDIO SPEECH AND MUSIC PROCESSING, 2011, : 1 - 12
  • [8] CONTINUOUS SPEECH RECOGNITION OF KAZAKH LANGUAGE
    Mamyrbayev, Orken
    Turdalyuly, Mussa
    Mekebayev, Nurbapa
    Mukhsina, Kuralay
    Keylan, Alimukhan
    BabaAli, Bagher
    Nabieva, Gulnaz
    Duisenbayeva, Aigerim
    Akhmetov, Bekturgan
    AMCSE 2018 - INTERNATIONAL CONFERENCE ON APPLIED MATHEMATICS, COMPUTATIONAL SCIENCE AND SYSTEMS ENGINEERING, 2019, 24
  • [9] LSTM-Based Language Models for Very Large Vocabulary Continuous Russian Speech Recognition System
    Kipyatkova, Irina
    SPEECH AND COMPUTER, SPECOM 2019, 2019, 11658 : 219 - 226
  • [10] Automatic Speech Recognition Method Based on Deep Learning Approaches for Uzbek Language
    Mukhamadiyev, Abdinabi
    Khujayarov, Ilyos
    Djuraev, Oybek
    Cho, Jinsoo
    SENSORS, 2022, 22 (10)