Development of Language Models for Continuous Uzbek Speech Recognition System

Cited by: 6

Authors:

Mukhamadiyev, Abdinabi [1]

Mukhiddinov, Mukhriddin [1]

Khujayarov, Ilyos [2]

Ochilov, Mannon [3]

Cho, Jinsoo [1]

Affiliations:

[1] Gachon Univ, Dept Comp Engn, Seongnam Si 13120, South Korea

[2] Tashkent Univ Informat Technol, Dept Informat Technol, Samarkand Branch, Tashkent 140100, Uzbekistan

[3] Tashkent Univ Informat Technol, Dept Artificial Intelligence, Tashkent 100200, Uzbekistan

Source:

SENSORS | 2023 / Vol. 23 / Issue 03

Funding:

National Research Foundation, Singapore;

Keywords:

language model; Uzbek speech; recurrent neural networks; automatic speech recognition; neural networks; character-based language models; word-based language models;

DOI:

10.3390/s23031145

Chinese Library Classification (CLC) number:

O65 [Analytical Chemistry];

Discipline classification code:

070302 ; 081704 ;

Abstract:

Automatic speech recognition systems with a large vocabulary and other natural language processing applications cannot operate without a language model. Most studies on pre-trained language models have focused on more popular languages such as English, Chinese, and various European languages, but there is no publicly available Uzbek speech dataset. Therefore, language models of low-resource languages need to be studied and created. The objective of this study is to address this limitation by developing a low-resource language model for the Uzbek language and understanding linguistic occurrences. We proposed the Uzbek language model named UzLM by examining the performance of statistical and neural-network-based language models that account for the unique features of the Uzbek language. Our Uzbek-specific linguistic representation allows us to construct a more robust UzLM, utilizing 80 million words from various sources while using the same number of training words as previous studies, or fewer. Roughly sixty-eight thousand different words and 15 million sentences were collected for the creation of this corpus. The experimental results of our tests on the continuous recognition of Uzbek speech show that, compared with manual encoding, the use of neural-network-based language models reduced the character error rate to 5.26%.


Pages: 22

