Building CMU Sphinx language model for the Holy Quran using simplified Arabic phonemes

被引:11
|
作者
El Amrani, Mohamed Yassine [1 ,2 ]
Rahman, M. M. Hafizur [2 ]
Wahiddin, Mohamed Ridza [2 ]
Shah, Asadullah [2 ]
机构
[1] Jubail Univ Coll, Dept Comp Sci & Engn, Yanbu, Al Jubail, Saudi Arabia
[2] Int Islamic Univ Malaysia, Dept Comp Sci, Kulliah Informat Commun Technol, Kuala Lumpur, Selangor, Malaysia
关键词
Automatic speech recognition; Holy Quran recognition; Human voice;
D O I
10.1016/j.eij.2016.04.002
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper investigates the use of a simplified set of Arabic phonemes in an Arabic Speech Recognition system applied to Holy Quran. The CMU Sphinx 4 was used to train and evaluate a language model for the Hafs narration of the Holy Quran. The building of the language model was done using a simplified list of Arabic phonemes instead of the mainly used Romanized set in order to simplify the process of generating the language model. The experiments resulted in very low Word Error Rate (WER) reaching 1.5% while using a very small set of audio files during the training phase when using all the audio data for both the training and the testing phases. However, when using 90% and 80% of the training data, the WER obtained was respectively 50.0% and 55.7%. (C) 2016 Production and hosting by Elsevier B.V. on behalf of Faculty of Computers and Information, Cairo University. This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/).
引用
收藏
页码:305 / 314
页数:10
相关论文
共 46 条
  • [41] Using Rasch Model to Assess Self-Assessment Speaking Skill Rubric for Non-Native Arabic Language Speakers
    ParahitaAnandi, Rizki
    Zailaini, Muhammad Azhar
    PERTANIKA JOURNAL OF SOCIAL SCIENCE AND HUMANITIES, 2019, 27 (03): : 1469 - 1480
  • [42] Simplified Agile Software Project Selection Model Using Natural Language Processing Based Upon Agile Team Skills
    Sharma, Sarika
    Kumar, Deepak
    Fayad, M. E.
    PROCEEDINGS OF 2019 INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND KNOWLEDGE ECONOMY (ICCIKE' 2019), 2019, : 264 - 269
  • [43] Dynamic analysis of soil-structure interaction effects on NPP building using simplified and solid FE model of layered subsoil
    Kralik, Juraj
    Kralik, Juraj Jr Jr
    JOURNAL OF MEASUREMENTS IN ENGINEERING, 2019, 7 (01) : 12 - 19
  • [44] Building a Language Model for Local Coherence in Multi-document Summaries using a Discourse-enriched Entity-based Model
    Castro Jorge, Maria Lucia R.
    Dias, Marcio S.
    Pardo, Thiago A. S.
    2014 BRAZILIAN CONFERENCE ON INTELLIGENT SYSTEMS (BRACIS), 2014, : 44 - 49
  • [45] Novel approach for Arabic fake news classification using embedding from large language features with CNN-LSTM ensemble model and explainable AI
    Aboulola, Omar Ibrahim
    Umer, Muhammad
    SCIENTIFIC REPORTS, 2024, 14 (01):
  • [46] Validation of a FEM-program (frequency-domain) and a simplified RC-model (time-domain) for thermally activated building component systems (TABS) using measurement data
    Weber, T
    Jóhannesson, G
    Koschenz, M
    Lehmann, B
    Baumgartner, T
    ENERGY AND BUILDINGS, 2005, 37 (07) : 707 - 724