Building CMU Sphinx language model for the Holy Quran using simplified Arabic phonemes

被引:11
|
作者
El Amrani, Mohamed Yassine [1 ,2 ]
Rahman, M. M. Hafizur [2 ]
Wahiddin, Mohamed Ridza [2 ]
Shah, Asadullah [2 ]
机构
[1] Jubail Univ Coll, Dept Comp Sci & Engn, Yanbu, Al Jubail, Saudi Arabia
[2] Int Islamic Univ Malaysia, Dept Comp Sci, Kulliah Informat Commun Technol, Kuala Lumpur, Selangor, Malaysia
关键词
Automatic speech recognition; Holy Quran recognition; Human voice;
D O I
10.1016/j.eij.2016.04.002
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper investigates the use of a simplified set of Arabic phonemes in an Arabic Speech Recognition system applied to Holy Quran. The CMU Sphinx 4 was used to train and evaluate a language model for the Hafs narration of the Holy Quran. The building of the language model was done using a simplified list of Arabic phonemes instead of the mainly used Romanized set in order to simplify the process of generating the language model. The experiments resulted in very low Word Error Rate (WER) reaching 1.5% while using a very small set of audio files during the training phase when using all the audio data for both the training and the testing phases. However, when using 90% and 80% of the training data, the WER obtained was respectively 50.0% and 55.7%. (C) 2016 Production and hosting by Elsevier B.V. on behalf of Faculty of Computers and Information, Cairo University. This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/).
引用
收藏
页码:305 / 314
页数:10
相关论文
共 46 条
  • [21] Towards a Speech Recognizer for Multiple Languages Using Arabic Acoustic Model Application to Amazigh Language
    Sadiqui, Ali
    Zinedine, Ahmed
    ARABIC LANGUAGE PROCESSING: FROM THEORY TO PRACTICE, 2018, 782 : 67 - 78
  • [22] Simplified dynamic neural network model to predict heating load of a building using Taguchi method
    Sholahudin, S.
    Han, Hwataik
    ENERGY, 2016, 115 : 1672 - 1678
  • [23] Workshop: Developing Building Information Model Visualizations Using a Domain Specific Language
    Tauscher, Helga
    Scherer, Raimar J.
    ECAADE 2015: REAL TIME - EXTENDING THE REACH OF COMPUTATION, VOL 2, 2015, : 21 - 24
  • [24] Enhanced model for abstractive Arabic text summarization using natural language generation and named entity recognition
    Nada Essa
    M. M. El-Gayar
    Eman M. El-Daydamony
    Neural Computing and Applications, 2025, 37 (10) : 7279 - 7301
  • [25] A Real Time Arabic Sign Language Alphabets (ArSLA) Recognition Model Using Deep Learning Architecture
    Alsaadi, Zaran
    Alshamani, Easa
    Alrehaili, Mohammed
    Alrashdi, Abdulmajeed Ayesh D.
    Albelwi, Saleh
    Elfaki, Abdelrahman Osman
    COMPUTERS, 2022, 11 (05)
  • [26] Simplified mathematical model for analyzing the effects of urban heat island by using WRF and building thermal simulations
    Oropeza-Perez, Ivan
    PROCEEDINGS OF BUILDING SIMULATION 2019: 16TH CONFERENCE OF IBPSA, 2020, : 3334 - 3340
  • [27] Simplified building model for transient thermal performance estimation using GA-based parameter identification
    Wang, SW
    Xu, XH
    INTERNATIONAL JOURNAL OF THERMAL SCIENCES, 2006, 45 (04) : 419 - 432
  • [28] Assessing the accuracy of a simplified building energy simulation model using BESTEST: The case study of Brazilian regulation
    Melo, A. P.
    Costola, D.
    Lamberts, R.
    Hensen, J. L. M.
    ENERGY AND BUILDINGS, 2012, 45 : 219 - 228
  • [29] Building the Waray-waray Neural Language Model using Recurrent Neural Network
    Quiroz Jr, Fernando E.
    Sabinay, Chona B.
    Sabonsolin, Jeneffer A.
    MINDANAO JOURNAL OF SCIENCE AND TECHNOLOGY, 2023, 21 (01): : 48 - 58
  • [30] Enriching building function classification using Large Language Model embeddings of OpenStreetMap Tags
    Memduhoglu, Abdulkadir
    Fulman, Nir
    Zipf, Alexander
    EARTH SCIENCE INFORMATICS, 2024, 17 (06) : 5403 - 5418