The Development of Isolated Words Corpus of Pashto for the Automatic Speech Recognition Research

被引:0
|
作者
Ahmed, Irfan [1 ]
Ahmad, Nasir [2 ]
Ali, Hazrat [1 ]
Ahmad, Gulzar [1 ]
机构
[1] Univ Engn & Technol, Dept Elect Engn, Peshawar, Pakistan
[2] Univ Engn & Technol, Dept Comp Syst Engn, Peshawar, Pakistan
关键词
Automatic Speech Recognition; Pashto Speech Corpus; Human Computer Interaction;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The availability of standard speech database is of paramount importance in the automatic speech recognition (ASR) research in the context of providing a baseline for comparing the performance of automatic speech recognition approaches. This paper presents the development of a Medium-Vocabulary Speech Corpus for Pashto language. The vocabulary encompasses 161 isolated words of Pashto language, consisting of most frequently used words of Pashto language, names of the days of the week and digits from 0 to 25. The words were uttered by 30 speakers of different ages and genders, including both native and non-native speakers of Pashto language. Recording of the corpus was performed in a noise free office environment. The Corpus developed is then used for the development of an automatic speech recognition system for Pashto language.
引用
收藏
页码:139 / 143
页数:5
相关论文
共 50 条
  • [21] Efficient feature extraction and classification for the development of Pashto speech recognition system
    Ahmed, Irfan
    Irfan, Muhammad Abeer
    Iqbal, Abid
    Khalil, Amaad
    Siddiqui, Salman Ilahi
    MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 83 (18) : 54081 - 54096
  • [22] Linear Discriminant Analysis Based Approach for Automatic Speech Recognition of Urdu Isolated Words
    Ali, Hazrat
    Ahmad, Nasir
    Zhou, Xianwei
    Ali, Muhammad
    Manjotho, Ali Asghar
    COMMUNICATION TECHNOLOGIES, INFORMATION SECURITY AND SUSTAINABLE DEVELOPMENT, 2014, 414 : 24 - 34
  • [23] Efficient feature extraction and classification for the development of Pashto speech recognition system
    Irfan Ahmed
    Muhammad Abeer Irfan
    Abid Iqbal
    Amaad Khalil
    Salman Ilahi Siddiqui
    Multimedia Tools and Applications, 2024, 83 : 54081 - 54096
  • [24] INFLUENCE OF THE SURROUNDINGS ON THE AUTOMATIC RECOGNITION OF ISOLATED WORDS
    HIRSCH, HG
    ACUSTICA, 1988, 66 (04): : 197 - 202
  • [25] Using Automatic Speech Recognition in Spoken Corpus Curation
    Gorisch, Jan
    Gref, Michael
    Schmidt, Thomas
    PROCEEDINGS OF THE 12TH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION (LREC 2020), 2020, : 6423 - 6428
  • [26] Automatic Speech Recognition in Sanskrit: A New Speech Corpus and Modelling Insights
    Adiga, Devaraja
    Kumar, Rishabh
    Krishna, Amrith
    Jyothi, Preethi
    Ramakrishnan, Ganesh
    Goyal, Pawan
    Findings of the Association for Computational Linguistics: ACL-IJCNLP 2021, 2021, : 5039 - 5050
  • [27] Towards a Continuous Speech Corpus for Banking Domain Automatic Speech Recognition
    Suciu, George
    Toma, Stefan-Adrian
    Cheyeresan, Romulus
    2017 INTERNATIONAL CONFERENCE ON SPEECH TECHNOLOGY AND HUMAN-COMPUTER DIALOGUE (SPED), 2017,
  • [28] Automatic Speech Recognition in Sanskrit: A New Speech Corpus and Modelling Insights
    Adiga, Devaraja
    Kumar, Rishabh
    Krishna, Amrith
    Jyothi, Preethi
    Ramakrishnan, Ganesh
    Goyal, Pawan
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, ACL-IJCNLP 2021, 2021, : 5039 - 5050
  • [29] PASHTO SPEECH RECOGNITION WITH LIMITED PRONUNCIATION LEXICON
    Prasad, Rohit
    Tsakalidis, Stavros
    Bulyko, Ivan
    Kao, Chia-lin
    Natarajan, Prem
    2010 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2010, : 5086 - 5089
  • [30] Realization of Isolated-words Speech Recognition System
    Ren Wenxia
    Zhang Huili
    Lv Wenzhe
    PROCEEDINGS OF THE 2009 PACIFIC-ASIA CONFERENCE ON CIRCUITS, COMMUNICATIONS AND SYSTEM, 2009, : 353 - 355