The Development of Isolated Words Corpus of Pashto for the Automatic Speech Recognition Research

被引:0
|
作者
Ahmed, Irfan [1 ]
Ahmad, Nasir [2 ]
Ali, Hazrat [1 ]
Ahmad, Gulzar [1 ]
机构
[1] Univ Engn & Technol, Dept Elect Engn, Peshawar, Pakistan
[2] Univ Engn & Technol, Dept Comp Syst Engn, Peshawar, Pakistan
关键词
Automatic Speech Recognition; Pashto Speech Corpus; Human Computer Interaction;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The availability of standard speech database is of paramount importance in the automatic speech recognition (ASR) research in the context of providing a baseline for comparing the performance of automatic speech recognition approaches. This paper presents the development of a Medium-Vocabulary Speech Corpus for Pashto language. The vocabulary encompasses 161 isolated words of Pashto language, consisting of most frequently used words of Pashto language, names of the days of the week and digits from 0 to 25. The words were uttered by 30 speakers of different ages and genders, including both native and non-native speakers of Pashto language. Recording of the corpus was performed in a noise free office environment. The Corpus developed is then used for the development of an automatic speech recognition system for Pashto language.
引用
收藏
页码:139 / 143
页数:5
相关论文
共 50 条
  • [1] The Development of Isolated Words Pashto Automatic Speech Recognition System
    Ahmed, Irfan
    Ahmad, Nasir
    Ali, Hazrat
    Ahmad, Gulzar
    PROCEEDINGS OF THE 18TH INTERNATIONAL CONFERENCE ON AUTOMATION AND COMPUTING (ICAC 12), 2012, : 333 - 336
  • [2] Chhattisgarhi speech corpus for research and development in automatic speech recognition
    Londhe, Narendra D.
    Kshirsagar, Ghanahshyam B.
    INTERNATIONAL JOURNAL OF SPEECH TECHNOLOGY, 2018, 21 (02) : 193 - 210
  • [3] Pashto Spoken Digits Database for the Automatic Speech Recognition Research
    Abbas, Arbab Waseem
    Ahmad, Nasir
    Ali, Hazrat
    PROCEEDINGS OF THE 18TH INTERNATIONAL CONFERENCE ON AUTOMATION AND COMPUTING (ICAC 12), 2012, : 348 - 351
  • [4] Automatic Speech Recognition of Isolated Words in Hindi Language
    Wani, Priyanka
    Bormane, D. S.
    Patil, U. G.
    Shirbahadurkar, S. D.
    2016 INTERNATIONAL CONFERENCE ON COMPUTING COMMUNICATION CONTROL AND AUTOMATION (ICCUBEA), 2016,
  • [5] Bangladeshi Bangla speech corpus for automatic speech recognition research
    Kibria, Shafkat
    Samin, Ahnaf Mozib
    Kobir, M. Humayon
    Rahman, M. Shahidur
    Selim, M. Reza
    Iqbal, M. Zafar
    SPEECH COMMUNICATION, 2022, 136 : 84 - 97
  • [6] Bangladeshi Bangla speech corpus for automatic speech recognition research
    Kibria, Shafkat
    Samin, Ahnaf Mozib
    Kobir, M. Humayon
    Rahman, M. Shahidur
    Selim, M. Reza
    Iqbal, M. Zafar
    Speech Communication, 2022, 136 : 84 - 97
  • [7] Database development and automatic speech recognition of isolated Pashto spoken digits using MFCC and K-NN
    Ali, Zakir
    Abbas, Arbab
    Thasleema, T.
    Uddin, Burhan
    Raaz, Tanzeela
    Abid, Sahibzada
    INTERNATIONAL JOURNAL OF SPEECH TECHNOLOGY, 2015, 18 (02) : 271 - 275
  • [8] Corpus for automatic speech recognition
    Adda-Decker, Martine
    REVUE FRANCAISE DE LINGUISTIQUE APPLIQUEE, 2007, 12 (01): : 71 - 84
  • [9] On speech recognition of isolated words
    Teh, CC
    Jong, CC
    Siek, L
    ISIC-99: 8TH INTERNATIONAL SYMPOSIUM ON INTEGRATED CIRCUITS, DEVICES & SYSTEMS, PROCEEDINGS, 1999, : 431 - 434
  • [10] Automatic Speech Recognition of Isolated Words in Hindi Language using MFCC
    Patil, U. G.
    Shirbahadurkar, S. D.
    Paithane, A. N.
    2016 INTERNATIONAL CONFERENCE ON COMPUTING, ANALYTICS AND SECURITY TRENDS (CAST), 2016, : 433 - 438