The Development of Isolated Words Corpus of Pashto for the Automatic Speech Recognition Research

被引:0
|
作者
Ahmed, Irfan [1 ]
Ahmad, Nasir [2 ]
Ali, Hazrat [1 ]
Ahmad, Gulzar [1 ]
机构
[1] Univ Engn & Technol, Dept Elect Engn, Peshawar, Pakistan
[2] Univ Engn & Technol, Dept Comp Syst Engn, Peshawar, Pakistan
关键词
Automatic Speech Recognition; Pashto Speech Corpus; Human Computer Interaction;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The availability of standard speech database is of paramount importance in the automatic speech recognition (ASR) research in the context of providing a baseline for comparing the performance of automatic speech recognition approaches. This paper presents the development of a Medium-Vocabulary Speech Corpus for Pashto language. The vocabulary encompasses 161 isolated words of Pashto language, consisting of most frequently used words of Pashto language, names of the days of the week and digits from 0 to 25. The words were uttered by 30 speakers of different ages and genders, including both native and non-native speakers of Pashto language. Recording of the corpus was performed in a noise free office environment. The Corpus developed is then used for the development of an automatic speech recognition system for Pashto language.
引用
收藏
页码:139 / 143
页数:5
相关论文
共 50 条
  • [41] A speech corpus of Quechua Collao for automatic dimensional emotion recognition
    Paccotacya-Yanque, Rosa Y. G.
    Huanca-Anquise, Candy A.
    Escalante-Calcina, Judith
    Ramos-Lovon, Wilber R.
    Cuno-Parari, Alvaro E.
    SCIENTIFIC DATA, 2022, 9 (01)
  • [42] A speech corpus of Quechua Collao for automatic dimensional emotion recognition
    Rosa Y. G. Paccotacya-Yanque
    Candy A. Huanca-Anquise
    Judith Escalante-Calcina
    Wilber R. Ramos-Lovón
    Álvaro E. Cuno-Parari
    Scientific Data, 9
  • [43] The Hardware Accelerator of The Automatic Speech Recognition for The Continuous Korean Words
    Kim, Juyeob
    Kim, Yunjoo
    Kim, Wonjong
    Lee, Joohyun
    2015 INTERNATIONAL SOC DESIGN CONFERENCE (ISOCC), 2015, : 213 - 214
  • [44] Automatic detection of accent nuclei at the head of words for speech recognition
    Minematsu, N
    Nakagawa, S
    ICSLP 96 - FOURTH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, VOLS 1-4, 1996, : 1620 - 1623
  • [45] Automatic Recognition of Target Words in Infant-Directed Speech
    van der Klis, Anika
    Adriaans, Frans
    Han, Mengru
    Kager, Rene
    COMPANION PUBLICATON OF THE 2020 INTERNATIONAL CONFERENCE ON MULTIMODAL INTERACTION (ICMI '20 COMPANION), 2020, : 522 - 522
  • [46] The BioVisualSpeech Corpus of Words with Sibilants for Speech Therapy Games Development
    Cavaco, Sofia
    Guimaraes, Isabel
    Ascensao, Mariana
    Abad, Alberto
    Anjos, Ivo
    Oliveira, Francisco
    Martins, Sofia
    Marques, Nuno
    Eskenazi, Maxine
    Magalhaes, Joao
    Grilo, Margarida
    INFORMATION, 2020, 11 (10) : 1 - 18
  • [47] PaSCoNT - Parallel Speech Corpus of Northern-central Thai for automatic speech recognition
    Taerungruang, Supawat
    Taninpong, Phimphaka
    Chunwijitra, Vataya
    Thatphithakkul, Sumonmas
    Kasuriya, Sawit
    Inthanon, Viroj
    Paksaranuwat, Pawat
    Thumronglaohapun, Salinee
    Nakharutai, Nawapon
    Inkeaw, Papangkorn
    Bootkrajang, Jakramate
    COMPUTER SPEECH AND LANGUAGE, 2025, 89
  • [48] Speech corpus recycling for acoustic cross-domain environments for automatic speech recognition
    Ichikawa, Osamu
    Rennie, Steven J.
    Fukuda, Takashi
    Willett, Daniel
    ACOUSTICAL SCIENCE AND TECHNOLOGY, 2016, 37 (02) : 55 - 65
  • [49] RODIGITS - A ROMANIAN CONNECTED-DIGITS SPEECH CORPUS FOR AUTOMATIC SPEECH AND SPEAKER RECOGNITION
    Georgescu, Alexandru Lucian
    Caranica, Alexandru
    Cucu, Horia
    Burileanu, Corneliu
    UNIVERSITY POLITEHNICA OF BUCHAREST SCIENTIFIC BULLETIN SERIES C-ELECTRICAL ENGINEERING AND COMPUTER SCIENCE, 2018, 80 (03): : 45 - 62
  • [50] ALGERIAN ARABIC SPEECH DATABASE (ALGASD): CORPUS DESIGN AND AUTOMATIC SPEECH RECOGNITION APPLICATION
    Droua-Hamdani, Ghania
    Selouani, Sid Ahmed
    Boudraa, Malika
    ARABIAN JOURNAL FOR SCIENCE AND ENGINEERING, 2010, 35 (2C): : 157 - 166