The Development of Isolated Words Corpus of Pashto for the Automatic Speech Recognition Research

Cited by: 0
Authors
Ahmed, Irfan [1]
Ahmad, Nasir [2]
Ali, Hazrat [1]
Ahmad, Gulzar [1]
Affiliations
[1] Univ Engn & Technol, Dept Elect Engn, Peshawar, Pakistan
[2] Univ Engn & Technol, Dept Comp Syst Engn, Peshawar, Pakistan
Keywords
Automatic Speech Recognition; Pashto Speech Corpus; Human Computer Interaction;
DOI
Not available
Chinese Library Classification (CLC) number
TP18 [Artificial Intelligence Theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
The availability of a standard speech database is of paramount importance in automatic speech recognition (ASR) research, because it provides a baseline for comparing the performance of ASR approaches. This paper presents the development of a medium-vocabulary speech corpus for the Pashto language. The vocabulary comprises 161 isolated Pashto words: the most frequently used words of the language, the names of the days of the week, and the numbers from 0 to 25. The words were uttered by 30 speakers of different ages and genders, including both native and non-native speakers of Pashto. The corpus was recorded in a noise-free office environment. The corpus is then used to develop an automatic speech recognition system for Pashto.
Pages: 139-143
Number of pages: 5
Related papers
50 in total
  • [31] DEVELOPMENT OF A MULTILINGUAL ISOLATED DIGITS SPEECH CORPUS
    Malaay, Emmanuel
    Simora, Michael
    Cabatic, Ronald John
    Oco, Nathaniel
    Roxas, Rachel Edita
    2017 20TH CONFERENCE OF THE ORIENTAL CHAPTER OF THE INTERNATIONAL COORDINATING COMMITTEE ON SPEECH DATABASES AND SPEECH I/O SYSTEMS AND ASSESSMENT (O-COCOSDA), 2017, : 81 - 85
  • [32] An audio-visual corpus for speech perception and automatic speech recognition (L)
    Cooke, Martin
    Barker, Jon
    Cunningham, Stuart
    Shao, Xu
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2006, 120 (05): : 2421 - 2424
  • [33] DEVELOPMENT OF NEW SPEECH CORPUS FOR ELDERLY JAPANESE SPEECH RECOGNITION
    Iribe, Yurie
    Kitaoka, Norihide
    Segawa, Shuhei
    2015 INTERNATIONAL CONFERENCE ORIENTAL COCOSDA HELD JOINTLY WITH 2015 CONFERENCE ON ASIAN SPOKEN LANGUAGE RESEARCH AND EVALUATION (O-COCOSDA/CASLRE), 2015, : 27 - 31
  • [34] Indonesian Audio-Visual Speech Corpus for Multimodal Automatic Speech Recognition
    Maulana, Muhammad Rizki Aulia Rahman
    Fanany, Mohamad Ivan
    2017 INTERNATIONAL CONFERENCE ON ADVANCED COMPUTER SCIENCE AND INFORMATION SYSTEMS (ICACSIS), 2017, : 381 - 385
  • [35] Corpus Construction for Deaf Speakers and Analysis by Automatic Speech Recognition
    Kobayashi, Akio
    Yasu, Keiichi
    2023 ASIA PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE, APSIPA ASC, 2023, : 2294 - 2298
  • [36] MinSpeech: A Corpus of Southern Min Dialect for Automatic Speech Recognition
    Lin, Jiayan
    Lu, Shenghui
    Huang, Hukai
    Guan, Wenhao
    Xu, Binbin
    Bu, Hui
    Hong, Qingyang
    Li, Lin
    INTERSPEECH 2024, 2024, : 2330 - 2334
  • [37] TED-LIUM: an Automatic Speech Recognition dedicated corpus
    Rousseau, Anthony
    Deleglise, Paul
    Esteve, Yannick
    LREC 2012 - EIGHTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2012, : 125 - 129
  • [38] An audio-visual corpus for multimodal automatic speech recognition
    Czyzewski, Andrzej
    Kostek, Bozena
    Bratoszewski, Piotr
    Kotus, Jozef
    Szykulski, Marcin
    JOURNAL OF INTELLIGENT INFORMATION SYSTEMS, 2017, 49 (02) : 167 - 192
  • [39] An audio-visual corpus for multimodal automatic speech recognition
    Andrzej Czyzewski
    Bozena Kostek
    Piotr Bratoszewski
    Jozef Kotus
    Marcin Szykulski
    Journal of Intelligent Information Systems, 2017, 49 : 167 - 192
  • [40] Simultaneous recognition of words and prosody in the Boston University Radio Speech Corpus
    Hasegawa-Johnson, M
    Chen, K
    Cole, J
    Borys, S
    Kim, SS
    Cohen, A
    Zhang, T
    Choi, JY
    Kim, H
    Yoon, T
    Chavarria, S
    SPEECH COMMUNICATION, 2005, 46 (3-4) : 418 - 439