The Development of Isolated Words Corpus of Pashto for the Automatic Speech Recognition Research

Citations: 0

Authors:

Ahmed, Irfan [1]

Ahmad, Nasir [2]

Ali, Hazrat [1]

Ahmad, Gulzar [1]

Affiliations:

[1] Univ Engn & Technol, Dept Elect Engn, Peshawar, Pakistan

[2] Univ Engn & Technol, Dept Comp Syst Engn, Peshawar, Pakistan

Source:

2012 INTERNATIONAL CONFERENCE ON ROBOTICS AND ARTIFICIAL INTELLIGENCE (ICRAI) | 2012

Keywords:

Automatic Speech Recognition; Pashto Speech Corpus; Human Computer Interaction

DOI:

Not available

Chinese Library Classification:

TP18 [Artificial intelligence theory]

Discipline classification codes:

081104; 0812; 0835; 1405

Abstract:

The availability of a standard speech database is of paramount importance in automatic speech recognition (ASR) research, because it provides a baseline for comparing the performance of ASR approaches. This paper presents the development of a medium-vocabulary speech corpus for the Pashto language. The vocabulary comprises 161 isolated Pashto words: the most frequently used words of the language, the names of the days of the week, and the numbers from 0 to 25. The words were uttered by 30 speakers of different ages and genders, including both native and non-native speakers of Pashto. The corpus was recorded in a noise-free office environment. The corpus is then used to develop an automatic speech recognition system for Pashto.


Pages: 139-143

Page count: 5
