Corpus Construction for Deaf Speakers and Analysis by Automatic Speech Recognition

被引：0

作者：

Kobayashi, Akio ^{[1
]}

Yasu, Keiichi ^{[2
]}

机构：

[1] Yamato Univ, Suita, Osaka, Japan

[2] Tsukuba Univ Technol, Tsukuba, Ibaraki, Japan

来源：

2023 ASIA PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE, APSIPA ASC | 2023年

关键词：

D O I：

10.1109/APSIPAASC58517.2023.10317192

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

This study explores automatic speech recognition (ASR) for the deaf and hard-of-hearing. Despite the recent progress in ASR for dysarthric speakers, existing research primarily focuses on people with motor speech disorders. Thus, the effect of speech diversity on the performance of ASR is not considered for ambiguous deaf speech owing to a lack of auditory feedback. Therefore, we compiled a corpus of speech of many profoundly deaf speakers to compare the ASR performance with that of normal-hearing speakers. The performance analysis is reported through a set of phoneme recognition experiments. Furthermore, we show that additional phonological features that reflect deaf speakers' articulation can improve performance in phoneme recognition for deaf speech.

引用

页码：2294 / 2298

页数：5

共 50 条

[41] Speech corpus recycling for acoustic cross-domain environments for automatic speech recognition
Ichikawa, Osamu
Rennie, Steven J.
Fukuda, Takashi
Willett, Daniel
ACOUSTICAL SCIENCE AND TECHNOLOGY, 2016, 37 (02) : 55 - 65
[42] RODIGITS - A ROMANIAN CONNECTED-DIGITS SPEECH CORPUS FOR AUTOMATIC SPEECH AND SPEAKER RECOGNITION
Georgescu, Alexandru Lucian
Caranica, Alexandru
Cucu, Horia
Burileanu, Corneliu
UNIVERSITY POLITEHNICA OF BUCHAREST SCIENTIFIC BULLETIN SERIES C-ELECTRICAL ENGINEERING AND COMPUTER SCIENCE, 2018, 80 (03): : 45 - 62
[43] ALGERIAN ARABIC SPEECH DATABASE (ALGASD): CORPUS DESIGN AND AUTOMATIC SPEECH RECOGNITION APPLICATION
Droua-Hamdani, Ghania
Selouani, Sid Ahmed
Boudraa, Malika
ARABIAN JOURNAL FOR SCIENCE AND ENGINEERING, 2010, 35 (2C): : 157 - 166
[44] Automatic speech activity detection, source localization, and speech recognition on the CHIL seminar corpus
Macho, D
Padrell, J
Abad, A
Nadeu, C
Hernando, J
McDonough, J
Wölfel, M
Klee, W
Omologo, M
Brutti, A
Svaizer, P
Potamianos, G
Chu, SM
2005 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), VOLS 1 AND 2, 2005, : 877 - 880
[45] AUTOMATIC LEARNING - AN APPROACH TO THE ADAPTATION OF A SPEECH RECOGNITION SYSTEM TO ONE OR SEVERAL SPEAKERS
PISTERBOURJOT, C
HATON, JP
SPEECH COMMUNICATION, 1987, 6 (01) : 43 - 54
[46] Study of the performance of automatic speech recognition systems in speakers with Parkinson's Disease
Moro-Velazquez, Laureano
Cho, JaeJin
Watanabe, Shinji
Hasegawa-Johnson, Mark A.
Scharenborg, Odette
Kim, Heejin
Dehak, Najim
INTERSPEECH 2019, 2019, : 3875 - 3879
[47] Acoustic Analysis for Automatic Speech Recognition
O'Shaughnessy, Douglas
PROCEEDINGS OF THE IEEE, 2013, 101 (05) : 1038 - 1053
[48] CLAC: A Speech Corpus Of Healthy English Speakers
Haulcy, R'mani
Glass, James
INTERSPEECH 2021, 2021, : 2966 - 2970
[49] Automatic Speech Recognition of Vietnamese for a New Large-Scale Corpus
Tran, Linh Thi Thuc
Kim, Han-Gyu
La, Hoang Minh
Pham, Su Van
ELECTRONICS, 2024, 13 (05)
[50] The Development of Isolated Words Corpus of Pashto for the Automatic Speech Recognition Research
Ahmed, Irfan
Ahmad, Nasir
Ali, Hazrat
Ahmad, Gulzar
2012 INTERNATIONAL CONFERENCE ON ROBOTICS AND ARTIFICIAL INTELLIGENCE (ICRAI), 2012, : 139 - 143

← 1 2 3 4 5 →