Corpus Construction for Deaf Speakers and Analysis by Automatic Speech Recognition

被引：0

作者：

Kobayashi, Akio ^{[1
]}

Yasu, Keiichi ^{[2
]}

机构：

[1] Yamato Univ, Suita, Osaka, Japan

[2] Tsukuba Univ Technol, Tsukuba, Ibaraki, Japan

来源：

2023 ASIA PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE, APSIPA ASC | 2023年

关键词：

D O I：

10.1109/APSIPAASC58517.2023.10317192

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

This study explores automatic speech recognition (ASR) for the deaf and hard-of-hearing. Despite the recent progress in ASR for dysarthric speakers, existing research primarily focuses on people with motor speech disorders. Thus, the effect of speech diversity on the performance of ASR is not considered for ambiguous deaf speech owing to a lack of auditory feedback. Therefore, we compiled a corpus of speech of many profoundly deaf speakers to compare the ASR performance with that of normal-hearing speakers. The performance analysis is reported through a set of phoneme recognition experiments. Furthermore, we show that additional phonological features that reflect deaf speakers' articulation can improve performance in phoneme recognition for deaf speech.

引用

页码：2294 / 2298

页数：5

共 50 条

[31] An audio-visual corpus for multimodal automatic speech recognition
Andrzej Czyzewski
Bozena Kostek
Piotr Bratoszewski
Jozef Kotus
Marcin Szykulski
Journal of Intelligent Information Systems, 2017, 49 : 167 - 192
[32] A speech corpus of Quechua Collao for automatic dimensional emotion recognition
Paccotacya-Yanque, Rosa Y. G.
Huanca-Anquise, Candy A.
Escalante-Calcina, Judith
Ramos-Lovon, Wilber R.
Cuno-Parari, Alvaro E.
SCIENTIFIC DATA, 2022, 9 (01)
[33] A speech corpus of Quechua Collao for automatic dimensional emotion recognition
Rosa Y. G. Paccotacya-Yanque
Candy A. Huanca-Anquise
Judith Escalante-Calcina
Wilber R. Ramos-Lovón
Álvaro E. Cuno-Parari
Scientific Data, 9
[34] Quantification of Automatic Speech Recognition System Performance on d/Deaf and Hard of Hearing Speech
Zhao, Robin
Choi, Anna S. G.
Koenecke, Allison
Rameau, Anais
LARYNGOSCOPE, 2025, 135 (01): : 191 - 197
[35] Cued Speech automatic recognition in normal-hearing and deaf subjects
Heracleous, Panikos
Beautemps, Denis
Aboutabit, Noureddine
SPEECH COMMUNICATION, 2010, 52 (06) : 504 - 512
[36] Automatic Speech Recognition Services: Deaf and Hard-of-Hearing Usability
Glasser, Abraham
CHI EA '19 EXTENDED ABSTRACTS: EXTENDED ABSTRACTS OF THE 2019 CHI CONFERENCE ON HUMAN FACTORS IN COMPUTING SYSTEMS, 2019,
[37] Corpus of deaf speech for acoustic and speech production research
Mendel, Lisa Lucks (lmendel@memphis.edu), 2017, Acoustical Society of America (142):
[38] Corpus of deaf speech for acoustic and speech production research
Mendel, Lisa Lucks
Lee, Sungmin
Pousson, Monique
Patro, Chhayakanta
McSorley, Skylar
Banerjee, Bonny
Najnin, Shamima
Kapourchali, Masoumeh Heidari
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2017, 142 (01): : EL102 - EL107
[39] Construction and Analysis of Indonesian Emotional Speech Corpus
Lubis, Nurul
Lestari, Dessi
Purwarianti, Ayu
Sakti, Sakriani
Nakamura, Satoshi
2014 17TH ORIENTAL CHAPTER OF THE INTERNATIONAL COMMITTEE FOR THE CO-ORDINATION AND STANDARDIZATION OF SPEECH DATABASES AND ASSESSMENT TECHNIQUES (COCOSDA), 2014,
[40] PaSCoNT - Parallel Speech Corpus of Northern-central Thai for automatic speech recognition
Taerungruang, Supawat
Taninpong, Phimphaka
Chunwijitra, Vataya
Thatphithakkul, Sumonmas
Kasuriya, Sawit
Inthanon, Viroj
Paksaranuwat, Pawat
Thumronglaohapun, Salinee
Nakharutai, Nawapon
Inkeaw, Papangkorn
Bootkrajang, Jakramate
COMPUTER SPEECH AND LANGUAGE, 2025, 89

← 1 2 3 4 5 →