Corpus Construction for Deaf Speakers and Analysis by Automatic Speech Recognition

Cited by: 0
Authors
Kobayashi, Akio [1]
Yasu, Keiichi [2]
Affiliations
[1] Yamato Univ, Suita, Osaka, Japan
[2] Tsukuba Univ Technol, Tsukuba, Ibaraki, Japan
DOI
10.1109/APSIPAASC58517.2023.10317192
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Subject classification codes
081104; 0812; 0835; 1405
Abstract
This study explores automatic speech recognition (ASR) for deaf and hard-of-hearing speakers. Despite recent progress in ASR for dysarthric speakers, existing research focuses primarily on people with motor speech disorders. As a result, the effect of speech diversity on ASR performance has not been examined for deaf speech, which tends to be ambiguous owing to the lack of auditory feedback. We therefore compiled a speech corpus from many profoundly deaf speakers and compared ASR performance on it with performance on speech from normal-hearing speakers. The analysis is reported through a set of phoneme recognition experiments. We also show that additional phonological features reflecting deaf speakers' articulation can improve phoneme recognition for deaf speech.
Pages: 2294-2298
Number of pages: 5