Corpus Construction for Deaf Speakers and Analysis by Automatic Speech Recognition

Authors
Kobayashi, Akio [1]
Yasu, Keiichi [2]
Affiliations
[1] Yamato Univ, Suita, Osaka, Japan
[2] Tsukuba Univ Technol, Tsukuba, Ibaraki, Japan
DOI
10.1109/APSIPAASC58517.2023.10317192
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
This study explores automatic speech recognition (ASR) for deaf and hard-of-hearing speakers. Despite recent progress in ASR for dysarthric speakers, existing research focuses primarily on people with motor speech disorders. As a result, the effect of speech diversity on ASR performance has not been examined for deaf speech, which is often ambiguous owing to a lack of auditory feedback. We therefore compiled a speech corpus from many profoundly deaf speakers and compared ASR performance on it with that for normal-hearing speakers. The analysis is reported through a set of phoneme recognition experiments. Furthermore, we show that additional phonological features reflecting deaf speakers' articulation can improve phoneme recognition performance for deaf speech.
Pages: 2294-2298
Page count: 5
Related papers
  • [1] AUTOMATIC RECOGNITION OF DEAF SPEECH
    ABDELHAMIED, K
    WALDRON, M
    FOX, RA
    VOLTA REVIEW, 1990, 92 (03) : 121 - 130
  • [2] Corpus for automatic speech recognition
    Adda-Decker, Martine
    REVUE FRANCAISE DE LINGUISTIQUE APPLIQUEE, 2007, 12 (01) : 71 - 84
  • [3] Corpus Construction for Aviation Speech Recognition
    Cui, Yiyi
    Wang, Zhen
    Lu, Yanyu
    Fu, Shan
    HUMAN-COMPUTER INTERACTION: TECHNOLOGICAL INNOVATION, PT II, 2022, 13303 : 238 - 250
  • [4] Creation of Marathi Speech Corpus for Automatic Speech Recognition
    Gaikwad, Santosh
    Gawali, Bharti
    Mehrotra, Suresh
    2013 INTERNATIONAL CONFERENCE ORIENTAL COCOSDA HELD JOINTLY WITH 2013 CONFERENCE ON ASIAN SPOKEN LANGUAGE RESEARCH AND EVALUATION (O-COCOSDA/CASLRE), 2013
  • [5] The Makerere Radio Speech Corpus: A Luganda Radio Corpus for Automatic Speech Recognition
    Mukiibi, Jonathan
    Katumba, Andrew
    Nakatumba-Nabende, Joyce
    Hussein, Ali
    Meyer, Josh
    LREC 2022: THIRTEENTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2022 : 1945 - 1954
  • [6] Automatic Construction of the Finnish Parliament Speech Corpus
    Mansikkaniemi, Andre
    Smit, Peter
    Kurimo, Mikko
    18TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2017), VOLS 1-6: SITUATED INTERACTION, 2017 : 3762 - 3766
  • [7] Multimodal English corpus for automatic speech recognition
    Kunka, Bartosz
    Kupryjanow, Adam
    Dalka, Piotr
    Bratoszewski, Piotr
    Szczodrak, Maciej
    Spaleniak, Pawel
    Szykulski, Marcin
    Czyzewski, Andrzej
    2013 SIGNAL PROCESSING: ALGORITHMS, ARCHITECTURES, ARRANGEMENTS, AND APPLICATIONS (SPA), 2013 : 106 - 111
  • [8] Modelling Errors in Automatic Speech Recognition for Dysarthric Speakers
    Morales, Santiago Omar Caballero
    Cox, Stephen J.
    EURASIP JOURNAL ON ADVANCES IN SIGNAL PROCESSING, 2009
  • [9] CEASR: A Corpus for Evaluating Automatic Speech Recognition
    Ulasik, Malgorzata Anna
    Huerlimann, Manuela
    Germann, Fabian
    Gedik, Esin
    Benites, Fernando
    Cieliebak, Mark
    PROCEEDINGS OF THE 12TH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION (LREC 2020), 2020 : 6477 - 6485