Modelling Errors in Automatic Speech Recognition for Dysarthric Speakers

被引:0
|
作者
Santiago Omar Caballero Morales
Stephen J. Cox
机构
[1] University of East Anglia,Speech, Language, and Music Group, School of Computing Sciences
关键词
Recognition Accuracy; Confusion Matrix; Automatic Speech Recognition; Acoustic Model; Speech Disorder;
D O I
暂无
中图分类号
学科分类号
摘要
Dysarthria is a motor speech disorder characterized by weakness, paralysis, or poor coordination of the muscles responsible for speech. Although automatic speech recognition (ASR) systems have been developed for disordered speech, factors such as low intelligibility and limited phonemic repertoire decrease speech recognition accuracy, making conventional speaker adaptation algorithms perform poorly on dysarthric speakers. In this work, rather than adapting the acoustic models, we model the errors made by the speaker and attempt to correct them. For this task, two techniques have been developed: (1) a set of "metamodels" that incorporate a model of the speaker's phonetic confusion matrix into the ASR process; (2) a cascade of weighted finite-state transducers at the confusion matrix, word, and language levels. Both techniques attempt to correct the errors made at the phonetic level and make use of a language model to find the best estimate of the correct word sequence. Our experiments show that both techniques outperform standard adaptation techniques.
引用
收藏
相关论文
共 50 条
  • [21] A personalised speech communication application for dysarthric speakers
    Gibson, Matthew
    Karaulov, Ievgen
    Zhelo, Oleksii
    Jurcicek, Filip
    INTERSPEECH 2023, 2023, : 666 - 667
  • [22] Intelligibility of dysarthric speech: perceptions of speakers and listeners
    Walshe, Margaret
    Miller, Nick
    Leahy, Margaret
    Murray, Aisling
    INTERNATIONAL JOURNAL OF LANGUAGE & COMMUNICATION DISORDERS, 2008, 43 (06) : 633 - 648
  • [23] Multi-Stage DNN Training for Automatic Recognition of Dysarthric Speech
    Yilmaz, Emre
    Ganzeboom, Mario
    Cucchiarini, Catia
    Strik, Helmer
    18TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2017), VOLS 1-6: SITUATED INTERACTION, 2017, : 2685 - 2689
  • [24] Exploring Alternative Data Augmentation Methods in Dysarthric Automatic Speech Recognition
    Gracelli, Ricardo
    Almeida, Jurandy
    2024 IEEE 37TH INTERNATIONAL SYMPOSIUM ON COMPUTER-BASED MEDICAL SYSTEMS, CBMS 2024, 2024, : 243 - 248
  • [25] Analysis of Articulation Errors in Dysarthric Speech
    Goswami, Upashana
    Nirmala, S. R.
    Vikram, C. M.
    Kalita, Sishir
    Prasanna, S. R. M.
    JOURNAL OF PSYCHOLINGUISTIC RESEARCH, 2020, 49 (01) : 163 - 174
  • [26] DEVELOPING SUCCESSFUL SPEAKERS FOR AN AUTOMATIC SPEECH RECOGNITION SYSTEM
    DANIS, CM
    PROCEEDINGS OF THE HUMAN FACTORS SOCIETY 33RD ANNUAL MEETING, VOL 1: PERSPECTIVES, 1989, : 301 - 304
  • [27] Analysis of Articulation Errors in Dysarthric Speech
    Upashana Goswami
    S. R. Nirmala
    C. M. Vikram
    Sishir Kalita
    S. R. M. Prasanna
    Journal of Psycholinguistic Research, 2020, 49 : 163 - 174
  • [28] Optimization of dysarthric speech recognition
    Chen, FX
    Kostov, A
    PROCEEDINGS OF THE 19TH ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY, VOL 19, PTS 1-6: MAGNIFICENT MILESTONES AND EMERGING OPPORTUNITIES IN MEDICAL ENGINEERING, 1997, 19 : 1436 - 1439
  • [29] Debiased Automatic Speech Recognition for Dysarthric Speech via Sample Reweighting with Sample Affinity Test
    Kim, Eungbeom
    Chae, Yunkee
    Sim, Jaeheon
    Lee, Kyogu
    INTERSPEECH 2023, 2023, : 1508 - 1512
  • [30] Dysarthric Speech Transformer: A Sequence-to-Sequence Dysarthric Speech Recognition System
    Shahamiri, Seyed Reza
    Lal, Vanshika
    Shah, Dhvani
    IEEE TRANSACTIONS ON NEURAL SYSTEMS AND REHABILITATION ENGINEERING, 2023, 31 : 3407 - 3416