Intelligibility is more than a single word:: Quantification of speech intelligibility by ASR and prosody

被引:0
|
作者
Maier, Andreas [1 ,3 ]
Haderlein, Tino [1 ]
Schuster, Maria [1 ]
Nkenke, Emeka [2 ]
Noeth, Elmar [3 ]
机构
[1] Univ Erlangen Nurnberg, Abt Phoiatrie & Padaudiol, Bohlenpl 21, D-91054 Erlangen, Germany
[2] Univ Erlangen Nurnberg, Mund Kiefer & Gesichtschirurgische Klin, D-91054 Erlangen, Germany
[3] Univ Erlangen Nurnberg, Lehrstuhl Mustererkennung, D-91058 Erlangen, Germany
来源
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper we examine the quality of the prediction of intelligibility scores of human experts. Furthermore, we investigate the differences between subjective expert raters who evaluated speech disorders of laryngectomees and children with cleft lip and palate. We use the recognition rate of a word recognizer and prosodic features to predict the intelligibility score of each individual expert. For each expert and the mean opinion of all experts we present the best features to model their scoring behavior according to the mean rank obtained during a 10-fold cross-validation. In this manner all individual speech experts were modeled with a correlation coefficient of at least r >.75. The mean opinion of all raters is predicted with a correlation of r =.90 for the laryngectomees and r =.86 for the children.
引用
收藏
页码:278 / +
页数:3
相关论文
共 50 条
  • [41] The Evaluation Process Automation of Phrase and Word Intelligibility Using Speech Recognition Systems
    Kostuchenko, Evgeny
    Novokhrestova, Dariya
    Tirskaya, Marina
    Shelupanov, Alexander
    Nemirovich-Danchenko, Mikhail
    Choynzonov, Evgeny
    Balatskaya, Lidiya
    SPEECH AND COMPUTER, SPECOM 2019, 2019, 11658 : 237 - 246
  • [42] Characterization of atypical vocal source excitation, temporal dynamics and prosody for objective measurement of dysarthric word intelligibility
    Falk, Tiago H.
    Chan, Wai-Yip
    Shein, Fraser
    SPEECH COMMUNICATION, 2012, 54 (05) : 622 - 631
  • [43] Using envelope modulation to explain speech intelligibility in the presence of a single reflection
    Muralimanohar, Ramesh Kumar
    Kates, James M.
    Arehart, Kathryn H.
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2017, 141 (05): : EL482 - EL487
  • [44] SUBJECTIVE SPEECH QUALITY AND SPEECH INTELLIGIBILITY EVALUATION OF SINGLE-CHANNEL DEREVERBERATION ALGORITHMS
    Warzybok, Anna
    Kodrasi, Ina
    Jungmann, Jan Ole
    Habets, Emanuel
    Gerkmann, Timo
    Mertins, Alfred
    Doclo, Simon
    Kollmeier, Birger
    Goetze, Stefan
    2014 14TH INTERNATIONAL WORKSHOP ON ACOUSTIC SIGNAL ENHANCEMENT (IWAENC), 2014, : 332 - 336
  • [45] Assessing speech intelligibility of pathological speech in sentences and word lists: The contribution of phoneme-level measures
    Xue, Wei
    van Hout, Roeland
    Cucchiarini, Catia
    Strik, Helmer
    JOURNAL OF COMMUNICATION DISORDERS, 2023, 102
  • [46] On Improvement of Speech Intelligibility and Quality: A Survey of Unsupervised Single Channel Speech Enhancement Algorithms
    Saleem, Nasir
    Khattak, Muhammad Irfan
    Verdu, Elena
    INTERNATIONAL JOURNAL OF INTERACTIVE MULTIMEDIA AND ARTIFICIAL INTELLIGENCE, 2020, 6 (02): : 78 - 89
  • [47] On Speech Intelligibility Estimation of Phase-Aware Single-Channel Speech Enhancement
    Gaich, Andreas
    Mowlaee, Pejman
    16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, 2015, : 2553 - 2557
  • [48] THE USE OF SINGLE-CHANNEL COMPRESSION FOR THE IMPROVEMENT OF SPEECH-INTELLIGIBILITY
    DRESCHLER, WA
    EBERHARDT, D
    MELK, PW
    SCANDINAVIAN AUDIOLOGY, 1984, 13 (04): : 231 - 236
  • [49] Effects of spatial and temporal integration of a single early reflection on speech intelligibility
    Warzybok, Anna
    Rennies, Jan
    Brand, Thomas
    Doclo, Simon
    Kollmeier, Birger
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2013, 133 (01): : 269 - 282
  • [50] Predicting Intelligibility of Enhanced Speech Using Posteriors Derived from DNN-based ASR System
    Arai, Kenichi
    Araki, Shoko
    Ogawa, Atsunori
    Kinoshita, Keisuke
    Nakatani, Tomohiro
    Irino, Toshio
    INTERSPEECH 2020, 2020, : 1156 - 1160