Comparison of a short-time speech-based intelligibility metric to the speech transmission index and intelligibility data

被引:0
|
作者
机构
[1] Payton, Karen L.
[2] Shrestha, Mona
来源
Payton, K.L. (kpayton@umassd.edu) | 1600年 / Acoustical Society of America卷 / 134期
关键词
Several algorithms have been shown to generate a metric corresponding to the Speech Transmission Index (STI) using speech as a probe stimulus [e.g; Goldsworthy and Greenberg; J; Acoust; Soc; Am; 116; 3679-3689 (2004)]. The time-domain approaches work well on long speech segments and have the added potential to be used for short-time analysis. This study investigates the performance of the Envelope Regression (ER) time-domain STI method as a function of window length; in acoustically degraded environments with multiple talkers and speaking styles. The ER method is compared with a short-time Theoretical STI; derived from octave-band signal-to-noise ratios and reverberation times. For windows as short as 0.3 s; the ER method tracks short-time Theoretical STI changes in stationary speech-shaped noise; fluctuating restaurant babble and stationary noise plus reverberation. The metric is also compared to intelligibility scores on conversational speech and speech articulated clearly but at normal speaking rates (Clear/Norm) in stationary noise. Correlation between the metric and intelligibility scores is high and; consistent with the subject scores; the metrics are higher for Clear/Norm speech than for conversational speech and higher for the first word in a sentence than for the last word. © 2013 Acoustical Society of America;
D O I
暂无
中图分类号
学科分类号
摘要
Conference article (CA)
引用
收藏
相关论文
共 50 条
  • [21] Application of a short-time version of the Equalization-Cancellation model to speech intelligibility experiments with speech maskers
    Wan, Rui
    Durlach, Nathaniel I.
    Colburn, H. Steven
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2014, 136 (02): : 768 - 776
  • [22] Coherence and the speech intelligibility index
    Kates, JM
    Arehart, KH
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2005, 117 (04): : 2224 - 2237
  • [23] AN INTELLIGIBILITY METRIC BASED ON A SIMPLE MODEL OF SPEECH COMMUNICATION
    Van Kuyk, Steven
    Kleijn, W. Bastiaan
    Hendriks, Richard C.
    2016 IEEE INTERNATIONAL WORKSHOP ON ACOUSTIC SIGNAL ENHANCEMENT (IWAENC), 2016,
  • [24] TRANSIENT-BASED SPEECH TRANSMISSION INDEX FOR PREDICTING INTELLIGIBILITY IN NONLINEAR SPEECH ENHANCEMENT PROCESSORS
    Schlesinger, Anton
    2012 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2012, : 3993 - 3996
  • [26] Using the Speech Transmission Index for predicting non-native speech intelligibility
    van Wijngaarden, SJ
    Bronkhorst, AW
    Houtgast, T
    Steeneken, HJM
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2004, 115 (03): : 1281 - 1291
  • [27] Speech intelligibility assessment in a helium environment. II. The speech intelligibility index
    Mendel, LL
    Hamill, BW
    Hendrix, JE
    Crepeau, LJ
    Pelton, JD
    Miley, MD
    Kadlec, EE
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1998, 104 (03): : 1609 - 1615
  • [28] JOINT FAR- AND NEAR-END SPEECH INTELLIGIBILITY ENHANCEMENT BASED ON THE APPROXIMATED SPEECH INTELLIGIBILITY INDEX
    Fuglsig, Andreas Jonas
    Ostergaard, Jan
    Jensen, Jesper
    Bertelsen, Lars Sondergaard
    Mariager, Peter
    Tan, Zheng-Hua
    2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 7752 - 7756
  • [29] Speech enhancement by speech intelligibility index In sensor network
    Parija, Smita
    Sahu, Prasanna Kumar
    Singh, Sudhansu Sekhar
    2012 THIRD INTERNATIONAL CONFERENCE ON COMPUTING COMMUNICATION & NETWORKING TECHNOLOGIES (ICCCNT), 2012,
  • [30] Development of the Cantonese speech intelligibility index
    Wong, Lena L. N.
    Ho, Amy H. S.
    Chua, Elizabeth W. W.
    Soli, Sigfrid D.
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2007, 121 (04): : 2350 - 2361