Comparison of a short-time speech-based intelligibility metric to the speech transmission index and intelligibility data

被引:0
|
作者
机构
[1] Payton, Karen L.
[2] Shrestha, Mona
来源
Payton, K.L. (kpayton@umassd.edu) | 1600年 / Acoustical Society of America卷 / 134期
关键词
Several algorithms have been shown to generate a metric corresponding to the Speech Transmission Index (STI) using speech as a probe stimulus [e.g; Goldsworthy and Greenberg; J; Acoust; Soc; Am; 116; 3679-3689 (2004)]. The time-domain approaches work well on long speech segments and have the added potential to be used for short-time analysis. This study investigates the performance of the Envelope Regression (ER) time-domain STI method as a function of window length; in acoustically degraded environments with multiple talkers and speaking styles. The ER method is compared with a short-time Theoretical STI; derived from octave-band signal-to-noise ratios and reverberation times. For windows as short as 0.3 s; the ER method tracks short-time Theoretical STI changes in stationary speech-shaped noise; fluctuating restaurant babble and stationary noise plus reverberation. The metric is also compared to intelligibility scores on conversational speech and speech articulated clearly but at normal speaking rates (Clear/Norm) in stationary noise. Correlation between the metric and intelligibility scores is high and; consistent with the subject scores; the metrics are higher for Clear/Norm speech than for conversational speech and higher for the first word in a sentence than for the last word. © 2013 Acoustical Society of America;
D O I
暂无
中图分类号
学科分类号
摘要
Conference article (CA)
引用
收藏
相关论文
共 50 条
  • [41] Relationship between Chinese speech intelligibility and speech transmission index in rooms using dichotic listening
    Peng JianXin
    CHINESE SCIENCE BULLETIN, 2008, 53 (18): : 2748 - 2752
  • [42] WORD INTELLIGIBILITY OF TIME COMPRESSED SPEECH
    KLUMPP, RG
    WEBSTER, JC
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1957, 29 (11): : 1255 - 1255
  • [43] Relationship between Chinese speech intelligibility and speech transmission index in rooms using dichotic listening
    PENG JianXin School of Physics
    ChineseScienceBulletin, 2008, (18) : 2748 - 2752
  • [44] INTELLIGIBILITY OF TIME-COMPRESSED SPEECH
    KLUMPP, RG
    WEBSTER, JC
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1961, 33 (03): : 265 - &
  • [45] On the Relationship Between Short-Time Objective Intelligibility and Short-Time Spectral-Amplitude Mean-Square Error for Speech Enhancement
    Kolbaek, Morten
    Tan, Zheng-Hua
    Jensen, Jesper
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2019, 27 (02) : 283 - 295
  • [46] Intelligibility of locally time-reversed speech: A multilingual comparison
    Ueda, Kazuo
    Nakajima, Yoshitaka
    Ellermeier, Wolfgang
    Kattner, Florian
    SCIENTIFIC REPORTS, 2017, 7
  • [47] Intelligibility of locally time-reversed speech: A multilingual comparison
    Kazuo Ueda
    Yoshitaka Nakajima
    Wolfgang Ellermeier
    Florian Kattner
    Scientific Reports, 7
  • [48] Bispectral feature speech intelligibility assessment metric based on auditory model
    Chen, Xiaomei
    Wang, Xiaowei
    Zhong, Bo
    Yang, Jiayan
    Shang, Yingying
    COMPUTER SPEECH AND LANGUAGE, 2023, 80
  • [49] A Visual Speech Intelligibility Benefit Based on Speech Rhythm
    Kawase, Saya
    Davis, Chris
    Kim, Jeesun
    BRAIN SCIENCES, 2023, 13 (06)
  • [50] MONAURAL SPEECH ENHANCEMENT USING DEEP NEURAL NETWORKS BY MAXIMIZING A SHORT-TIME OBJECTIVE INTELLIGIBILITY MEASURE
    Kolbaek, Morten
    Tan, Zheng-Hua
    Jensen, Jesper
    2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2018, : 5059 - 5063