Analysis of speech-based speech transmission index methods with implications for nonlinear operations

被引:160
|
作者
Goldsworthy, RL
Greenberg, JE
机构
[1] MIT, Elect Res Lab, Cambridge, MA 02139 USA
[2] Harvard Mit Div Hlth Sci & Technol, Cambridge, MA 02139 USA
来源
关键词
D O I
10.1121/1.1804628
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
The Speech Transmission Index (STI) is a physical. metric that is well correlated with the intelligibility of speech degraded by additive noise and reverberation. The traditional STI uses modulated noise as a probe signal and is valid for assessing degradations that result from linear operations on the speech signal. Researchers have attempted to extend the STI to predict the intelligibility of nonlinearly processed speech by proposing variations that use speech as a probe signal. This work considers four previously proposed speech-based STI methods and four novel methods, studied under conditions of additive noise, reverberation, and two nonlinear operations (envelope thresholding and spectral subtraction). Analyzing intermediate metrics in the STI calculation reveals why some methods fail for nonlinear operations. Results indicate that none of the previously proposed methods is adequate for all of the conditions considered, while four proposed methods produce qualitatively reasonable results and warrant further study. The discussion considers the relevance of this work to predicting the intelligibility of cochlear-implant processed speech. (C) 2004 Acoustical Society of America.
引用
收藏
页码:3679 / 3689
页数:11
相关论文
共 50 条
  • [1] Comparison of a short-time speech-based intelligibility metric to the speech transmission index and intelligibility data
    Payton, Karen L.
    Shrestha, Mona
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2013, 134 (05): : 3818 - 3827
  • [2] Comparison of a short-time speech-based intelligibility metric to the speech transmission index and intelligibility data
    Payton, K.L. (kpayton@umassd.edu), 1600, Acoustical Society of America (134):
  • [3] Speech-based services
    Furman, DS
    Cosky, MJ
    Thomson, DL
    O'Brien, SA
    Sumner, EE
    BELL LABS TECHNICAL JOURNAL, 1999, 4 (02) : 88 - 97
  • [4] TRANSIENT-BASED SPEECH TRANSMISSION INDEX FOR PREDICTING INTELLIGIBILITY IN NONLINEAR SPEECH ENHANCEMENT PROCESSORS
    Schlesinger, Anton
    2012 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2012, : 3993 - 3996
  • [5] WHITMAN AND SPEECH-BASED PROSODY
    JARVIS, DR
    WALT WHITMAN REVIEW, 1981, 27 (02): : 51 - 62
  • [6] Speech-based Class Attendance
    Amri, Umar Faizel
    Hashim, Nik Nur Wahidah Nik
    Hanif, Noor Hazrin Hany Mohamad
    6TH INTERNATIONAL CONFERENCE ON MECHATRONICS (ICOM'17), 2017, 260
  • [7] Speech-Based Meaning of Music
    Karbanova, Alice
    PROCEEDINGS OF 27TH INTERNATIONAL SYMPOSIUM ON FRONTIERS OF RESEARCH IN SPEECH AND MUSIC, FRSM 2023, 2024, 1455 : 385 - 397
  • [8] Experimental comparison between speech transmission index, rapid speech transmission index, and speech intelligibility index
    Larm, P
    Hongisto, V
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2006, 119 (02): : 1106 - 1117
  • [9] Experimental comparison between speech transmission index, rapid speech transmission index, and speech intelligibility index
    Larm, Petra
    Hongisto, Valtteri
    Journal of the Acoustical Society of America, 2006, 119 (02): : 1106 - 1117
  • [10] Experimental comparisons of speech transmission index prediction methods
    Zhu, Peisheng
    Tao, Wanqi
    Mo, Fangshuo
    Lu, Xiaodong
    Zhang, Hongchi
    APPLIED ACOUSTICS, 2024, 220