Analysis of speech-based speech transmission index methods with implications for nonlinear operations

Cited by: 160
Authors
Goldsworthy, RL
Greenberg, JE
Affiliations
[1] MIT, Elect Res Lab, Cambridge, MA 02139 USA
[2] Harvard Mit Div Hlth Sci & Technol, Cambridge, MA 02139 USA
Source
Keywords
DOI
10.1121/1.1804628
Chinese Library Classification number
O42 [Acoustics];
Subject classification code
070206 ; 082403 ;
Abstract
The Speech Transmission Index (STI) is a physical metric that is well correlated with the intelligibility of speech degraded by additive noise and reverberation. The traditional STI uses modulated noise as a probe signal and is valid for assessing degradations that result from linear operations on the speech signal. Researchers have attempted to extend the STI to predict the intelligibility of nonlinearly processed speech by proposing variations that use speech as a probe signal. This work considers four previously proposed speech-based STI methods and four novel methods, studied under conditions of additive noise, reverberation, and two nonlinear operations (envelope thresholding and spectral subtraction). Analyzing intermediate metrics in the STI calculation reveals why some methods fail for nonlinear operations. Results indicate that none of the previously proposed methods is adequate for all of the conditions considered, while four proposed methods produce qualitatively reasonable results and warrant further study. The discussion considers the relevance of this work to predicting the intelligibility of cochlear-implant processed speech. (C) 2004 Acoustical Society of America.
Pages: 3679-3689
Page count: 11
Related papers
50 in total
  • [21] Analysis of a Speech-Based Intersection Assistant in Real Urban Traffic
    Orth, Dennis
    Steinhardt, Nico
    Bolder, Bram
    Dunn, Mark
    Kolossa, Dorothea
    Heckmann, Martin
    2018 21ST INTERNATIONAL CONFERENCE ON INTELLIGENT TRANSPORTATION SYSTEMS (ITSC), 2018, : 1273 - 1278
  • [22] Difficulties in Automatic Speech Recognition of Dysarthric Speakers and Implications for Speech-Based Applications Used by the Elderly: A Literature Review
    Young, Victoria
    Mihailidis, Alex
    ASSISTIVE TECHNOLOGY, 2010, 22 (02) : 99 - 112
  • [23] Analysis of the quality of remote working experience: a speech-based approach
    Simone Porcu
    Alessandro Floris
    Luigi Atzori
    Quality and User Experience, 2022, 7 (1)
  • [24] Speech-based interaction with in-vehicle computers: The effect of speech-based e-mail on drivers' attention to the roadway
    Lee, JD
    Caven, B
    Haake, S
    Brown, TL
    HUMAN FACTORS, 2001, 43 (04) : 631 - 640
  • [25] Speaker normalisation for speech-based emotion detection
    Sethu, Vidhyasaharan
    Ambikairajah, Eliathainby
    Epps, Julien
    PROCEEDINGS OF THE 2007 15TH INTERNATIONAL CONFERENCE ON DIGITAL SIGNAL PROCESSING, 2007, : 611 - +
  • [26] VOICE: a framework for speech-based mobile systems
    Sharp, Adam
    Kurkovsky, Stan
    21ST INTERNATIONAL CONFERENCE ON ADVANCED NETWORKING AND APPLICATIONS WORKSHOPS/SYMPOSIA, VOL 2, PROCEEDINGS, 2007, : 38 - +
  • [27] Speech-Based Annotation and Retrieval of Digital Photographs
    Hazen, Timothy J.
    Sherry, Brennan
    Adler, Mark
    INTERSPEECH 2007: 8TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION, VOLS 1-4, 2007, : 2077 - +
  • [28] An Exploration of Speech-Based Productivity Support in the Car
    Martelaro, Nikolas
    Teevan, Jaime
    Iqbal, Shamsi T.
    CHI 2019: PROCEEDINGS OF THE 2019 CHI CONFERENCE ON HUMAN FACTORS IN COMPUTING SYSTEMS, 2019,
  • [29] Speech-based Interaction: Myths, Challenges, and Opportunities
    Munteanu, Cosmin
    Penn, Gerald
    PROCEEDINGS OF THE 16TH ACM INTERNATIONAL CONFERENCE ON HUMAN-COMPUTER INTERACTION WITH MOBILE DEVICES AND SERVICES (MOBILEHCI'14), 2014, : 567 - 568
  • [30] Speech transmission index or rapid speech transmission index for classrooms? A designer's point of view
    Tang, SK
    Yeung, MH
    JOURNAL OF SOUND AND VIBRATION, 2004, 276 (1-2) : 431 - 439