SNR loss: A new objective measure for predicting the intelligibility of noise-suppressed speech

被引:66
|
作者
Ma, Jianfen [1 ,2 ]
Loizou, Philipos C. [1 ]
机构
[1] Univ Texas Dallas, Dept Elect Engn, Richardson, TX 75083 USA
[2] Taiyuan Univ Technol, Taiyuan 030024, Shanxi, Peoples R China
关键词
Speech intelligibility; Speech enhancement; Speech intelligibility indices; RECEPTION THRESHOLD; SUBSPACE APPROACH; ENHANCEMENT; PARAMETERS; REDUCTION; COHERENCE; INDEX;
D O I
10.1016/j.specom.2010.10.005
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Most of the existing intelligibility measures do not account for the distortions present in processed speech, such as those introduced by speech-enhancement algorithms. In the present study, we propose three new objective measures that can be used for prediction of intelligibility of processed (e.g., via an enhancement algorithm) speech in noisy conditions. All three measures use a critical-band spectral representation of the clean and noise-suppressed signals and are based on the measurement of the SNR loss incurred in each critical band after the corrupted signal goes through a speech enhancement algorithm. The proposed measures are flexible in that they can provide different weights to the two types of spectral distortions introduced by enhancement algorithms, namely spectral attenuation and spectral amplification distortions. The proposed measures were evaluated with intelligibility scores obtained by normal-hearing listeners in 72 noisy conditions involving noise-suppressed speech (consonants and sentences) corrupted by four different maskers (car, babble, train and street interferences). Highest correlation (r = -0.85) with sentence recognition scores was obtained using a variant of the SNR loss measure that only included vowel/consonant transitions and weak consonant information. High correlation was maintained for all noise types, with a maximum correlation (r = -0.88) achieved in street noise conditions. (C) 2010 Elsevier B.V. All rights reserved.
引用
收藏
页码:340 / 354
页数:15
相关论文
共 50 条
  • [41] EFFECTS OF MODULATED NOISE ON SPEECH INTELLIGIBILITY OF PEOPLE WITH SENSORINEURAL HEARING-LOSS
    SHAPIRO, MT
    VERMEULEN, V
    MELNICK, W
    ANNALS OF OTOLOGY RHINOLOGY AND LARYNGOLOGY, 1972, 81 (02): : 241 - +
  • [42] Evaluation of Objective Measures Applied on the Noise Suppressed Speech Signals with Chinese Content
    Ding, Huijun
    Pan, Jia
    Shen, Minmin
    2015 IEEE INTERNATIONAL CONFERENCE ON INFORMATION AND AUTOMATION, 2015, : 892 - 895
  • [43] Impaired noise adaptation contributes to speech intelligibility problems in people with hearing loss
    Marrufo-Perez, Miriam I.
    Fumero, Milagros J.
    Eustaquio-Martin, Almudena
    Lopez-Poveda, Enrique A.
    SCIENTIFIC REPORTS, 2024, 14 (01):
  • [44] EVALUATION OF OBJECTIVE MEASURES FOR INTELLIGIBILITY PREDICTION OF HMM-BASED SYNTHETIC SPEECH IN NOISE
    Valentini-Botinhao, Cassia
    Yamagishi, Junichi
    King, Simon
    2011 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2011, : 5112 - 5115
  • [45] A SHORT-TIME OBJECTIVE INTELLIGIBILITY MEASURE FOR TIME-FREQUENCY WEIGHTED NOISY SPEECH
    Taal, Cees H.
    Hendriks, Richard C.
    Heusdens, Richard
    Jensen, Jesper
    2010 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2010, : 4214 - 4217
  • [46] A new binary mask based on noise constraints for improved speech intelligibility
    Kim, Gibak
    Loizou, Philipos C.
    11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 3 AND 4, 2010, : 1632 - 1635
  • [47] Effects of Speech Rate, Background Noise, and Simulated Hearing Loss on Speech Rate Judgment and Speech Intelligibility in Young Listeners
    Adams, Elizabeth M.
    Moore, Robert E.
    JOURNAL OF THE AMERICAN ACADEMY OF AUDIOLOGY, 2009, 20 (01) : 28 - 39
  • [48] Investigation of objective measures for intelligibility prediction of noise-reduced speech for Chinese, Japanese, and English
    Li, Junfeng
    Xia, Risheng
    Ying, Dongwen
    Yan, Yonghong
    Akagi, Masato
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2014, 136 (06): : 3301 - 3312
  • [49] AMPLIFICATION BANDWIDTH AND INTELLIGIBILITY OF SPEECH IN QUIET AND NOISE FOR LISTENERS WITH SENSORINEURAL HEARING-LOSS
    SKINNER, MW
    MILLER, JD
    AUDIOLOGY, 1983, 22 (03): : 253 - 279
  • [50] Intelligibility of speech in noise at high presentation levels: Effects of hearing loss and frequency region
    Summers, Van
    Cord, Mary T.
    Journal of the Acoustical Society of America, 2007, 122 (02): : 1130 - 1137