SNR loss: A new objective measure for predicting the intelligibility of noise-suppressed speech

被引:66
|
作者
Ma, Jianfen [1 ,2 ]
Loizou, Philipos C. [1 ]
机构
[1] Univ Texas Dallas, Dept Elect Engn, Richardson, TX 75083 USA
[2] Taiyuan Univ Technol, Taiyuan 030024, Shanxi, Peoples R China
关键词
Speech intelligibility; Speech enhancement; Speech intelligibility indices; RECEPTION THRESHOLD; SUBSPACE APPROACH; ENHANCEMENT; PARAMETERS; REDUCTION; COHERENCE; INDEX;
D O I
10.1016/j.specom.2010.10.005
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Most of the existing intelligibility measures do not account for the distortions present in processed speech, such as those introduced by speech-enhancement algorithms. In the present study, we propose three new objective measures that can be used for prediction of intelligibility of processed (e.g., via an enhancement algorithm) speech in noisy conditions. All three measures use a critical-band spectral representation of the clean and noise-suppressed signals and are based on the measurement of the SNR loss incurred in each critical band after the corrupted signal goes through a speech enhancement algorithm. The proposed measures are flexible in that they can provide different weights to the two types of spectral distortions introduced by enhancement algorithms, namely spectral attenuation and spectral amplification distortions. The proposed measures were evaluated with intelligibility scores obtained by normal-hearing listeners in 72 noisy conditions involving noise-suppressed speech (consonants and sentences) corrupted by four different maskers (car, babble, train and street interferences). Highest correlation (r = -0.85) with sentence recognition scores was obtained using a variant of the SNR loss measure that only included vowel/consonant transitions and weak consonant information. High correlation was maintained for all noise types, with a maximum correlation (r = -0.88) achieved in street noise conditions. (C) 2010 Elsevier B.V. All rights reserved.
引用
收藏
页码:340 / 354
页数:15
相关论文
共 50 条
  • [1] Analysis of a simplified normalized covariance measure based on binary weighting functions for predicting the intelligibility of noise-suppressed speech
    Chen, Fei
    Loizou, Philipos C.
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2010, 128 (06): : 3715 - 3723
  • [2] A New Data-driven Band-weighting Function for Predicting the Intelligibility of Noise-suppressed Speech
    Liu, Zexin
    Ma, Heather T.
    Chen, Fei
    2017 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC 2017), 2017, : 492 - 496
  • [3] Objective measures for quality assessment of noise-suppressed speech
    Ding, Huijun
    Lee, Tan
    Soon, Ing Yann
    Yee, Chai Kiat
    Dai, Peng
    Dan, Guo
    SPEECH COMMUNICATION, 2015, 71 : 62 - 73
  • [4] A Hilbert-fine-structure-derived physical metric for predicting the intelligibility of noise-distorted and noise-suppressed speech
    Chen, Fei
    Wong, Lena L. N.
    Hu, Yi
    SPEECH COMMUNICATION, 2013, 55 (10) : 1011 - 1020
  • [5] Non-intrusive Intelligibility Prediction of Noise-suppressed Speech Based on Neural Network
    Ye, Fuqiang
    Liu, Zexin
    Chen, Fei
    2022 31ST WIRELESS AND OPTICAL COMMUNICATIONS CONFERENCE (WOCC), 2022, : 171 - 174
  • [6] ASSESSING THE SEGMENTAL CONTRIBUTION TO THE NON-INTRUSIVE INTELLIGIBILITY PREDICTION OF NOISE-SUPPRESSED SPEECH
    Wang, Lei
    Chen, Fei
    2016 IEEE INTERNATIONAL WORKSHOP ON ACOUSTIC SIGNAL ENHANCEMENT (IWAENC), 2016,
  • [7] A NEW MASK-BASED OBJECTIVE MEASURE FOR PREDICTING THE INTELLIGIBILITY OF BINARY MASKED SPEECH
    Yu, Chengzhu
    Wojcicki, Kamil K.
    Loizou, P. C.
    Hansen, John H. L.
    2013 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2013, : 7030 - 7033
  • [8] Computer Speech Recognition as an Objective Measure of Intelligibility
    McHenry, Monica A.
    LaConte, Stephen M.
    JOURNAL OF MEDICAL SPEECH-LANGUAGE PATHOLOGY, 2010, 18 (04) : 99 - 103
  • [9] Objective Measures of Perceptual Quality for Predicting Speech Intelligibility in Sensorineural Hearing Loss
    Chiaramello, E.
    Moriconi, S.
    Tognola, G.
    2015 37TH ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY (EMBC), 2015, : 5577 - 5580
  • [10] Predicting the Intelligibility of Cochlear-implant Vocoded Speech from Objective Quality Measure
    Chen, Fei
    JOURNAL OF MEDICAL AND BIOLOGICAL ENGINEERING, 2012, 32 (03) : 189 - 193