Model-Based Speech Enhancement in the Modulation Domain

Cited by: 21
Authors
Wang, Yu [1]
Brookes, Mike [2]
Affiliations
[1] Univ Cambridge, Dept Engn, Cambridge CB2 1PZ, England
[2] Imperial Coll London, Dept Elect & Elect Engn, London SW7 2AZ, England
Keywords
Speech enhancement; modulation-domain Kalman filter; statistical modelling; minimum mean-square error (MMSE) estimator; spectral amplitude estimation; square error estimation; noise; suppression; estimators; quality
DOI
10.1109/TASLP.2017.2786863
Chinese Library Classification
O42 [Acoustics]
Discipline classification codes
070206; 082403
Abstract
This paper presents an algorithm for modulation-domain speech enhancement using a Kalman filter. The proposed estimator jointly models the estimated dynamics of the spectral amplitudes of speech and noise to obtain an MMSE estimate of the speech amplitude spectrum, under the assumption that speech and noise are additive in the complex domain. To include the dynamics of the noise amplitudes alongside those of the speech amplitudes, we propose a statistical "Gaussring" model that comprises a mixture of Gaussians whose centers lie on a circle in the complex plane. The performance of the proposed algorithm is evaluated using the perceptual evaluation of speech quality (PESQ) measure, the segmental SNR measure, and the short-time objective intelligibility (STOI) measure. For the speech quality measures, the proposed algorithm is shown to give a consistent improvement over a wide range of SNRs when compared to competitive algorithms. Speech recognition experiments also show that the Gaussring-model-based algorithm performs well for two types of noise.
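The "Gaussring" idea in the abstract, a mixture of Gaussians whose centers lie on a circle in the complex plane, can be illustrated with a small sampling sketch. This is not the authors' implementation: the function names, the number of components, the equal mixture weights, the equally spaced centers, and the isotropic per-component variance are all assumptions made for illustration, since the abstract does not specify them.

```python
import cmath
import math
import random


def gaussring_centers(radius, num_components):
    """Return centers equally spaced on a circle (assumed spacing)."""
    return [
        radius * cmath.exp(2j * math.pi * k / num_components)
        for k in range(num_components)
    ]


def sample_gaussring(radius, sigma, num_components, num_samples, seed=0):
    """Draw complex samples from an equal-weight mixture of isotropic
    Gaussians centered on the circle (weights and variance are assumed)."""
    rng = random.Random(seed)
    centers = gaussring_centers(radius, num_components)
    samples = []
    for _ in range(num_samples):
        # Pick one mixture component uniformly at random.
        center = rng.choice(centers)
        # Add independent Gaussian perturbations to real and imaginary parts.
        noise = complex(rng.gauss(0.0, sigma), rng.gauss(0.0, sigma))
        samples.append(center + noise)
    return samples


samples = sample_gaussring(radius=1.0, sigma=0.1, num_components=8,
                           num_samples=2000)
mean_amplitude = sum(abs(z) for z in samples) / len(samples)
mean_value = sum(samples) / len(samples)
```

With a small `sigma`, the sample amplitudes cluster around `radius` while the phases are spread around the circle, so the complex mean stays near zero even though the mean amplitude does not.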
Pages: 580-594
Page count: 15