Model-Based Speech Enhancement in the Modulation Domain

被引：21

作者：

Wang, Yu ^{[1
]}

Brookes, Mike ^{[2
]}

机构：

[1] Univ Cambridge, Dept Engn, Cambridge CB2 1PZ, England

[2] Imperial Coll London, Dept Elect & Elect Engn, London SW7 2AZ, England

来源：

IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING | 2018年 / 26卷 / 03期

关键词：

Speech enhancement; modulation-domain Kalman filter; statistical modelling; minimum mean-square error (MMSE) estimator; SPECTRAL AMPLITUDE ESTIMATION; SQUARE ERROR ESTIMATION; NOISE; SUPPRESSION; ESTIMATORS; QUALITY;

D O I：

10.1109/TASLP.2017.2786863

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

This paper presents an algorithm for modulation-domain speech enhancement using a Kalman filter. The proposed estimator jointly models the estimated dynamics of the spectral amplitudes of speech and noise to obtain an MMSE estimation of the speech amplitude spectrum with the assumption that the speech and noise are additive in the complex domain. In order to include the dynamics of noise amplitudes with those of speech amplitudes, we propose a statistical "Gaussring" model that comprises a mixture of Gaussians whose centers lie in a circle on the complex plane. The performance of the proposed algorithm is evaluated using the perceptual evaluation of speech quality measure, segmental SNR measure, and short-time objective intelligibility measure. For speech quality measures, the proposed algorithm is shown to give a consistent improvement over a wide range of SNRs when compared to competitive algorithms. Speech recognition experiments also showthat the Gaussring-model-based algorithm performs well for two types of noise.

引用

页码：580 / 594

页数：15

共 50 条

[1] Adaptive model-based speech enhancement
Logan, B
Robinson, T
SPEECH COMMUNICATION, 2001, 34 (04) : 351 - 368
[2] INDIRECT MODEL-BASED SPEECH ENHANCEMENT
Le Roux, Jonathan
Hershey, John R.
2012 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2012, : 4045 - 4048
[3] NOISE IDENTIFICATION FOR MODEL-BASED SPEECH ENHANCEMENT
Jiang Wenbin
Ying Rendong
Liu Peilin
2014 12TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING (ICSP), 2014, : 478 - 483
[4] Model-based eigenspectrum estimation for speech enhancement
Bhunjun, Vinesh
Brookes, Mike
Naylor, Patrick
2006 FORTIETH ASILOMAR CONFERENCE ON SIGNALS, SYSTEMS AND COMPUTERS, VOLS 1-5, 2006, : 1331 - +
[5] ON THE INFLUENCE OF INHARMONICITIES IN MODEL-BASED SPEECH ENHANCEMENT
Norholm, Sidsel Marie
Jensen, Jesper Rindom
Christensen, Mads Graesboll
2013 PROCEEDINGS OF THE 21ST EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2013,
[6] Model-Based Speech Enhancement for Automotive Applications
Krini, Mohamed
Schmidt, Gerhard
2009 PROCEEDINGS OF 6TH INTERNATIONAL SYMPOSIUM ON IMAGE AND SIGNAL PROCESSING AND ANALYSIS (ISPA 2009), 2009, : 638 - 643
[7] Compressive speech enhancement in the modulation domain
Low, Siow Yong
SPEECH COMMUNICATION, 2018, 102 : 87 - 99
[8] A Model-Based Soft Decision Approach for Speech Enhancement
Xianyun Wang
Changchun Bao
Feng Bao
中国通信, 2017, 14 (09) : 11 - 22
[9] Spectral difference for statistical model-based speech enhancement in speech recognition
Soojeong Lee
Joon-Hyuk Chang
Multimedia Tools and Applications, 2017, 76 : 24917 - 24929
[10] Model-Based Feature Enhancement for Reverberant Speech Recognition
Krueger, Alexander
Haeb-Umbach, Reinhold
IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2010, 18 (07): : 1692 - 1707

← 1 2 3 4 5 →