Iterative speech enhancement using a non-linear dynamic state model of speech and its parameters

被引：0

作者：

Windmann, Stefan ^{[1
]}

Haeb-Umbach, Reinhold ^{[1
]}

机构：

[1] Univ Paderborn, Dept Commun Engn, D-33098 Paderborn, Germany

来源：

2006 IEEE International Conference on Acoustics, Speech and Signal Processing, Vols 1-13 | 2006年

关键词：

D O I：

暂无

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

A marginalized particle filter is proposed for performing single channel speech enhancement with a non-linear dynamic state model. The system consists of a particle filter for tracking line spectral pair (LSP) parameters and a Kalman filter per particle for speech enhancement. The state model for the LSPs has been learnt on clean speech training data. In our approach parameters and speech samples are processed at different time scales by assuming the parameters to be constant for small blocks of data. Further enhancement is obtained by an iteration which can be applied on these small blocks. The experiments show that similar SNR gains are obtained as with the Kalman-EM-iterative algorithm. However better values of the noise level and the log-spectral distance are achieved.

引用

页码：465 / 468

页数：4

共 50 条

[31] Non-Linear Speech Processing (NOLISP 2013) Preface
Drugman, Thomas
COMPUTER SPEECH AND LANGUAGE, 2015, 30 (01): : 1 - 2
[32] Non-linear independent component analysis for speech recognition
Omar, MK
Hasegawa-Johnson, M
CCCT 2003, VOL6, PROCEEDINGS: COMPUTER, COMMUNICATION AND CONTROL TECHNOLOGIES: III, 2003, : 204 - 209
[33] Non-linear predictor based on ANN in speech coding
Li, LS
Sun, ZY
Wang, AH
Li, ZH
2004 8TH INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION, ROBOTICS AND VISION, VOLS 1-3, 2004, : 1098 - 1103
[34] Non-Linear Dynamic Analysis of Inter-Word Time Intervals In Psychotic Speech
Todder, Doron
Avissar, Sofia
Schreiber, Gabriel
IEEE JOURNAL OF TRANSLATIONAL ENGINEERING IN HEALTH AND MEDICINE, 2013, 1
[35] Blind separation of non-linear convolved speech mixtures
Koutras, A
2002 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-IV, PROCEEDINGS, 2002, : 913 - 916
[36] NON-LINEAR SOFT-SOUNDS ENHANCEMENT FOR NEAR-END SPEECH INTELLIGIBILITY IMPROVEMENT
Dokku, Rajyalakshmi
2014 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2014,
[37] Automatic Emotion Recognition in Compressed Speech Using Acoustic and Non-Linear Features
Garcia, N.
Vasquez-Correa, J. C.
Arias-Londono, J. D.
Vargas-Bonilla, J. F.
Orozco-Arroyave, J. R.
2015 20TH SYMPOSIUM ON SIGNAL PROCESSING, IMAGES AND COMPUTER VISION (STSIVA), 2015,
[38] Single-channel Music/Speech Separation Using Non-linear Masks
Mowlaee, P.
Sayadian, A.
Sheikhan, M.
Fallah, M.
2008 INTERNATIONAL SYMPOSIUM ON TELECOMMUNICATIONS, VOLS 1 AND 2, 2008, : 782 - +
[39] PITCH DETERMINATION OF SPEECH SIGNALS USING A NON-LINEAR DIGITAL-FILTER
HESS, W
FREQUENZ, 1980, 34 (05) : 152 - 156
[40] Speech Enhancement Using New Iterative Minimum Statistics Approach
Gouhar, Tahmina
Jaber, Nabih
Kuntumalla, Pallavi
2017 IEEE 30TH CANADIAN CONFERENCE ON ELECTRICAL AND COMPUTER ENGINEERING (CCECE), 2017,

← 1 2 3 4 5 →