Iterative speech enhancement using a non-linear dynamic state model of speech and its parameters

被引:0
|
作者
Windmann, Stefan [1 ]
Haeb-Umbach, Reinhold [1 ]
机构
[1] Univ Paderborn, Dept Commun Engn, D-33098 Paderborn, Germany
关键词
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
A marginalized particle filter is proposed for performing single channel speech enhancement with a non-linear dynamic state model. The system consists of a particle filter for tracking line spectral pair (LSP) parameters and a Kalman filter per particle for speech enhancement. The state model for the LSPs has been learnt on clean speech training data. In our approach parameters and speech samples are processed at different time scales by assuming the parameters to be constant for small blocks of data. Further enhancement is obtained by an iteration which can be applied on these small blocks. The experiments show that similar SNR gains are obtained as with the Kalman-EM-iterative algorithm. However better values of the noise level and the log-spectral distance are achieved.
引用
收藏
页码:465 / 468
页数:4
相关论文
共 50 条
  • [31] Non-Linear Speech Processing (NOLISP 2013) Preface
    Drugman, Thomas
    COMPUTER SPEECH AND LANGUAGE, 2015, 30 (01): : 1 - 2
  • [32] Non-linear independent component analysis for speech recognition
    Omar, MK
    Hasegawa-Johnson, M
    CCCT 2003, VOL6, PROCEEDINGS: COMPUTER, COMMUNICATION AND CONTROL TECHNOLOGIES: III, 2003, : 204 - 209
  • [33] Non-linear predictor based on ANN in speech coding
    Li, LS
    Sun, ZY
    Wang, AH
    Li, ZH
    2004 8TH INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION, ROBOTICS AND VISION, VOLS 1-3, 2004, : 1098 - 1103
  • [34] Non-Linear Dynamic Analysis of Inter-Word Time Intervals In Psychotic Speech
    Todder, Doron
    Avissar, Sofia
    Schreiber, Gabriel
    IEEE JOURNAL OF TRANSLATIONAL ENGINEERING IN HEALTH AND MEDICINE, 2013, 1
  • [35] Blind separation of non-linear convolved speech mixtures
    Koutras, A
    2002 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-IV, PROCEEDINGS, 2002, : 913 - 916
  • [36] NON-LINEAR SOFT-SOUNDS ENHANCEMENT FOR NEAR-END SPEECH INTELLIGIBILITY IMPROVEMENT
    Dokku, Rajyalakshmi
    2014 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2014,
  • [37] Automatic Emotion Recognition in Compressed Speech Using Acoustic and Non-Linear Features
    Garcia, N.
    Vasquez-Correa, J. C.
    Arias-Londono, J. D.
    Vargas-Bonilla, J. F.
    Orozco-Arroyave, J. R.
    2015 20TH SYMPOSIUM ON SIGNAL PROCESSING, IMAGES AND COMPUTER VISION (STSIVA), 2015,
  • [38] Single-channel Music/Speech Separation Using Non-linear Masks
    Mowlaee, P.
    Sayadian, A.
    Sheikhan, M.
    Fallah, M.
    2008 INTERNATIONAL SYMPOSIUM ON TELECOMMUNICATIONS, VOLS 1 AND 2, 2008, : 782 - +
  • [39] PITCH DETERMINATION OF SPEECH SIGNALS USING A NON-LINEAR DIGITAL-FILTER
    HESS, W
    FREQUENZ, 1980, 34 (05) : 152 - 156
  • [40] Speech Enhancement Using New Iterative Minimum Statistics Approach
    Gouhar, Tahmina
    Jaber, Nabih
    Kuntumalla, Pallavi
    2017 IEEE 30TH CANADIAN CONFERENCE ON ELECTRICAL AND COMPUTER ENGINEERING (CCECE), 2017,