Speech enhancement using the modified phase-opponency model

Cited by: 8
Authors
Deshmukh, Om D. [1]
Espy-Wilson, Carol Y.
Carney, Laurel H.
Affiliations
[1] Univ Maryland, Dept Elect & Comp Engn, College Pk, MD 20742 USA
[2] Univ Maryland, Syst Res Inst, College Pk, MD 20742 USA
[3] Syracuse Univ, Dept Biomed & Chem Engn, Syracuse, NY 13244 USA
[4] Syracuse Univ, Inst Sensory Res, Syracuse, NY 13244 USA
DOI
10.1121/1.2714913
Chinese Library Classification number
O42 [Acoustics];
Discipline classification code
070206 ; 082403 ;
Abstract
This paper presents the Modified Phase-Opponency (MPO) model for single-channel enhancement of speech corrupted by additive noise. The MPO model is based on the auditory PO model, which was proposed for the detection of tones in noise. The PO model includes a physiologically realistic mechanism for processing the information in neural discharge times. It exploits the frequency-dependent phase properties of the tuned filters in the auditory periphery, using coincidence detection across auditory-nerve fibers to extract temporal cues. The MPO model alters the components of the PO model so that its basic functionality is maintained while the properties of the model can be analyzed and modified independently. The MPO-based speech enhancement scheme neither estimates the noise characteristics nor assumes that the noise satisfies any statistical model. When the speech signals are corrupted by fluctuating noise, the MPO technique yields the lowest values of the LPC-based objective measures and the highest value of the perceptual evaluation of speech quality measure among the methods compared. Combining the MPO speech enhancement technique with our aperiodicity, periodicity, and pitch detector further improves its performance. © 2007 Acoustical Society of America.
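The phase-opponency idea summarized in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation. The two-path structure (a band-pass path and an all-pass path), the filter shapes, and the parameters `q` and `bw_hz` are all assumptions chosen for illustration, and the name `po_like_correlation` is hypothetical. The two paths are about 180 degrees out of phase at the center frequency `fc`, so the mean product of their outputs (a simple stand-in for coincidence detection) is negative for a tone at `fc` and tends to be positive for wideband noise.

```python
import numpy as np

def po_like_correlation(x, fs, fc, q=20.0, bw_hz=100.0):
    """Mean product of two filter-path outputs (illustrative only).

    Path 1: zero-phase Gaussian band-pass centered at fc (assumed shape).
    Path 2: second-order all-pass with a 180-degree phase shift at fc.
    Both filters are applied in the frequency domain for brevity.
    """
    n = len(x)
    f = np.fft.rfftfreq(n, d=1.0 / fs)
    spectrum = np.fft.rfft(x)

    # Path 1: band-pass magnitude, no phase shift.
    bpf = np.exp(-0.5 * ((f - fc) / bw_hz) ** 2)

    # Path 2: unit-magnitude all-pass; equals -1 at f == fc, about +1 far away.
    a = fc**2 - f**2
    b = f * fc / q
    apf = (a - 1j * b) / (a + 1j * b)

    y_bpf = np.fft.irfft(spectrum * bpf, n=n)
    y_apf = np.fft.irfft(spectrum * apf, n=n)
    return float(np.mean(y_bpf * y_apf))

fs, fc = 8000, 1000.0
t = np.arange(fs) / fs                      # one second of samples
rng = np.random.default_rng(0)
tone = np.sin(2 * np.pi * fc * t)
noise = rng.standard_normal(fs)

print("tone only:   ", po_like_correlation(tone, fs, fc))
print("noise only:  ", po_like_correlation(noise, fs, fc))
print("tone + noise:", po_like_correlation(tone + noise, fs, fc))
```

In this sketch the sign of the output separates the cases without any estimate of the noise level, which mirrors the claim in the abstract that the scheme does not need to estimate the noise characteristics. A practical enhancement system would use a bank of such channels and short-time analysis, neither of which is shown here.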
Pages: 3886 - 3898
Page count: 13
Related papers
50 in total
  • [41] A Noise Robust Speech Recognition Method Using Model Compensation Based on Speech Enhancement
    Shen, Guanghu
    Jung, Ho-Youl
    Chung, Hyun-Yeol
    JOURNAL OF THE ACOUSTICAL SOCIETY OF KOREA, 2008, 27 (04): : 191 - 199
  • [42] PHASE ESTIMATION IN SINGLE-CHANNEL SPEECH ENHANCEMENT USING PHASE INVARIANCE CONSTRAINTS
    Pirolt, Michael
    Stahl, Johannes
    Mowlaee, Pejman
    Vorobiov, Vasili I.
    Barysenka, Siarhei Y.
    Davydov, Andrew G.
    2017 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2017, : 5585 - 5589
  • [43] Auditory phase opponency: A temporal model for masked detection at low frequencies
    Carney, LH
    Heinz, MG
    Evilsizer, ME
    Gilkey, RH
    Colburn, HS
    ACTA ACUSTICA UNITED WITH ACUSTICA, 2002, 88 (03) : 334 - 347
  • [44] RESTORATION OF INSTANTANEOUS AMPLITUDE AND PHASE USING KALMAN FILTER FOR SPEECH ENHANCEMENT
    Nower, Naushin
    Liu, Yang
    Unoki, Masashi
    2014 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2014,
  • [46] New approaches to speech enhancement using phase correction in Wiener filtering
    Fardkhaleghi P.
    Savoji M.H.
    2010 5th International Symposium on Telecommunications, IST 2010, 2010, : 895 - 899
  • [48] A Modified Speech Enhancement Algorithm Based on the Subspace
    Jia, Hairong
    Zhang, Xueying
    Jin, Chensheng
    2009 SECOND INTERNATIONAL SYMPOSIUM ON KNOWLEDGE ACQUISITION AND MODELING: KAM 2009, VOL 3, 2009, : 344 - 347
  • [49] IRM WITH PHASE PARAMETERIZATION FOR SPEECH ENHANCEMENT
    Wang, Xianyun
    Bao, Changchun
    Cheng, Rui
    2019 IEEE WORKSHOP ON APPLICATIONS OF SIGNAL PROCESSING TO AUDIO AND ACOUSTICS (WASPAA), 2019, : 209 - 213
  • [50] SPEECH ENHANCEMENT AND THE INSTANTANEOUS PHASE SIGNAL
    WALSH, SJ
    CLARKSON, PM
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1987, 81 : S80 - S80