Speech enhancement using the modified phase-opponency model

被引:8
|
作者
Deshmukh, Om D. [1 ]
Espy-Wilson, Carol Y.
Carney, Laurel H.
机构
[1] Univ Maryland, Dept Elect & Comp Engn, College Pk, MD 20742 USA
[2] Univ Maryland, Syst Res Inst, College Pk, MD 20742 USA
[3] Syracuse Univ, Dept Biomed & Chem Engn, Syracuse, NY 13244 USA
[4] Syracuse Univ, Inst Sensory Res, Syracuse, NY 13244 USA
来源
关键词
D O I
10.1121/1.2714913
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
In this paper we present a model called the Modified Phase-Opponency (MPO) model for single-channel speech enhancement when the speech is corrupted by additive noise. The MPO,model is based on the auditory PO model, proposed for detection of tones in noise. The PO model includes a physiologically realistic mechanism for processing the information in neural discharge times and exploits the frequency-dependent phase properties of the tuned filters in the auditory periphery by using a cross-auditory-nerve-fiber coincidence detection for extracting temporal cues. The MPO model alters the components of the PO model such that the basic functionality of the PO model is maintained but the properties of the model can be analyzed and modified independently. The MPO-based speech enhancement scheme does not need to estimate the noise characteristics nor does it assume that the noise satisfies any statistical model. The MPO technique leads to the lowest value of the LPC-based objective measures and the highest value of the perceptual evaluation of speech quality measure compared to other methods when the speech signals are corrupted by fluctuating noise. Combining the MPO speech enhancement technique with our aperiodicity, periodicity, and pitch detector further improves its performance. (c) 2007 Acoustical Society of America.
引用
收藏
页码:3886 / 3898
页数:13
相关论文
共 50 条
  • [31] Speech Enhancement Method with Geometric Phase Estimation By Incorporating MIXMAX Model
    Wang, Xianyun
    Bao, Changchun
    2016 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA), 2016,
  • [32] Speech enhancement based on the modified phase using signal-to-noise ratio information and time-frequency characteristics
    Jia H.
    Wang W.
    Ji H.
    Xi'an Dianzi Keji Daxue Xuebao/Journal of Xidian University, 2019, 46 (05): : 162 - 170
  • [33] Spoofing Speech Detection Using Modified Relative Phase Information
    Wang, Longbiao
    Nakagawa, Seiichi
    Zhang, Zhaofeng
    Yoshida, Yohei
    Kawakami, Yuta
    IEEE JOURNAL OF SELECTED TOPICS IN SIGNAL PROCESSING, 2017, 11 (04) : 660 - 670
  • [34] Enhancement of esophagus speech using harmonic plus noise model
    Lehana, PK
    Gupta, RK
    Kumari, S
    TENCON 2004 - 2004 IEEE REGION 10 CONFERENCE, VOLS A-D, PROCEEDINGS: ANALOG AND DIGITAL TECHNIQUES IN ELECTRICAL ENGINEERING, 2004, : A669 - A672
  • [35] Improved perceptually inspired speech enhancement using a psychoacoustic model
    Hu, RQ
    Anderson, DV
    CONFERENCE RECORD OF THE THIRTY-EIGHTH ASILOMAR CONFERENCE ON SIGNALS, SYSTEMS & COMPUTERS, VOLS 1 AND 2, 2004, : 440 - 444
  • [36] Subspace Based Speech Enhancement Using Gaussian Mixture Model
    Kundu, Achintya
    Chatterjee, Saikat
    Sreenivas, T. V.
    INTERSPEECH 2008: 9TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2008, VOLS 1-5, 2008, : 395 - 398
  • [37] MMSE-BASED SPEECH ENHANCEMENT USING THE HARMONIC MODEL
    Stark, Yair
    Tabrikian, Joseph
    2008 IEEE 25TH CONVENTION OF ELECTRICAL AND ELECTRONICS ENGINEERS IN ISRAEL, VOLS 1 AND 2, 2008, : 616 - 620
  • [38] DNN-BASED SPEECH ENHANCEMENT USING MBE MODEL
    Huang, Qizheng
    Bao, Changchun
    Wang, Xianyun
    Xiang, Yang
    2018 16TH INTERNATIONAL WORKSHOP ON ACOUSTIC SIGNAL ENHANCEMENT (IWAENC), 2018, : 196 - 200
  • [39] Auditory phase opponency: A temporal model for masked detection at low frequencies
    Carney, Laurel H.
    Heinz, Michael G.
    Evilsizer, Mary E.
    Gilkey, Robert H.
    Colburn, H. Steven
    Acta Acustica united with Acustica, 2002, 88 (03): : 334 - 347
  • [40] An Enhancement of Japanese Acoustic Model using Korean Speech Database
    Lee, Minkyu
    Kim, Sanghun
    JOURNAL OF THE ACOUSTICAL SOCIETY OF KOREA, 2013, 32 (05): : 438 - 445