Speech enhancement using the modified phase-opponency model

被引:8
|
作者
Deshmukh, Om D. [1 ]
Espy-Wilson, Carol Y.
Carney, Laurel H.
机构
[1] Univ Maryland, Dept Elect & Comp Engn, College Pk, MD 20742 USA
[2] Univ Maryland, Syst Res Inst, College Pk, MD 20742 USA
[3] Syracuse Univ, Dept Biomed & Chem Engn, Syracuse, NY 13244 USA
[4] Syracuse Univ, Inst Sensory Res, Syracuse, NY 13244 USA
来源
关键词
D O I
10.1121/1.2714913
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
In this paper we present a model called the Modified Phase-Opponency (MPO) model for single-channel speech enhancement when the speech is corrupted by additive noise. The MPO,model is based on the auditory PO model, proposed for detection of tones in noise. The PO model includes a physiologically realistic mechanism for processing the information in neural discharge times and exploits the frequency-dependent phase properties of the tuned filters in the auditory periphery by using a cross-auditory-nerve-fiber coincidence detection for extracting temporal cues. The MPO model alters the components of the PO model such that the basic functionality of the PO model is maintained but the properties of the model can be analyzed and modified independently. The MPO-based speech enhancement scheme does not need to estimate the noise characteristics nor does it assume that the noise satisfies any statistical model. The MPO technique leads to the lowest value of the LPC-based objective measures and the highest value of the perceptual evaluation of speech quality measure compared to other methods when the speech signals are corrupted by fluctuating noise. Combining the MPO speech enhancement technique with our aperiodicity, periodicity, and pitch detector further improves its performance. (c) 2007 Acoustical Society of America.
引用
收藏
页码:3886 / 3898
页数:13
相关论文
共 50 条
  • [1] Speech Enhancement Using Modified Phase Opponency Model
    Deshmukh, Om D.
    Espy-Wilson, Carol Y.
    INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5, 2006, : 269 - +
  • [2] Modified Phase Opponency Based Solution To The Speech Separation Challenge
    Deshmukh, Om D.
    Espy-Wilson, Carol Y.
    INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5, 2006, : 101 - +
  • [3] Speech Enhancement Using Modified Magnitude and Phase Spectra
    Hossain, Sk. Imran
    Chowdhury, Md. Fahim Hossain
    Amin, Md. Faijul
    Murase, Kazuyuki
    2013 INTERNATIONAL CONFERENCE ON ELECTRICAL INFORMATION AND COMMUNICATION TECHNOLOGY (EICT), 2013,
  • [4] A Modified Speech Enhancement Algorithm Using A Universal Speaker Model
    Guo, Li
    Jiang, Wenbin
    Ying, Rendong
    Liu, Peilin
    2014 12TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING (ICSP), 2014, : 521 - 526
  • [5] Speech Enhancement Using Modified MMSE-LSA and Phase Reconstruction in Voiced and Unvoiced Speech
    Jia, Hairong
    Wang, Weimei
    Wang, Dong
    Zhang, Xueying
    INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2019, 33 (02)
  • [6] Phase-based dual-microphone speech enhancement using a prior speech model
    Shi, Guangji
    Aarabi, Parham
    Jiang, Hui
    IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2007, 15 (01): : 109 - 118
  • [7] SPEECH ENHANCEMENT USING ARCH MODEL
    Atkins, Aviva
    Cohen, Israel
    2016 IEEE INTERNATIONAL WORKSHOP ON ACOUSTIC SIGNAL ENHANCEMENT (IWAENC), 2016,
  • [8] Speech Enhancement with Phase Correction based on Modified DNN Architecture
    Cheng, Rui
    Bao, Changchun
    Xiang, Yang
    2018 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2018, : 1222 - 1227
  • [9] Speech enhancement using modified IMCRA and OMLSA methods
    Tien Dung Tran
    Quoc Cuong Nguyen
    Dang Khoa Nguyen
    2010 THIRD INTERNATIONAL CONFERENCE ON COMMUNICATIONS AND ELECTRONICS (ICCE), 2010, : 195 - 200
  • [10] Esophageal Speech Enhancement using Modified Voicing Source
    Ishaq, Rizwan
    Zapirain, Begona Garcia
    2013 IEEE INTERNATIONAL SYMPOSIUM ON SIGNAL PROCESSING AND INFORMATION TECHNOLOGY (IEEE ISSPIT 2013), 2013, : 210 - 214