Speech enhancement using the modified phase-opponency model

被引：8

作者：

Deshmukh, Om D. ^{[1
]}

Espy-Wilson, Carol Y.

Carney, Laurel H.

机构：

[1] Univ Maryland, Dept Elect & Comp Engn, College Pk, MD 20742 USA

[2] Univ Maryland, Syst Res Inst, College Pk, MD 20742 USA

[3] Syracuse Univ, Dept Biomed & Chem Engn, Syracuse, NY 13244 USA

[4] Syracuse Univ, Inst Sensory Res, Syracuse, NY 13244 USA

来源：

JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA | 2007年 / 121卷 / 06期

关键词：

D O I：

10.1121/1.2714913

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

In this paper we present a model called the Modified Phase-Opponency (MPO) model for single-channel speech enhancement when the speech is corrupted by additive noise. The MPO,model is based on the auditory PO model, proposed for detection of tones in noise. The PO model includes a physiologically realistic mechanism for processing the information in neural discharge times and exploits the frequency-dependent phase properties of the tuned filters in the auditory periphery by using a cross-auditory-nerve-fiber coincidence detection for extracting temporal cues. The MPO model alters the components of the PO model such that the basic functionality of the PO model is maintained but the properties of the model can be analyzed and modified independently. The MPO-based speech enhancement scheme does not need to estimate the noise characteristics nor does it assume that the noise satisfies any statistical model. The MPO technique leads to the lowest value of the LPC-based objective measures and the highest value of the perceptual evaluation of speech quality measure compared to other methods when the speech signals are corrupted by fluctuating noise. Combining the MPO speech enhancement technique with our aperiodicity, periodicity, and pitch detector further improves its performance. (c) 2007 Acoustical Society of America.

引用

页码：3886 / 3898

页数：13

共 50 条

[31] Speech Enhancement Method with Geometric Phase Estimation By Incorporating MIXMAX Model
Wang, Xianyun
Bao, Changchun
2016 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA), 2016,
[32] Speech enhancement based on the modified phase using signal-to-noise ratio information and time-frequency characteristics
Jia H.
Wang W.
Ji H.
Xi'an Dianzi Keji Daxue Xuebao/Journal of Xidian University, 2019, 46 (05): : 162 - 170
[33] Spoofing Speech Detection Using Modified Relative Phase Information
Wang, Longbiao
Nakagawa, Seiichi
Zhang, Zhaofeng
Yoshida, Yohei
Kawakami, Yuta
IEEE JOURNAL OF SELECTED TOPICS IN SIGNAL PROCESSING, 2017, 11 (04) : 660 - 670
[34] Enhancement of esophagus speech using harmonic plus noise model
Lehana, PK
Gupta, RK
Kumari, S
TENCON 2004 - 2004 IEEE REGION 10 CONFERENCE, VOLS A-D, PROCEEDINGS: ANALOG AND DIGITAL TECHNIQUES IN ELECTRICAL ENGINEERING, 2004, : A669 - A672
[35] Improved perceptually inspired speech enhancement using a psychoacoustic model
Hu, RQ
Anderson, DV
CONFERENCE RECORD OF THE THIRTY-EIGHTH ASILOMAR CONFERENCE ON SIGNALS, SYSTEMS & COMPUTERS, VOLS 1 AND 2, 2004, : 440 - 444
[36] Subspace Based Speech Enhancement Using Gaussian Mixture Model
Kundu, Achintya
Chatterjee, Saikat
Sreenivas, T. V.
INTERSPEECH 2008: 9TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2008, VOLS 1-5, 2008, : 395 - 398
[37] MMSE-BASED SPEECH ENHANCEMENT USING THE HARMONIC MODEL
Stark, Yair
Tabrikian, Joseph
2008 IEEE 25TH CONVENTION OF ELECTRICAL AND ELECTRONICS ENGINEERS IN ISRAEL, VOLS 1 AND 2, 2008, : 616 - 620
[38] DNN-BASED SPEECH ENHANCEMENT USING MBE MODEL
Huang, Qizheng
Bao, Changchun
Wang, Xianyun
Xiang, Yang
2018 16TH INTERNATIONAL WORKSHOP ON ACOUSTIC SIGNAL ENHANCEMENT (IWAENC), 2018, : 196 - 200
[39] Auditory phase opponency: A temporal model for masked detection at low frequencies
Carney, Laurel H.
Heinz, Michael G.
Evilsizer, Mary E.
Gilkey, Robert H.
Colburn, H. Steven
Acta Acustica united with Acustica, 2002, 88 (03): : 334 - 347
[40] An Enhancement of Japanese Acoustic Model using Korean Speech Database
Lee, Minkyu
Kim, Sanghun
JOURNAL OF THE ACOUSTICAL SOCIETY OF KOREA, 2013, 32 (05): : 438 - 445

← 1 2 3 4 5 →