Binaural Codebook-Based Speech Enhancement With Atomic Speech Presence Probability

被引：10

作者：

Wood, Sean U. N. ^{[1
]}

Stahl, Johannes K. W. ^{[1
]}

Mowlaee, Pejman ^{[1
,2
]}

机构：

[1] Graz Univ Technol, Signal Proc & Speech Commun Lab, A-8010 Graz, Austria

[2] Widex AS, DK-3540 Lynge, Denmark

来源：

IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING | 2019年 / 27卷 / 12期

基金：

奥地利科学基金会;

关键词：

Speech enhancement; Speech coding; Noise reduction; Noise measurement; Estimation; Indexes; Binaural speech enhancement; atomic speech presence probability; nonnegative matrix factorization; interaural transfer function; QUALITY ASSESSMENT; SOURCE SEPARATION; NOISE-REDUCTION; LOCALIZATION; HEARING; PRESERVATION; ENVIRONMENT; MODEL;

D O I：

10.1109/TASLP.2019.2937174

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

In this work, we present a universal codebook-based speech enhancement framework that relies on a single codebook to encode both speech and noise components. The atomic speech presence probability (ASPP) is defined as the probability that a given codebook atom encodes speech at a given point in time. We develop ASPP estimators based on binaural cues including the interaural phase and level difference (IPD and ILD), the interaural coherence magnitude (ICM), as well as a combined version leveraging the full interaural transfer function (ITF). We evaluate the performance of the resulting ASPP-based speech enhancement algorithms on binaural mixtures of reverberant speech and real-world noise. The proposed approach improves both objective speech quality and intelligibility over a wide range of input SNR, as measured with PESQ and binaural STOI metrics, outperforming two binaural speech enhancement benchmark methods. We show that the proposed ITF-based ASPP approach achieves a good balance of the trade-off between binaural noise reduction and binaural cue preservation.

引用

页码：2150 / 2161

页数：12

共 50 条

[1] Codebook-Based Speech Enhancement Using Markov Process and Speech-presence Probability
He, Qi
Bao, Chang-chun
Bao, Feng
16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, 2015, : 1780 - 1784
[2] Codebook-based Bayesian speech enhancement
Srinivasan, S
Samuelsson, J
Kleijn, WB
2005 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1-5: SPEECH PROCESSING, 2005, : 1077 - 1080
[3] Codebook-based Bayesian speech enhancement for nonstationary environments
Srinivasan, Sriram
Samuelsson, Jonas
Kleijn, W. Bastiaan
IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2007, 15 (02): : 441 - 452
[4] Speech Enhancement in Modulation Domain Using Codebook-based Speech and Noise Estimation
Mani, Vidhyasagar
Champagne, Benoit
Zhu, Wei-Ping
2015 IEEE GLOBAL CONFERENCE ON SIGNAL AND INFORMATION PROCESSING (GLOBALSIP), 2015, : 707 - 711
[5] Improved Codebook-based Speech Enhancement based on MBE Model
Huang, Qizheng
Bao, Changchun
Wang, Xianyun
18TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2017), VOLS 1-6: SITUATED INTERACTION, 2017, : 3627 - 3631
[6] BINAURAL SPEECH ENHANCEMENT USING A CODEBOOK BASED APPROACH
Kavalekalam, Mathew Shaji
Christensen, Mads Grsboll
Boldt, Jesper B.
2016 IEEE INTERNATIONAL WORKSHOP ON ACOUSTIC SIGNAL ENHANCEMENT (IWAENC), 2016,
[7] Blind Bandwidth Extension for Codebook-based Bayesian Speech Enhancement
Li, Yaxing
Kim, Jonghyeon
Kang, Sangwon
18TH IEEE INTERNATIONAL SYMPOSIUM ON CONSUMER ELECTRONICS (ISCE 2014), 2014,
[8] Codebook-based Speech Enhancement with Bayesian LP Parameters Estimation
Wang, Qing
Bao, Chang-chun
2015 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA), 2015, : 1245 - 1248
[9] Real-Time Codebook-based Speech Enhancement with GPUs
Prasanna, A. N. Sai
Gurumurthyt, Iver Chandrashekaran
Naidu, D. H. R.
Baruith, Pallav Kuniar
2014 INTERNATIONAL CONFERENCE ON PARALLEL, DISTRIBUTED AND GRID COMPUTING (PDGC), 2014, : 306 - 311
[10] Modeling the Temporal Evolution of LPC Parameters for Codebook-Based Speech Enhancement
Rosenkranz, Tobias
2009 PROCEEDINGS OF 6TH INTERNATIONAL SYMPOSIUM ON IMAGE AND SIGNAL PROCESSING AND ANALYSIS (ISPA 2009), 2009, : 59 - 64

← 1 2 3 4 5 →