Binaural Codebook-Based Speech Enhancement With Atomic Speech Presence Probability

Cited by: 10
Authors
Wood, Sean U. N. [1]
Stahl, Johannes K. W. [1]
Mowlaee, Pejman [1, 2]
Affiliations
[1] Graz Univ Technol, Signal Proc & Speech Commun Lab, A-8010 Graz, Austria
[2] Widex AS, DK-3540 Lynge, Denmark
Funding
Austrian Science Fund (FWF)
Keywords
Speech enhancement; Speech coding; Noise reduction; Noise measurement; Estimation; Indexes; Binaural speech enhancement; atomic speech presence probability; nonnegative matrix factorization; interaural transfer function; QUALITY ASSESSMENT; SOURCE SEPARATION; NOISE-REDUCTION; LOCALIZATION; HEARING; PRESERVATION; ENVIRONMENT; MODEL;
DOI
10.1109/TASLP.2019.2937174
Chinese Library Classification number
O42 [Acoustics]
Discipline classification codes
070206; 082403
Abstract
In this work, we present a universal codebook-based speech enhancement framework that relies on a single codebook to encode both speech and noise components. The atomic speech presence probability (ASPP) is defined as the probability that a given codebook atom encodes speech at a given point in time. We develop ASPP estimators based on binaural cues, including the interaural phase difference (IPD), the interaural level difference (ILD), and the interaural coherence magnitude (ICM), as well as a combined version that leverages the full interaural transfer function (ITF). We evaluate the performance of the resulting ASPP-based speech enhancement algorithms on binaural mixtures of reverberant speech and real-world noise. The proposed approach improves both objective speech quality and intelligibility over a wide range of input SNRs, as measured with PESQ and binaural STOI metrics, outperforming two binaural speech enhancement benchmark methods. We show that the proposed ITF-based ASPP approach achieves a good trade-off between binaural noise reduction and binaural cue preservation.
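The binaural cues named in the abstract can be illustrated with a minimal sketch. It uses common textbook definitions of ILD, IPD, and ICM for a single frequency bin; this record does not give the paper's exact definitions or its ASPP estimators, so the function name `interaural_cues`, its inputs, and the regularizer `eps` are assumptions made for illustration only.

```python
import cmath
import math


def interaural_cues(left, right, eps=1e-12):
    """Return (ILD in dB, IPD in radians, ICM) for one frequency bin.

    left, right: equal-length sequences of complex STFT coefficients of the
    left and right channels over a short run of frames (hypothetical inputs).
    eps: small constant that guards against division by zero (assumption).
    """
    n = len(left)
    # Cross- and auto-power spectra, averaged over the frames.
    cross = sum(l * r.conjugate() for l, r in zip(left, right)) / n
    p_left = sum(abs(l) ** 2 for l in left) / n
    p_right = sum(abs(r) ** 2 for r in right) / n

    # Level difference: ratio of channel powers, in decibels.
    ild_db = 10.0 * math.log10((p_left + eps) / (p_right + eps))
    # Phase difference: angle of the averaged cross-spectrum.
    ipd = cmath.phase(cross)
    # Coherence magnitude: normalized cross-spectrum magnitude, about 0 to 1.
    icm = abs(cross) / math.sqrt(p_left * p_right + eps)
    return ild_db, ipd, icm
```

For a single coherent source, the right channel is a scaled and phase-shifted copy of the left, so the ICM is close to 1; diffuse noise tends to give lower values, which is one reason coherence is a useful cue for separating speech from noise.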
Pages: 2150-2161
Number of pages: 12
Related papers
50 in total
  • [1] Codebook-Based Speech Enhancement Using Markov Process and Speech-presence Probability
    He, Qi
    Bao, Chang-chun
    Bao, Feng
    16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, 2015: 1780-1784
  • [2] Codebook-based Bayesian speech enhancement
    Srinivasan, S
    Samuelsson, J
    Kleijn, WB
    2005 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1-5: SPEECH PROCESSING, 2005: 1077-1080
  • [3] Codebook-based Bayesian speech enhancement for nonstationary environments
    Srinivasan, Sriram
    Samuelsson, Jonas
    Kleijn, W. Bastiaan
    IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2007, 15 (02): 441-452
  • [4] Speech Enhancement in Modulation Domain Using Codebook-based Speech and Noise Estimation
    Mani, Vidhyasagar
    Champagne, Benoit
    Zhu, Wei-Ping
    2015 IEEE GLOBAL CONFERENCE ON SIGNAL AND INFORMATION PROCESSING (GLOBALSIP), 2015: 707-711
  • [5] Improved Codebook-based Speech Enhancement based on MBE Model
    Huang, Qizheng
    Bao, Changchun
    Wang, Xianyun
    18TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2017), VOLS 1-6: SITUATED INTERACTION, 2017: 3627-3631
  • [6] BINAURAL SPEECH ENHANCEMENT USING A CODEBOOK BASED APPROACH
    Kavalekalam, Mathew Shaji
    Christensen, Mads Græsbøll
    Boldt, Jesper B.
    2016 IEEE INTERNATIONAL WORKSHOP ON ACOUSTIC SIGNAL ENHANCEMENT (IWAENC), 2016
  • [7] Blind Bandwidth Extension for Codebook-based Bayesian Speech Enhancement
    Li, Yaxing
    Kim, Jonghyeon
    Kang, Sangwon
    18TH IEEE INTERNATIONAL SYMPOSIUM ON CONSUMER ELECTRONICS (ISCE 2014), 2014
  • [8] Codebook-based Speech Enhancement with Bayesian LP Parameters Estimation
    Wang, Qing
    Bao, Chang-chun
    2015 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA), 2015: 1245-1248
  • [9] Real-Time Codebook-based Speech Enhancement with GPUs
    Prasanna, A. N. Sai
    Gurumurthy, Iyer Chandrashekaran
    Naidu, D. H. R.
    Baruah, Pallav Kumar
    2014 INTERNATIONAL CONFERENCE ON PARALLEL, DISTRIBUTED AND GRID COMPUTING (PDGC), 2014: 306-311
  • [10] Modeling the Temporal Evolution of LPC Parameters for Codebook-Based Speech Enhancement
    Rosenkranz, Tobias
    2009 PROCEEDINGS OF 6TH INTERNATIONAL SYMPOSIUM ON IMAGE AND SIGNAL PROCESSING AND ANALYSIS (ISPA 2009), 2009: 59-64