Binaural Codebook-Based Speech Enhancement With Atomic Speech Presence Probability

Cited by: 10
Authors
Wood, Sean U. N. [1]
Stahl, Johannes K. W. [1]
Mowlaee, Pejman [1, 2]
Affiliations
[1] Graz Univ Technol, Signal Proc & Speech Commun Lab, A-8010 Graz, Austria
[2] Widex AS, DK-3540 Lynge, Denmark
Funding
Austrian Science Fund;
Keywords
Speech enhancement; Speech coding; Noise reduction; Noise measurement; Estimation; Indexes; Binaural speech enhancement; atomic speech presence probability; nonnegative matrix factorization; interaural transfer function; QUALITY ASSESSMENT; SOURCE SEPARATION; NOISE-REDUCTION; LOCALIZATION; HEARING; PRESERVATION; ENVIRONMENT; MODEL;
DOI
10.1109/TASLP.2019.2937174
Chinese Library Classification
O42 [Acoustics];
Subject classification codes
070206 ; 082403 ;
Abstract
In this work, we present a universal codebook-based speech enhancement framework that relies on a single codebook to encode both speech and noise components. The atomic speech presence probability (ASPP) is defined as the probability that a given codebook atom encodes speech at a given point in time. We develop ASPP estimators based on binaural cues including the interaural phase and level difference (IPD and ILD), the interaural coherence magnitude (ICM), as well as a combined version leveraging the full interaural transfer function (ITF). We evaluate the performance of the resulting ASPP-based speech enhancement algorithms on binaural mixtures of reverberant speech and real-world noise. The proposed approach improves both objective speech quality and intelligibility over a wide range of input SNR, as measured with PESQ and binaural STOI metrics, outperforming two binaural speech enhancement benchmark methods. We show that the proposed ITF-based ASPP approach achieves a good balance of the trade-off between binaural noise reduction and binaural cue preservation.
Pages: 2150 - 2161
Number of pages: 12
Related papers
50 in total
  • [31] Robust Keyword Spotting for Noisy Environments by Leveraging Speech Enhancement and Speech Presence Probability
    Yang, Chouchang
    Saidutta, Yashas Malur
    Srinivasa, Rakshith Sharma
    Lee, Ching-Hua
    Shen, Yilin
    Jin, Hongxia
    INTERSPEECH 2023, 2023, : 1638 - 1642
  • [32] Speech Enhancement Based on Codebook Constrained Nonnegative Matrix Factorization
    Bai, Zhigang
    Bao, Changchun
    Yan, Bofang
    2018 INTERNATIONAL CONFERENCE ON AUDIO, LANGUAGE AND IMAGE PROCESSING (ICALIP), 2018, : 361 - 365
  • [33] A CODEBOOK-DRIVEN SPEECH ENHANCEMENT METHOD BY EXPLOITING SPEECH HARMONICITY
    Xiang, Yang
    Bao, Changchun
    2017 IEEE INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING, COMMUNICATIONS AND COMPUTING (ICSPCC), 2017,
  • [34] Speech enhancement using voiced speech probability based wavelet decomposition
    Bhowmick, Anirban
    Chandra, Mahesh
    COMPUTERS & ELECTRICAL ENGINEERING, 2017, 62 : 706 - 718
  • [35] Codebook constrained Wiener filtering for speech enhancement
    Sreenivas, TV
    Kirnapure, P
    IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 1996, 4 (05): : 383 - 389
  • [36] Speech enhancement using a generic noise codebook
    Srinivasan, Sriram
    Naidu, D. Hanumantha Rao
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2012, 132 (02): : EL161 - EL167
  • [37] Integrating Codebook and Wiener Filtering for Speech Enhancement
    Zhang, Dong-ming
    Bao, Chang-chun
    Deng, Feng
    2015 IEEE International Conference on Signal Processing, Communications and Computing (ICSPCC), 2015, : 193 - 197
  • [38] SPATIAL FILTERING BASED SPEECH ENHANCEMENT FOR BINAURAL HEARING AID
    Daoud, Dhouha
    Kallel, Fathi
    Ghorbel, Mohamed
    Ben Hamida, Ahmed
    2009 6TH INTERNATIONAL MULTI-CONFERENCE ON SYSTEMS, SIGNALS AND DEVICES, VOLS 1 AND 2, 2009, : 671 - 676
  • [39] Binaural Speech Enhancement based on DNN for the Application of Virtual Reality
    Wang, Jin
    Wang, Jing
    Liu, Ming
    Yan, Zhaoyu
    PROCEEDINGS OF 2018 14TH IEEE INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING (ICSP), 2018, : 629 - 633
  • [40] Model based Estimation of STP parameters for Binaural Speech Enhancement
    Kavalekalam, Mathew Shaji
    Nielsen, Jesper Kjaer
    Christensen, Mads Graesboll
    Boldt, Jesper
    2018 26TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2018, : 2479 - 2483