Binaural Codebook-Based Speech Enhancement With Atomic Speech Presence Probability

被引:10
|
作者
Wood, Sean U. N. [1 ]
Stahl, Johannes K. W. [1 ]
Mowlaee, Pejman [1 ,2 ]
机构
[1] Graz Univ Technol, Signal Proc & Speech Commun Lab, A-8010 Graz, Austria
[2] Widex AS, DK-3540 Lynge, Denmark
基金
奥地利科学基金会;
关键词
Speech enhancement; Speech coding; Noise reduction; Noise measurement; Estimation; Indexes; Binaural speech enhancement; atomic speech presence probability; nonnegative matrix factorization; interaural transfer function; QUALITY ASSESSMENT; SOURCE SEPARATION; NOISE-REDUCTION; LOCALIZATION; HEARING; PRESERVATION; ENVIRONMENT; MODEL;
D O I
10.1109/TASLP.2019.2937174
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
In this work, we present a universal codebook-based speech enhancement framework that relies on a single codebook to encode both speech and noise components. The atomic speech presence probability (ASPP) is defined as the probability that a given codebook atom encodes speech at a given point in time. We develop ASPP estimators based on binaural cues including the interaural phase and level difference (IPD and ILD), the interaural coherence magnitude (ICM), as well as a combined version leveraging the full interaural transfer function (ITF). We evaluate the performance of the resulting ASPP-based speech enhancement algorithms on binaural mixtures of reverberant speech and real-world noise. The proposed approach improves both objective speech quality and intelligibility over a wide range of input SNR, as measured with PESQ and binaural STOI metrics, outperforming two binaural speech enhancement benchmark methods. We show that the proposed ITF-based ASPP approach achieves a good balance of the trade-off between binaural noise reduction and binaural cue preservation.
引用
收藏
页码:2150 / 2161
页数:12
相关论文
共 50 条
  • [21] Unsupervised Speech Enhancement Using Optimal Transport and Speech Presence Probability
    Jiang, Wenbin
    Yu, Kai
    Wen, Fei
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2024, 32 : 4445 - 4455
  • [22] Speech enhancement using a structured codebook
    Naidu, D. Hanumantha Rao
    Srinivasan, Sriram
    Rao, G. V. Prabhakara
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2012, 132 (04): : EL329 - EL335
  • [23] Optimized Sigmoid Functions for Speech Presence Probability and Gain Function in Speech Enhancement
    Dam, Hai Huyen
    Nordholm, Sven
    Yong, Pei Chee
    Low, Siow Yong
    CIRCUITS SYSTEMS AND SIGNAL PROCESSING, 2024, 43 (05) : 2891 - 2908
  • [24] Optimized Sigmoid Functions for Speech Presence Probability and Gain Function in Speech Enhancement
    Hai Huyen Dam
    Sven Nordholm
    Pei Chee Yong
    Siow Yong Low
    Circuits, Systems, and Signal Processing, 2024, 43 : 2891 - 2908
  • [25] Speech Enhancement Combining NMF Weighted by Speech Presence Probability and Statistical Model
    Hu, Yonggang
    Zhang, Xiongwei
    Zou, Xia
    Min, Gang
    Sun, Meng
    Zheng, Yunfei
    IEICE TRANSACTIONS ON FUNDAMENTALS OF ELECTRONICS COMMUNICATIONS AND COMPUTER SCIENCES, 2015, E98A (12) : 2701 - 2704
  • [26] Speech enhancement methods based on binaural cue coding
    Xianyun Wang
    Changchun Bao
    EURASIP Journal on Audio, Speech, and Music Processing, 2019
  • [27] Speech enhancement methods based on binaural cue coding
    Wang, Xianyun
    Bao, Changchun
    EURASIP JOURNAL ON AUDIO SPEECH AND MUSIC PROCESSING, 2019, 2019 (01)
  • [28] MODEL BASED BINAURAL ENHANCEMENT OF VOICED AND UNVOICED SPEECH
    Kavalekalam, Mathew Shaji
    Christensen, Mads Graesboll
    Boldt, Jesper B.
    2017 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2017, : 666 - 670
  • [29] BINAURAL NOISE PSD ESTIMATION FOR BINAURAL SPEECH ENHANCEMENT
    Azarpour, Masoumeh
    Enzner, Gerald
    Martin, Rainer
    2014 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2014,
  • [30] Binaural consequences of speech envelope enhancement
    Baltzell, Lucas S.
    Cardosi, Daniel
    Swaminathan, Jayaganesh
    Best, Virginia
    JASA EXPRESS LETTERS, 2022, 2 (11):