Binaural Codebook-Based Speech Enhancement With Atomic Speech Presence Probability

被引:10
|
作者
Wood, Sean U. N. [1 ]
Stahl, Johannes K. W. [1 ]
Mowlaee, Pejman [1 ,2 ]
机构
[1] Graz Univ Technol, Signal Proc & Speech Commun Lab, A-8010 Graz, Austria
[2] Widex AS, DK-3540 Lynge, Denmark
基金
奥地利科学基金会;
关键词
Speech enhancement; Speech coding; Noise reduction; Noise measurement; Estimation; Indexes; Binaural speech enhancement; atomic speech presence probability; nonnegative matrix factorization; interaural transfer function; QUALITY ASSESSMENT; SOURCE SEPARATION; NOISE-REDUCTION; LOCALIZATION; HEARING; PRESERVATION; ENVIRONMENT; MODEL;
D O I
10.1109/TASLP.2019.2937174
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
In this work, we present a universal codebook-based speech enhancement framework that relies on a single codebook to encode both speech and noise components. The atomic speech presence probability (ASPP) is defined as the probability that a given codebook atom encodes speech at a given point in time. We develop ASPP estimators based on binaural cues including the interaural phase and level difference (IPD and ILD), the interaural coherence magnitude (ICM), as well as a combined version leveraging the full interaural transfer function (ITF). We evaluate the performance of the resulting ASPP-based speech enhancement algorithms on binaural mixtures of reverberant speech and real-world noise. The proposed approach improves both objective speech quality and intelligibility over a wide range of input SNR, as measured with PESQ and binaural STOI metrics, outperforming two binaural speech enhancement benchmark methods. We show that the proposed ITF-based ASPP approach achieves a good balance of the trade-off between binaural noise reduction and binaural cue preservation.
引用
收藏
页码:2150 / 2161
页数:12
相关论文
共 50 条
  • [41] Binaural speech enhancement algorithm based on attention and deep learning
    Li R.
    Li Q.
    Zhao F.
    Liu S.
    Huazhong Keji Daxue Xuebao (Ziran Kexue Ban)/Journal of Huazhong University of Science and Technology (Natural Science Edition), 2023, 51 (09): : 125 - 131and166
  • [42] A multichannel subspace approach with signal presence probability for speech enhancement
    Jungpyo Hong
    Multidimensional Systems and Signal Processing, 2019, 30 : 2045 - 2058
  • [43] A multichannel subspace approach with signal presence probability for speech enhancement
    Hong, Jungpyo
    MULTIDIMENSIONAL SYSTEMS AND SIGNAL PROCESSING, 2019, 30 (04) : 2045 - 2058
  • [44] An Analysis of Dependency of Prior Probability for Codebook-Based Image Representation
    Shinomiya, Yuki
    Hoshino, Yukinobu
    2016 JOINT 8TH INTERNATIONAL CONFERENCE ON SOFT COMPUTING AND INTELLIGENT SYSTEMS (SCIS) AND 17TH INTERNATIONAL SYMPOSIUM ON ADVANCED INTELLIGENT SYSTEMS (ISIS), 2016, : 103 - 108
  • [45] Speech Enhancement Based on Teacher-Student Deep Learning Using Improved Speech Presence Probability for Noise-Robust Speech Recognition
    Tu, Yan-Hui
    Du, Jun
    Lee, Chin-Hui
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2019, 27 (12) : 2080 - 2091
  • [46] An Analysis of Dependency of Prior Probability for Codebook-Based Image Representation
    1600, Institute of Electrical and Electronics Engineers Inc., United States
  • [47] SPEECH PRESENCE PROBABILITY ESTIMATION BASED ON INTEGRATED TIME-FREQUENCY MINIMUM TRACKING FOR SPEECH ENHANCEMENT IN ADVERSE ENVIRONMENTS
    Fu, Zhong-Hua
    Wang, Jhing-Fa
    2010 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2010, : 4258 - 4261
  • [48] DNN-BASED SPEECH PRESENCE PROBABILITY ESTIMATION FORMULTI-FRAME SINGLE-MICROPHONE SPEECH ENHANCEMENT
    Tammen, Marvin
    Fischer, Doerte
    Meyer, Bernd T.
    Doclo, Simon
    2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 191 - 195
  • [49] A Generalized Subspace Approach for Multichannel Speech Enhancement Using Machine Learning-Based Speech Presence Probability Estimation
    Ke, Yuxuan
    Hu, Yi
    Li, Jian
    Zheng, Chengshi
    Li, Xiaodong
    146TH AES CONVENTION, 2019,
  • [50] A SPEECH PRESENCE MICROPHONE ARRAY BEAMFORMER USING MODEL BASED SPEECH PRESENCE PROBABILITY ESTIMATION
    Yu, Tao
    Hansen, John H. L.
    2009 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1- 8, PROCEEDINGS, 2009, : 213 - 216