Binary Mask Estimation for Improved Speech Intelligibility in Reverberant Environments

Cited by: 0
Authors
Hazrati, Oldooz [1]
Lee, Jaewook [1]
Loizou, Philipos [1]
Affiliations
[1] Univ Texas Dallas, CRSS, Richardson, TX 75080 USA
Keywords
Binary mask; cochlear implant (CI); dereverberation; noise; recognition
DOI
Not available
Chinese Library Classification
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
A blind (non-ideal) time-frequency (T-F) masking technique is proposed for suppressing reverberation. A binary mask is estimated at each T-F unit by extracting a single variance-based feature from the reverberant signal and comparing its value against an adaptive threshold. The performance of the estimated binary mask is evaluated using intelligibility listening tests with hearing-impaired listeners in four moderately to highly reverberant conditions. Results indicated that the proposed T-F masking technique yielded significant improvements in intelligibility even in highly reverberant conditions (T60 = 1.0 s). This improvement was attributed to the recovery of the vowel/consonant boundaries, which are severely smeared in reverberation.
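The masking step described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract does not specify the feature or the threshold rule, so the local variance of the STFT magnitude over a few neighbouring frames and the per-band median threshold used here are assumptions chosen only for illustration. The function names `stft_mag` and `estimate_binary_mask` and the parameters `frame_len`, `hop`, `context`, and `scale` are likewise hypothetical.

```python
import numpy as np

def stft_mag(x, frame_len=256, hop=128):
    """Magnitude STFT with a Hann window, shape (n_freq, n_frames)."""
    window = np.hanning(frame_len)
    n_frames = 1 + (len(x) - frame_len) // hop
    frames = np.stack(
        [x[i * hop : i * hop + frame_len] * window for i in range(n_frames)]
    )
    return np.abs(np.fft.rfft(frames, axis=1)).T

def estimate_binary_mask(x, frame_len=256, hop=128, context=5, scale=1.0):
    """Blind binary mask from a variance-based feature (illustrative only).

    Assumption: the feature is the variance of the magnitude over `context`
    neighbouring frames (odd number) in each frequency band.
    Assumption: the adaptive threshold is `scale` times the per-band median
    of that feature, so it adapts to the signal in each band.
    """
    mag = stft_mag(x, frame_len, hop)
    n_frames = mag.shape[1]
    pad = context // 2
    padded = np.pad(mag, ((0, 0), (pad, pad)), mode="edge")
    # Stack shifted copies so the last axis holds each unit's local context.
    local = np.stack(
        [padded[:, k : k + n_frames] for k in range(context)], axis=-1
    )
    feature = local.var(axis=-1)
    threshold = scale * np.median(feature, axis=1, keepdims=True)
    mask = (feature > threshold).astype(np.uint8)  # 1 = retain, 0 = discard
    return mask, mag

# Usage: a loud burst followed by a low-level tail standing in for decay.
rng = np.random.default_rng(0)
x = np.concatenate([rng.standard_normal(8000), 0.01 * rng.standard_normal(8000)])
mask, mag = estimate_binary_mask(x)
masked_mag = mask * mag  # T-F units below the threshold are zeroed
```

In this toy signal the mask retains mostly the high-energy first half and discards the low-level tail, which is the qualitative behaviour a dereverberation mask aims for. A real system would resynthesize the waveform from the masked spectrum or, for cochlear implant processing, apply the mask to the channel envelopes.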
Pages: 162-165
Number of pages: 4
Related papers
50 in total
  • [21] USING AUTOMATIC SPEECH RECOGNITION AND SPEECH SYNTHESIS TO IMPROVE THE INTELLIGIBILITY OF COCHLEAR IMPLANT USERS IN REVERBERANT LISTENING ENVIRONMENTS
    Chu, Kevin
    Collins, Leslie
    Mainsah, Boyla
    2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020: 6929-6933
  • [22] A Speech Preprocessing Method Based on Overlap-Masking Reduction to Increase Intelligibility in Reverberant Environments
    Grosse, Julian
    van de Par, Steven
    JOURNAL OF THE AUDIO ENGINEERING SOCIETY, 2017, 65 (1-2): 31-41
  • [23] Evaluation of speech naturalness with steady-state zero padding for improving intelligibility in reverberant environments
    Matsukaze, Yohei
    Arai, Takayuki
    Suzuki, Toshimasa
    Yasu, Keiichi
    ACOUSTICAL SCIENCE AND TECHNOLOGY, 2012, 33 (06): 370-371
  • [24] Using Steady-State Suppression to Improve Speech Intelligibility in Reverberant Environments for Elderly Listeners
    Arai, Takayuki
    Hodoshima, Nao
    Yasu, Keiichi
    IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2010, 18 (07): 1775-1780
  • [25] Padding zero into steady-state portions of speech as a preprocess for improving intelligibility in reverberant environments
    Arai, Takayuki
    ACOUSTICAL SCIENCE AND TECHNOLOGY, 2005, 26 (05): 459-461
  • [26] A Speech Preprocessing Method Based on Perceptually Optimized Envelope Processing to Increase Intelligibility in Reverberant Environments
    Fallah, Ali
    van de Par, Steven
    APPLIED SCIENCES-BASEL, 2021, 11 (22):
  • [28] Improved Speech Source Localization in Reverberant Environments Based on Correlation Dimension
    Wan, Xinwang
    Wu, Zhenyang
    2009 INTERNATIONAL CONFERENCE ON WIRELESS COMMUNICATIONS AND SIGNAL PROCESSING (WCSP 2009), 2009: 1540-1543
  • [29] Cochlear implant speech intelligibility outcomes with structured and unstructured binary mask errors
    Kressner, Abigail A.
    Westermann, Adam
    Buchholz, Joerg M.
    Rozell, Christopher J.
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2016, 139 (02): 800-810
  • [30] Role of mask pattern in intelligibility of ideal binary-masked noisy speech
    Kjems, Ulrik
    Boldt, Jesper B.
    Pedersen, Michael S.
    Lunner, Thomas
    Wang, DeLiang
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2009, 126 (03): 1415-1426