Speech Understanding Performance of Cochlear Implant Subjects Using Time-Frequency Masking-Based Noise Reduction

被引:24
|
作者
Qazi, Obaid Ur Rehman [1 ]
van Dijk, Bas [1 ]
Moonen, Marc [3 ]
Wouters, Jan [2 ]
机构
[1] Cochlear Technol Ctr Belgium, B-2800 Mechelen, Belgium
[2] Katholieke Univ Leuven, Dept Neurosci, ExpORL, B-3000 Louvain, Belgium
[3] Katholieke Univ Leuven, Dept Elect Engn, B-3000 Louvain, Belgium
关键词
Cochlear implants (CIs); phase error variance; speech processing; time-frequency (TF) masking; MONAURAL SPEECH; ENHANCEMENT; HEARING; RECOGNITION; PERCEPTION; ALGORITHMS; SEPARATION; STRATEGY; SYSTEM; SPEAK;
D O I
10.1109/TBME.2012.2187650
中图分类号
R318 [生物医学工程];
学科分类号
0831 ;
摘要
Cochlear implant (CI) recipients report severe degradation of speech understanding under noisy conditions. Most CI recipients typically can require about 10-25 dB higher signal-to-noise ratio than normal hearing (NH) listeners in order to achieve similar speech understanding performance. In recent years, significant emphasis has been put on binaural algorithms, which not only make use of the head shadow effect, but also have two or more microphone signals at their disposal to generate binaural inputs. Most of the CI recipients today are unilaterally implanted but they can still benefit from the binaural processing utilizing a contralateral microphone. The phase error filtering (PEF) algorithm tries to minimize the phase error variance utilizing a time-frequency mask for noise reduction. Potential improvement in speech intelligibility offered by the algorithm is evaluated with four different kinds of mask functions. The study reveals that the PEF algorithm which uses a contralateral microphone but unilateral presentation provides considerable improvement in intelligibility for both NH and CI subjects. Further, preference rating test suggests that CI subjects can tolerate higher levels of distortions than NH subjects, and therefore, more aggressive noise reduction for CI recipients is possible.
引用
收藏
页码:1364 / 1373
页数:10
相关论文
共 50 条
  • [31] Stereo-input Speech Recognition using Sparseness-based Time-frequency Masking in a Reverberant Environment
    Izumi, Yosuke
    Nishiki, Kenta
    Watanabe, Shinji
    Nishimoto, Takuya
    Ono, Nobutaka
    Sagayama, Shigeki
    INTERSPEECH 2009: 10TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2009, VOLS 1-5, 2009, : 1907 - +
  • [32] Localization based stereo speech source separation using probabilistic time-frequency masking and deep neural networks
    Yu, Yang
    Wang, Wenwu
    Han, Peng
    EURASIP JOURNAL ON AUDIO SPEECH AND MUSIC PROCESSING, 2016,
  • [33] A Comparative Study on Speech in Noise Understanding with a Direct Acoustic Cochlear Implant in Subjects with Severe to Profound Mixed Hearing Loss
    Lenarz, Thomas
    Verhaert, Nicolas
    Desloovere, Christian
    Desmet, Jolien
    D'hondt, Christiane
    Falcon Gonzalez, Juan Carlos
    Kludt, Eugen
    Ramos Macias, Angel
    Skarzynski, Henryk
    Van de Heyning, Paul
    Vyncke, Caroline
    Wasowski, Arkadiusz
    AUDIOLOGY AND NEURO-OTOLOGY, 2014, 19 (03) : 164 - 174
  • [34] Online blind speech separation using multiple acoustic speaker tracking and time-frequency masking
    Pertila, P.
    COMPUTER SPEECH AND LANGUAGE, 2013, 27 (03): : 683 - 702
  • [35] Impact of phase estimation on single-channel speech separation based on time-frequency masking
    Mayer, Florian
    Williamson, Donald S.
    Mowlaee, Pejman
    Wang, DeLiang
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2017, 141 (06): : 4668 - 4679
  • [36] TIME-FREQUENCY MASKING BASED ONLINE SPEECH ENHANCEMENT WITH MULTI-CHANNEL DATA USING CONVOLUTIONAL NEURAL NETWORKS
    Chakrabarty, Soumitro
    Wang, DeLiang
    Habets, Emanuel A. P.
    2018 16TH INTERNATIONAL WORKSHOP ON ACOUSTIC SIGNAL ENHANCEMENT (IWAENC), 2018, : 476 - 480
  • [37] TIME DIFFERENCE OF ARRIVAL ESTIMATION OF SPEECH SIGNALS USING DEEP NEURAL NETWORKS WITH INTEGRATED TIME-FREQUENCY MASKING
    Pertila, Pasi
    Parviainen, Mikko
    2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 436 - 440
  • [38] SOFT MASKING BASED ADAPTATION FOR TIME-FREQUENCY BEAMFORMERS UNDER REVERBERANT AND BACKGROUND NOISE ENVIRONMENTS
    Kawaguchi, Yohei
    Togami, Masahito
    18TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO-2010), 2010, : 736 - 740
  • [39] Deep Learning-Based Noise Reduction Approach to Improve Speech Intelligibility for Cochlear Implant Recipients
    Lai, Ying-Hui
    Tsao, Yu
    Lu, Xugang
    Chen, Fei
    Su, Yu-Ting
    Chen, Kuang-Chao
    Chen, Yu-Hsuan
    Chen, Li-Ching
    Li, Lieber Po-Hung
    Lee, Chin-Hui
    EAR AND HEARING, 2018, 39 (04): : 795 - 809
  • [40] Speech enhancement with natural sounding residual noise based on connected time-frequency speech presence regions
    Sorensen, KV
    Andersen, SV
    EURASIP JOURNAL ON APPLIED SIGNAL PROCESSING, 2005, 2005 (18) : 2954 - 2964