Contribution of frequency compressed temporal fine structure cues to the speech recognition in noise: An implication in cochlear implant signal processing

Cited by: 2
Authors
Poluboina, Venkateswarlu [1]
Pulikala, Aparna [1]
Muthu, Arivudai Nambi Pitchai [2]
Affiliations
[1] Natl Inst Technol Karnataka, Dept Elect & Commun, Mangalore 575025, Karnataka, India
[2] Dept Audiol & Speech Language Pathol, Mangalore 575001, Karnataka, India
Keywords
Cochlear implant signal processing; Temporal fine structure; Proportional frequency compression; Vocoder simulation; Speech recognition; Performance; Hearing; Encoder; Pitch
DOI
10.1016/j.apacoust.2021.108616
Chinese Library Classification (CLC) number
O42 [Acoustics]
Discipline classification codes
070206; 082403
Abstract
The study investigated the effect of proportionally frequency-compressed encoding of temporal fine structure (TFS) information on speech perception in noise, using vocoder simulations of cochlear implant signal processing. The study proposed a pitch synchronous overlap-add (PSOLA) algorithm for downward frequency shifting of TFS. Speech recognition scores (SRS) were measured at -10 dB, 0 dB, and +10 dB signal-to-noise ratio (SNR) for eight signal processing conditions: a sine-wave vocoder without TFS (NO-TFS); four unshifted TFS conditions (full-band TFS and TFS up to 2000, 1000, and 600 Hz); and three PSOLA conditions that shifted the 2000, 1000, and 600 Hz TFS to 1000, 500, and 300 Hz, respectively. The original envelope was unchanged across the conditions. SRS at +10 dB and -10 dB SNR reached ceiling and floor, respectively, in most conditions. Hence, SRS at 0 dB SNR was compared across the conditions. The results showed that SRS was highest with full-band TFS and lowest for the NO-TFS condition. The SRS for 600 Hz TFS shifted to 300 Hz through PSOLA was higher than for the NO-TFS condition. The findings suggest that encoding TFS by proportional frequency compression results in better speech perception in noise than NO-TFS. An important observation of the study is that speech recognition was better than with the sine-wave vocoder for all TFS conditions, including the frequency-compressed 600 Hz TFS. © 2021 Elsevier Ltd. All rights reserved.
Pages: 5