Combining spectral and temporal modification techniques for speech intelligibility enhancement

Cited by: 6
Authors
Cooke, Martin [1, 2]
Aubanel, Vincent [3]
Garcia Lecumberri, Maria Luisa [2]
Affiliations
[1] Ikerbasque Basque Sci Fdn, Bilbao, Spain
[2] Univ Basque Country, Language & Speech Lab, Vitoria 01006, Spain
[3] Univ Grenoble Alpes, Ctr Natl Rech Sci, GIPSA Lab, Grenoble, France
Keywords
Speech modification; Intelligibility; Retiming; Glimpsing; Cochlea-scaled entropy; Noise; Clear; Intensity
DOI
10.1016/j.csl.2018.10.003
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Modifying clean speech prior to output in noisy conditions can lead to substantial intelligibility gains. Most algorithms operate by redistributing energy across the signal, leaving the timing of the underlying speech sounds intact. Other techniques do alter the timing of speech relative to the masker. Both classes of approach - spectral and temporal - lead to a reduction in energetic masking. The current study examines how their combination affects intelligibility. Arguments can be made for both synergy and redundancy, and the presence of distortions introduced by both spectral and temporal approaches might even lead to an antagonistic combination. A cohort of native Spanish listeners identified keywords in sentences in unmodified form and following spectral, temporal and spectro-temporal modification, in the presence of a fluctuating masker. Errors in the spectro-temporal condition were substantially lower than following spectral or temporal modification alone, with a three-fold reduction compared to unmodified speech. Spectro-temporal gains were observed for all phonemes. A glimpse-based model of energetic masking incorporating speech rate changes predicts intelligibility (r = .96), and a glimpsing analysis provides further insights into the distinct mechanisms through which spectral and temporal approaches lead to a release from energetic masking. (C) 2018 Elsevier Ltd. All rights reserved.
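The glimpsing idea mentioned in the abstract can be illustrated with a minimal sketch. It is not the authors' model: it assumes speech and masker levels are already available as time-frequency grids in dB, it uses a hypothetical function name `glimpse_proportion`, and it assumes a local SNR criterion of 3 dB, a value not stated in this record. It also ignores the speech rate changes that the paper's model incorporates.

```python
import numpy as np


def glimpse_proportion(speech_db, masker_db, threshold_db=3.0):
    """Return the fraction of time-frequency cells counted as glimpses.

    A cell is treated as a glimpse when the speech level exceeds the
    masker level by more than threshold_db (an assumed criterion).
    Inputs are arrays of levels in dB with identical shapes.
    """
    speech_db = np.asarray(speech_db, dtype=float)
    masker_db = np.asarray(masker_db, dtype=float)
    if speech_db.shape != masker_db.shape:
        raise ValueError("speech and masker grids must have the same shape")
    # Boolean mask of cells where speech survives energetic masking.
    glimpses = speech_db > masker_db + threshold_db
    return float(glimpses.mean())


# Toy 2 x 2 grid (rows: frequency channels, columns: time frames).
speech = [[10.0, 0.0], [5.0, 5.0]]
masker = [[0.0, 0.0], [0.0, 10.0]]
proportion = glimpse_proportion(speech, masker)
```

In this toy grid two of the four cells exceed the masker by more than 3 dB, so the proportion is 0.5. Under this view, either kind of modification would raise the proportion: a spectral one by moving speech energy into cells where the masker is weak, a temporal one by shifting speech into moments where the fluctuating masker dips.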
Pages: 26-39
Number of pages: 14
Related papers
50 in total
  • [1] Spectral and temporal manipulations of SFF envelopes for enhancement of speech intelligibility in noise
    Chennupati, Nivedita
    Kadiri, Sudarsana Reddy
    Yegnanarayana, B.
    COMPUTER SPEECH AND LANGUAGE, 2019, 54 : 86 - 105
  • [2] Combining perceptually-motivated spectral shaping with loudness and duration modification for intelligibility enhancement of HMM-based synthetic speech in noise
    Valentini-Botinhao, Cassia
    Yamagishi, Junichi
    King, Simon
    Stylianou, Yannis
    14TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2013), VOLS 1-5, 2013, : 3534 - 3538
  • [3] LSP-based speech modification for intelligibility enhancement
    McLoughlin, IV
    Chance, RJ
    DSP 97: 1997 13TH INTERNATIONAL CONFERENCE ON DIGITAL SIGNAL PROCESSING PROCEEDINGS, VOLS 1 AND 2: SPECIAL SESSIONS, 1997, : 591 - 594
  • [4] Effects of temporal and spectral factors of maskers on speech intelligibility
    Hara, Yoshifumi
    Tohyama, Mikio
    Miyoshi, Kazunori
    APPLIED ACOUSTICS, 2012, 73 (09) : 893 - 899
  • [5] Effects of Enhancement of Spectral Changes on Speech Quality and Subjective Speech Intelligibility
    Chen, Jing
    Baer, Thomas
    Moore, Brian C. J.
    11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 3 AND 4, 2010, : 1640 - 1643
  • [6] Learning static spectral weightings for speech intelligibility enhancement in noise
    Tang, Yan
    Cooke, Martin
    COMPUTER SPEECH AND LANGUAGE, 2018, 49 : 1 - 16
  • [7] Modulation Enhancement of Temporal Envelopes for Increasing Speech Intelligibility in Noise
    Koutsogiannaki, Maria
    Stylianou, Yannis
    17TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2016), VOLS 1-5: UNDERSTANDING SPEECH PROCESSING IN HUMANS AND MACHINES, 2016, : 2508 - 2512
  • [8] Objective Intelligibility Prediction of Speech by Combining Correlation and Distortion based Techniques
    Gomez, Angel M.
    Schwerin, Belinda
    Paliwal, Kuldip
    12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5, 2011, : 1232 - 1235
  • [9] Intelligibility Enhancement Based on Speech Waveform Modification Using Hearing Impairment
    Hikosaka, Shu
    Seki, Shogo
    Hayashi, Tomoki
    Kobayashi, Kazuhiro
    Takeda, Kazuya
    Banno, Hideki
    Toda, Tomoki
    INTERSPEECH 2020, 2020, : 4059 - 4063
  • [10] Spectral contrast enhancement improves speech intelligibility in noise for cochlear implants
    Nogueira, Waldo
    Rode, Thilo
    Buechner, Andreas
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2016, 139 (02) : 728 - 739