Bispectral feature speech intelligibility assessment metric based on auditory model

Cited by: 2
Authors
Chen, Xiaomei [1]
Wang, Xiaowei [1]
Zhong, Bo [2]
Yang, Jiayan [3]
Shang, Yingying [3]
Affiliations
[1] North China Elect Power Univ, Dept Elect & Elect Engn, Beijing 102206, Peoples R China
[2] Natl Inst Metrol, Div Mech & Acoust Metrol, Beijing 100029, Peoples R China
[3] Chinese Acad Med Sci, Peking Union Med Coll Hosp, Dept Otolaryngol, Beijing 100730, Peoples R China
Keywords
Speech intelligibility; Gammatone filter banks; Inner hair cell; Auditory model; Bispectrum; PREDICTION; INDEX; QUALITY; REVERBERANT
DOI
10.1016/j.csl.2023.101492
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
A bispectral-feature-based predictive speech intelligibility metric (GMBSIM) using a more refined functional auditory model of the ear is proposed. In the auditory model, Gammatone filter banks and the Meddis inner hair cell model are combined to simulate the function of the ear. The input speech signal is divided into 32 auditory subbands, each subband signal is passed through the inner hair cell model, and the bispectrum of each subband signal is then estimated frame by frame in the time domain. Bispectral features are then extracted and selected to calculate the speech intelligibility. The proposed GMBSIM has relatively low computational complexity because it omits the spectrogram or neurogram image transformation. Accounting for the ear's perception and processing of speech signals gives the metric an advantage over classical metrics. Last but not least, GMBSIM is verified favorably across a range of conditions spanning reverberation, additive noise, and distortion such as jitter, which indicates that it can be applied in most kinds of complex background noise environments.
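The frame-wise bispectrum step mentioned in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the record does not say which estimator, frame length, hop size, or window the paper uses, so the direct FFT-based estimator, the Hann window, and the names `frame_signal` and `bispectrum` below are all assumptions chosen for illustration. The Gammatone filter bank and Meddis inner hair cell stages are omitted; the function would be applied to each subband signal after those stages.

```python
import numpy as np


def frame_signal(x, frame_len, hop):
    """Split a 1-D signal into overlapping frames (one frame per row)."""
    n_frames = 1 + (len(x) - frame_len) // hop
    idx = np.arange(frame_len)[None, :] + hop * np.arange(n_frames)[:, None]
    return x[idx]


def bispectrum(x, frame_len=64, hop=32):
    """Direct (FFT-based) bispectrum estimate averaged over frames.

    B[k1, k2] = mean over frames of X[k1] * X[k2] * conj(X[k1 + k2]),
    with the index k1 + k2 taken modulo frame_len.
    frame_len and hop are illustrative defaults, not values from the paper.
    """
    frames = frame_signal(np.asarray(x, dtype=float), frame_len, hop)
    # Remove the per-frame mean, then apply a Hann window (an assumption).
    frames = frames - frames.mean(axis=1, keepdims=True)
    frames = frames * np.hanning(frame_len)
    X = np.fft.fft(frames, axis=1)

    k = np.arange(frame_len)
    sum_idx = (k[:, None] + k[None, :]) % frame_len  # bin index of k1 + k2
    triple = X[:, :, None] * X[:, None, :] * np.conj(X[:, sum_idx])
    return triple.mean(axis=0)
```

Calling `bispectrum(subband_signal)` returns a complex `frame_len × frame_len` matrix. A signal with quadratic phase coupling between two frequency bins and their sum produces a large magnitude at the corresponding `(k1, k2)` entry, which is the kind of structure bispectral features are meant to capture.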
Pages: 17