Bispectral feature speech intelligibility assessment metric based on auditory model

被引:2
|
作者
Chen, Xiaomei [1 ]
Wang, Xiaowei [1 ]
Zhong, Bo [2 ]
Yang, Jiayan [3 ]
Shang, Yingying [3 ]
机构
[1] North China Elect Power Univ, Dept Elect & Elect Engn, Beijing 102206, Peoples R China
[2] Natl Inst Metrol, Div Mech & Acoust Metrol, Beijing 100029, Peoples R China
[3] Chinese Acad Med Sci, Peking Union Med Coll Hosp, Dept Otolaryngol, Beijing 100730, Peoples R China
来源
关键词
Speech intelligibility; Gammatone filter banks; Inner hair cell; Auditory model; Bispectrum; PREDICTION; INDEX; QUALITY; REVERBERANT;
D O I
10.1016/j.csl.2023.101492
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
A bispectral feature based predictive speech intelligibility metric (GMBSIM) using a more refined functional auditory model of ear is proposed. In the auditory model of ear, Gammatone filter banks and Meddis inner hair cell auditory model is combined to simulate the ear function. With input speech signal divided into 32 auditory subbands, and each subband signal passed through the inner hair cell model, the bispectrum of each subband signal in time domain is estimated by frames. And then bispectral features are extracted and chosen to calculate the speech intelligi-bility. The proposed GMBSIM has relative low computational complexity by omitting the spec-trogram or neurogram image transformation. Considering the ear's perception and processing of speech signals makes the metric is advantageous to the classical metrics. And the last but not the least, the proposed GMBSIM metric is verified favorably across a range of conditions spanning reverberation, additive noise, and distortion such as jitter, which means it can be applied in most kinds of complex background noise environment.
引用
收藏
页数:17
相关论文
共 50 条
  • [1] AN INTELLIGIBILITY METRIC BASED ON A SIMPLE MODEL OF SPEECH COMMUNICATION
    Van Kuyk, Steven
    Kleijn, W. Bastiaan
    Hendriks, Richard C.
    2016 IEEE INTERNATIONAL WORKSHOP ON ACOUSTIC SIGNAL ENHANCEMENT (IWAENC), 2016,
  • [2] Prediction of speech intelligibility based on an auditory preprocessing model
    Christiansen, Claus
    Pedersen, Michael Syskind
    Dau, Torsten
    SPEECH COMMUNICATION, 2010, 52 (7-8) : 678 - 692
  • [3] Optimizing linguistic materials for feature-based intelligibility assessment in speech impairments
    Marczyk, A.
    Ghio, A.
    Lalain, M.
    Rebourg, M.
    Fredouille, C.
    Woisard, V
    BEHAVIOR RESEARCH METHODS, 2022, 54 (01) : 42 - 53
  • [4] Optimizing linguistic materials for feature-based intelligibility assessment in speech impairments
    A. Marczyk
    A. Ghio
    M. Lalain
    M. Rebourg
    C. Fredouille
    V. Woisard
    Behavior Research Methods, 2022, 54 : 42 - 53
  • [5] An Auditory Saliency Pooling-Based LSTM Model for Speech Intelligibility Classification
    Gallardo-Antolin, Ascension
    Montero, Juan M.
    SYMMETRY-BASEL, 2021, 13 (09):
  • [6] Comparison of a short-time speech-based intelligibility metric to the speech transmission index and intelligibility data
    Payton, K.L. (kpayton@umassd.edu), 1600, Acoustical Society of America (134):
  • [7] Comparison of a short-time speech-based intelligibility metric to the speech transmission index and intelligibility data
    Payton, Karen L.
    Shrestha, Mona
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2013, 134 (05): : 3818 - 3827
  • [8] AUDITORY PROCESSING FOR SPEECH INTELLIGIBILITY IMPROVEMENT
    TOBIAS, JV
    AEROSPACE MEDICINE, 1970, 41 (07): : 728 - &
  • [9] Auditory-model based robust feature selection for speech recognition
    Koniaris, Christos
    Kuropatwinski, Marcin
    Kleijn, W. Bastiaan
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2010, 127 (02): : EL73 - EL79
  • [10] The role of auditory spectro-temporal modulation filtering and the decision metric for speech intelligibility prediction
    Chabot-Leclerc, Alexandre
    Jorgensen, Soren
    Dau, Torsten
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2014, 135 (06): : 3502 - 3512