Enhancing robustness for speech recognition through bio-inspired auditory filter-bank

被引:1
|
作者
Maganti, Hari Krishna [1 ]
Matassoni, Marco [1 ]
机构
[1] Fdn Bruno Kessler Irst, Ctr Informat Technol, I-38123 Trento, Italy
关键词
speech recognition; robustness; reverberant environment; feature extraction; auditory processing; lateral inhibition and level dependent frequency analysis;
D O I
10.1504/IJBIC.2012.049884
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
One of the important properties observed in basilar membrane filtering, aimed to improve robustness of the human car is lateral inhibition-based level-dependent frequency resolution. However, this particular property has not been extensively considered for improving robustness of the speech processing systems. In this work, an auditory filter-bank which includes lateral inhibition based on input stimulus providing a good fit to human auditory masking is used for improving robustness of the speech recognition system. The gammachirp auditory filter is the real part of the analytic gammachirp function which has been shown to provide an accurate description for the asymmetric and lateral inhibition observed in the basilar membrane filtering. The gammachirp is characterised with asymmetry in the low frequency tail of auditory filter response and models level dependent properties such as decrease in gain and a shift in the centre frequency of the filter with increase in level. The speech recognition experiments using the standard HTK framework are performed on standard Aurora-5 digit task database, both simulated and real data recorded with distant microphones in a hands-free mode at a real meeting room. The gammachirp-based features show reliable and consistent improvements when compared to the conventional features used for speech recognition.
引用
收藏
页码:271 / 277
页数:7
相关论文
共 50 条
  • [21] Enhancing the performance of Bio-inspired adhesives
    Chung, Hoyong
    Glass, Paul
    Sitti, Metin
    Washburn, Newell R.
    ABSTRACTS OF PAPERS OF THE AMERICAN CHEMICAL SOCIETY, 2010, 240
  • [22] On enhancing feature sequence filtering with filter-bank energy transformation in speaker verification with telephone speech
    Garreton, Claudio
    Becerra Yoma, Nestor
    11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 1-2, 2010, : 1461 - 1464
  • [23] Role of Robustness in Running: Bio- and Bio-inspired Exoskeletons
    Full, R. J.
    Jayaram, K.
    Mongeau, J. M.
    Birkmeyer, P.
    Hoover, A.
    Fearing, R. S.
    INTEGRATIVE AND COMPARATIVE BIOLOGY, 2011, 51 : E44 - E44
  • [24] PCA-based human auditory filter bank for speech recognition
    Nhat, VDM
    Lee, SY
    2004 INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING & COMMUNICATIONS (SPCOM), 2004, : 393 - 397
  • [25] Bio-inspired molecular recognition of polymers inspired by Ronald Breslow the father of biomimetic and bio-inspired chemistry
    Zimmerman, Steven C.
    ABSTRACTS OF PAPERS OF THE AMERICAN CHEMICAL SOCIETY, 2011, 241
  • [26] A Kalman filter based on wavelet filter-bank and psychoacoustic modeling for speech enhancement
    Shao, Yu
    Chang, Chip-Hong
    2006 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS, VOLS 1-11, PROCEEDINGS, 2006, : 121 - +
  • [27] INSTANTANEOUS FREQUENCY FILTER-BANK FEATURES FOR LOW RESOURCE SPEECH RECOGNITION USING DEEP RECURRENT ARCHITECTURES
    Nayak, Shekhar
    Kumar, C. Shiva
    Murty, K. Sri Rama
    2021 NATIONAL CONFERENCE ON COMMUNICATIONS (NCC), 2021, : 105 - 110
  • [28] Design for Robustness: Bio-Inspired Perspectives in Structural Engineering
    Kiakojouri, Foad
    De Biagi, Valerio
    Abbracciavento, Lorenza
    BIOMIMETICS, 2023, 8 (01)
  • [29] Bio-Inspired Multi-Robot Communication through Behavior Recognition
    Novitzky, Michael
    Pippin, Charles
    Collins, Thomas R.
    Balch, Tucker R.
    West, Michael E.
    2012 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND BIOMIMETICS (ROBIO 2012), 2012,
  • [30] Manipulation of Bio-inspired Robot with Gesture Recognition through Fractional Calculus
    Marques Junior, F. C. F.
    Saraiva, Arata A.
    Sousa, Jose Vigno M.
    Fonseca Ferreira, N. M.
    Valente, Antonio
    15TH LATIN AMERICAN ROBOTICS SYMPOSIUM 6TH BRAZILIAN ROBOTICS SYMPOSIUM 9TH WORKSHOP ON ROBOTICS IN EDUCATION (LARS/SBR/WRE 2018), 2018, : 230 - 235