Enhancing robustness for speech recognition through bio-inspired auditory filter-bank

被引:1
|
作者
Maganti, Hari Krishna [1 ]
Matassoni, Marco [1 ]
机构
[1] Fdn Bruno Kessler Irst, Ctr Informat Technol, I-38123 Trento, Italy
关键词
speech recognition; robustness; reverberant environment; feature extraction; auditory processing; lateral inhibition and level dependent frequency analysis;
D O I
10.1504/IJBIC.2012.049884
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
One of the important properties observed in basilar membrane filtering, aimed to improve robustness of the human car is lateral inhibition-based level-dependent frequency resolution. However, this particular property has not been extensively considered for improving robustness of the speech processing systems. In this work, an auditory filter-bank which includes lateral inhibition based on input stimulus providing a good fit to human auditory masking is used for improving robustness of the speech recognition system. The gammachirp auditory filter is the real part of the analytic gammachirp function which has been shown to provide an accurate description for the asymmetric and lateral inhibition observed in the basilar membrane filtering. The gammachirp is characterised with asymmetry in the low frequency tail of auditory filter response and models level dependent properties such as decrease in gain and a shift in the centre frequency of the filter with increase in level. The speech recognition experiments using the standard HTK framework are performed on standard Aurora-5 digit task database, both simulated and real data recorded with distant microphones in a hands-free mode at a real meeting room. The gammachirp-based features show reliable and consistent improvements when compared to the conventional features used for speech recognition.
引用
收藏
页码:271 / 277
页数:7
相关论文
共 50 条
  • [41] A noise-robust front-end based on tree-structured filter-bank for speech recognition
    Kil, RM
    Kim, YI
    Lee, GH
    IJCNN 2000: PROCEEDINGS OF THE IEEE-INNS-ENNS INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, VOL VI, 2000, : 81 - 86
  • [42] Bio-inspired Audio-Visual Speech Recognition Towards the Zero Instruction Set Computing
    Malcangi, Mario
    Quan, Hao
    ENGINEERING APPLICATIONS OF NEURAL NETWORKS, EANN 2016, 2016, 629 : 326 - 334
  • [43] Low-latency monaural speech enhancement with deep filter-bank equalizer
    Zheng, Chengshi
    Liu, Wenzhe
    Li, Andong
    Ke, Yuxuan
    Li, Xiaodong
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2022, 151 (05): : 3291 - 3304
  • [44] Process Innovation through bio-inspired design
    Barthe-Delanoe, Anne Marie
    Negny, Stephane
    Le Lann, Jean Marc
    28TH EUROPEAN SYMPOSIUM ON COMPUTER AIDED PROCESS ENGINEERING, 2018, 43 : 785 - 790
  • [45] Bio-inspired microsystem for robust genetic assay recognition
    Lue, Jaw-Chyng
    Fang, Wai-Chi
    JOURNAL OF BIOMEDICINE AND BIOTECHNOLOGY, 2008,
  • [46] Multi-scale Bio-inspired Place Recognition
    Chen, Zetao
    Jacobson, Adam
    Erdem, Ugur M.
    Hasselmo, Michael E.
    Milford, Michael
    2014 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2014, : 1895 - 1901
  • [47] Bio-inspired Architecture for Visual Recognition of Humans Walking
    Sanchez Orellana, Pedro Luis
    Castellanos Sanchez, Claudio
    del Angel-Guerrero, Edgar
    Martinez-Arenas, Tomas
    ADVANCES IN COMPUTATIONAL INTELLIGENCE, 2009, 61 : 443 - +
  • [48] Bio-inspired Adaboost method for efficient face recognition
    Sedai, Suman
    Rhee, Phill Kyu
    PROCEEDINGS OF THE FRONTIERS IN THE CONVERGENCE OF BIOSCIENCE AND INFORMATION TECHNOLOGIES, 2007, : 715 - 718
  • [49] Bio-inspired recognition of dopamine versus ascorbic acid
    Vergheese, TM
    Berchmans, S
    JOURNAL OF ELECTROANALYTICAL CHEMISTRY, 2004, 570 (01) : 35 - 46
  • [50] Bio-inspired anion recognition: Tunable electrostatic complementarity
    McDonald, Kevin P.
    Flood, Amar H.
    ABSTRACTS OF PAPERS OF THE AMERICAN CHEMICAL SOCIETY, 2012, 244