A noise-robust front-end based on tree-structured filter-bank for speech recognition

被引:0
|
作者
Kil, RM [1 ]
Kim, YI [1 ]
Lee, GH [1 ]
机构
[1] Korea Adv Inst Sci & Technol, Brain Sci Res Ctr, Yusong Gu, Taejon 305701, South Korea
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper presents a new model of auditory preprocessors based on tree-structured filter-banks to provide the robustness to noise and the easiness of hardware implementation. The robustness to noise is further improved by two components: one is the adaptive Q control of filter-bank in the sense of enhancing the contrast between frequency channels and another is the adaptive gain control of filter-bank outputs in the sense of enhancing the contrast between time frames. As a result, the proposed approach has shown the better performance of speech recognition in noisy environment compared to other approaches of auditory preprocessors. To show the effectiveness of our approach, the simulation for the English digit recognition of TI46-Word database has been performed.
引用
收藏
页码:81 / 86
页数:6
相关论文
共 50 条
  • [21] A Multichannel Noise Reduction Front-end based on psychoacoustics for robust speech recognition in highly noisy environments
    Cifani, Simone
    Principi, Emanuele
    Rocchi, Cesare
    Squartini, Stefano
    Piazza, Francesco
    2008 HANDS-FREE SPEECH COMMUNICATION AND MICROPHONE ARRAYS, 2008, : 173 - 176
  • [22] A Front-End Speech Enhancement System for Robust Automotive Speech Recognition
    Wang, Haikun
    Ye, Zhongfu
    Chen, Jingdong
    2018 11TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING (ISCSLP), 2018, : 1 - 5
  • [23] Time and frequency filtering of filter-bank energies for robust HMM speech recognition
    Nadeu, C
    Macho, D
    Hernando, J
    SPEECH COMMUNICATION, 2001, 34 (1-2) : 93 - 114
  • [24] Investigation into a Mel subspace based front-end processing for robust speech recognition
    Selouani, SA
    O'Shaughnessy, D
    Proceedings of the Fourth IEEE International Symposium on Signal Processing and Information Technology, 2004, : 187 - 190
  • [25] Robust Front-End Processing For Emotion Recognition In Noisy Speech
    Pandharipande, Meghna
    Chakraborty, Rupayan
    Panda, Ashish
    Kopparapu, Sunil Kumar
    2018 11TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING (ISCSLP), 2018, : 324 - 328
  • [26] Performance evaluation of front-end algorithms for robust speech recognition
    Cheng, O
    Abdulla, W
    Salcic, Z
    ISSPA 2005: THE 8TH INTERNATIONAL SYMPOSIUM ON SIGNAL PROCESSING AND ITS APPLICATIONS, VOLS 1 AND 2, PROCEEDINGS, 2005, : 711 - 714
  • [27] INTERACTIVE FEATURE FUSION FOR END-TO-END NOISE-ROBUST SPEECH RECOGNITION
    Hu, Yuchen
    Hou, Nana
    Chen, Chen
    Chng, Eng Siong
    2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 6292 - 6296
  • [28] ROBUST FRONT-END PROCESSING FOR SPEECH RECOGNITION IN NOISY CONDITIONS
    Das, Biswajit
    Panda, Ashish
    2017 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2017, : 5235 - 5239
  • [29] A noise robust front-end for speech recognition using hough transform and cumulative distribution mapping
    Choi, Eric H. C.
    18TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION, VOL 4, PROCEEDINGS, 2006, : 286 - +
  • [30] A noise robust front-end with low computational cost for embedded in-car speech recognition
    Ding, Pei
    He, Lei
    Yan, Xiang
    Zhao, Rui
    Hao, Jie
    2007 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL IV, PTS 1-3, 2007, : 1045 - +