Noise suppression based on auditory-like filters for robust speech recognition

被引:0
|
作者
Zhao, JH [1 ]
Xie, X [1 ]
Kuang, JM [1 ]
机构
[1] Beijing Inst Technol, Dept Elect Engn, Beijing 100081, Peoples R China
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, an efficient noise suppression algorithm for speech signal is present which is based on auditory-like filters. The algorithm process in three steps: first, the noise energy spectral is estimated after corrupted speech is input into a set of auditory-like filters. A statistical estimation method based on multi band filters is proposed and compared with weighted average. The second step is to eliminate the estimated noise spectral from the observed signal by spectral subtraction. Finally, auditory-based feature is extracted from the enhanced signal and introduced into ASR system. The noise suppression algorithm is evaluated in speaker-dependent Chinese digit experiment and the experiment results show that the proposed algorithm, can improve the automatic speech recognition performance in noisy environment.
引用
收藏
页码:560 / 563
页数:4
相关论文
共 50 条
  • [1] AUDITORY FEATURES BASED ON GAMMATONE FILTERS FOR ROBUST SPEECH RECOGNITION
    Qi, Jun
    Wang, Dong
    Jiang, Yi
    Liu, Runsheng
    2013 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS (ISCAS), 2013, : 305 - 308
  • [2] Robust noise suppression methods in speech recognition
    Cui, Yi
    Zhang, Dong
    Shi, Liangping
    Chen, Liyuan
    Beijing Youdian Xueyuan Xuebao/Journal of Beijing University of Posts And Telecommunications, 1998, 21 (02): : 10 - 14
  • [3] Noise Suppression based on nonnegative matrix factorization for robust speech recognition
    Fan, Hao-teng
    Lin, Pao-han
    Hung, Jeih-weih
    2014 INTERNATIONAL CONFERENCE ON INFORMATION SCIENCE, ELECTRONICS AND ELECTRICAL ENGINEERING (ISEEE), VOLS 1-3, 2014, : 1731 - +
  • [4] Auditory-like filterbank: An optimal speech processor for efficient human speech communication
    PRASANTA KUMAR GHOSH
    LOUIS M GOLDSTEIN
    SHRIKANTH S NARAYANAN
    Sadhana, 2011, 36 : 699 - 712
  • [5] Auditory-like filterbank: An optimal speech processor for efficient human speech communication
    Ghosh, Prasanta Kumar
    Goldstein, Louis M.
    Narayanan, Shrikanth S.
    SADHANA-ACADEMY PROCEEDINGS IN ENGINEERING SCIENCES, 2011, 36 (05): : 699 - 712
  • [6] Deep Q-network-based noise suppression for robust speech recognition
    Park, Tae-Jun
    Chang, Joon-Hyuk
    TURKISH JOURNAL OF ELECTRICAL ENGINEERING AND COMPUTER SCIENCES, 2021, 29 (05) : 2362 - 2373
  • [7] Deep Q-network-based noise suppression for robust speech recognition
    Park T.-J.
    Chang J.-H.
    Turkish Journal of Electrical Engineering and Computer Sciences, 2021, 25 (09) : 2362 - 2373
  • [8] Discriminative training of auditory filters of different shapes for robust speech recognition
    Mak, B
    Tam, YC
    Hsiao, R
    2003 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL II, PROCEEDINGS: SPEECH II; INDUSTRY TECHNOLOGY TRACKS; DESIGN & IMPLEMENTATION OF SIGNAL PROCESSING SYSTEMS; NEURAL NETWORKS FOR SIGNAL PROCESSING, 2003, : 45 - 48
  • [9] Noise Robust Feature Scheme for Automatic Speech Recognition Based on Auditory Perceptual Mechanisms
    Cai, Shang
    Xiao, Yeming
    Pan, Jielin
    Zhao, Qingwei
    Yan, Yonghong
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2012, E95D (06): : 1610 - 1618
  • [10] Fusion Feature Extraction Based on Auditory and Energy for Noise-Robust Speech Recognition
    Shi, Yanyan
    Bai, Jing
    Xue, Peiyun
    Shi, Dianxi
    IEEE ACCESS, 2019, 7 : 81911 - 81922