Noise suppression based on auditory-like filters for robust speech recognition

被引：0

作者：

Zhao, JH ^{[1
]}

Xie, X ^{[1
]}

Kuang, JM ^{[1
]}

机构：

[1] Beijing Inst Technol, Dept Elect Engn, Beijing 100081, Peoples R China

来源：

2002 6TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING PROCEEDINGS, VOLS I AND II | 2002年

关键词：

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

In this paper, an efficient noise suppression algorithm for speech signal is present which is based on auditory-like filters. The algorithm process in three steps: first, the noise energy spectral is estimated after corrupted speech is input into a set of auditory-like filters. A statistical estimation method based on multi band filters is proposed and compared with weighted average. The second step is to eliminate the estimated noise spectral from the observed signal by spectral subtraction. Finally, auditory-based feature is extracted from the enhanced signal and introduced into ASR system. The noise suppression algorithm is evaluated in speaker-dependent Chinese digit experiment and the experiment results show that the proposed algorithm, can improve the automatic speech recognition performance in noisy environment.

引用

页码：560 / 563

页数：4

共 50 条

[1] AUDITORY FEATURES BASED ON GAMMATONE FILTERS FOR ROBUST SPEECH RECOGNITION
Qi, Jun
Wang, Dong
Jiang, Yi
Liu, Runsheng
2013 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS (ISCAS), 2013, : 305 - 308
[2] Robust noise suppression methods in speech recognition
Cui, Yi
Zhang, Dong
Shi, Liangping
Chen, Liyuan
Beijing Youdian Xueyuan Xuebao/Journal of Beijing University of Posts And Telecommunications, 1998, 21 (02): : 10 - 14
[3] Noise Suppression based on nonnegative matrix factorization for robust speech recognition
Fan, Hao-teng
Lin, Pao-han
Hung, Jeih-weih
2014 INTERNATIONAL CONFERENCE ON INFORMATION SCIENCE, ELECTRONICS AND ELECTRICAL ENGINEERING (ISEEE), VOLS 1-3, 2014, : 1731 - +
[4] Auditory-like filterbank: An optimal speech processor for efficient human speech communication
PRASANTA KUMAR GHOSH
LOUIS M GOLDSTEIN
SHRIKANTH S NARAYANAN
Sadhana, 2011, 36 : 699 - 712
[5] Auditory-like filterbank: An optimal speech processor for efficient human speech communication
Ghosh, Prasanta Kumar
Goldstein, Louis M.
Narayanan, Shrikanth S.
SADHANA-ACADEMY PROCEEDINGS IN ENGINEERING SCIENCES, 2011, 36 (05): : 699 - 712
[6] Deep Q-network-based noise suppression for robust speech recognition
Park, Tae-Jun
Chang, Joon-Hyuk
TURKISH JOURNAL OF ELECTRICAL ENGINEERING AND COMPUTER SCIENCES, 2021, 29 (05) : 2362 - 2373
[7] Deep Q-network-based noise suppression for robust speech recognition
Park T.-J.
Chang J.-H.
Turkish Journal of Electrical Engineering and Computer Sciences, 2021, 25 (09) : 2362 - 2373
[8] Discriminative training of auditory filters of different shapes for robust speech recognition
Mak, B
Tam, YC
Hsiao, R
2003 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL II, PROCEEDINGS: SPEECH II; INDUSTRY TECHNOLOGY TRACKS; DESIGN & IMPLEMENTATION OF SIGNAL PROCESSING SYSTEMS; NEURAL NETWORKS FOR SIGNAL PROCESSING, 2003, : 45 - 48
[9] Noise Robust Feature Scheme for Automatic Speech Recognition Based on Auditory Perceptual Mechanisms
Cai, Shang
Xiao, Yeming
Pan, Jielin
Zhao, Qingwei
Yan, Yonghong
IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2012, E95D (06): : 1610 - 1618
[10] Fusion Feature Extraction Based on Auditory and Energy for Noise-Robust Speech Recognition
Shi, Yanyan
Bai, Jing
Xue, Peiyun
Shi, Dianxi
IEEE ACCESS, 2019, 7 : 81911 - 81922

← 1 2 3 4 5 →