Binaural bark subband preprocessing of nonstationary signals for noise robust speech feature extraction

被引：0

作者：

Peters, M ^{[1
]}

机构：

[1] BMW AG, Ctr Res & Dev, D-80788 Munich, Germany

来源：

PROCEEDINGS OF THE IEEE-SP INTERNATIONAL SYMPOSIUM ON TIME-FREQUENCY AND TIME-SCALE ANALYSIS | 1998年

关键词：

D O I：

10.1109/TFSA.1998.721498

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

A two channel approach to noise robust feature extraction for speech recognition in the car is proposed. The coherence function within the Bark subbands of the Mel-Frequency-Cepstral-Transform is calculated to estimate the spectral similarity of two statistic processes. It is illustrated how the coherence of speech in binaural signals is used to increase the robustness against incoherent noise. The introduced preprocessing method of nonstationary signals in two microphones results in an additive correction term of the Mel-Frequency-Cepstral-Coefficients.

引用

页码：609 / 612

页数：4

共 50 条

[41] Employing Robust Principal Component Analysis for Noise-Robust Speech Feature Extraction in Automatic Speech Recognition with the Structure of a Deep Neural Network
Hung, Jeih-weih
Lin, Jung-Shan
Wu, Po-Jen
APPLIED SYSTEM INNOVATION, 2018, 1 (03) : 1 - 14
[42] Distinctive phonetic feature extraction for robust speech recognition
Fukuda, T
Yamamoto, W
Nitta, T
2003 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL II, PROCEEDINGS: SPEECH II; INDUSTRY TECHNOLOGY TRACKS; DESIGN & IMPLEMENTATION OF SIGNAL PROCESSING SYSTEMS; NEURAL NETWORKS FOR SIGNAL PROCESSING, 2003, : 25 - 28
[43] Nonstationary Noise PSD Matrix Estimation for Multichannel Blind Speech Extraction
Taseska, Maja
Habets, Emanuel A. P.
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2017, 25 (11) : 2223 - 2236
[44] Adaptable noise reduction of ECG signals for feature extraction
Kim, Hyun Dong
Min, Chul Hong
Kim, Tae Seon
ADVANCES IN NEURAL NETWORKS - ISNN 2006, PT 3, PROCEEDINGS, 2006, 3973 : 586 - 591
[45] A Noise Tolerant Method for ECG Signals Feature Extraction and Noise Reduction
Ayari, Emna Zoghlami
Tielert, Reinhard
Wehn, Norbert
2009 3RD INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICAL ENGINEERING, VOLS 1-11, 2009, : 2399 - 2402
[46] Robust speech features extraction in convolutional noise environment
Lü, Zhao
Wu, Xiaopei
Zhang, Chao
Li, Mi
Shengxue Xuebao/Acta Acustica, 2010, 35 (04): : 465 - 470
[47] Improvements in intelligibility of noisy reverberant speech using a binaural subband adaptive noise-cancellation processing scheme
Shields, PW
Campbell, DR
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2001, 110 (06): : 3232 - 3242
[48] Non-linear feature extraction for robust speech recognition in stationary and non-stationary noise
Zhu, QF
Alwan, A
COMPUTER SPEECH AND LANGUAGE, 2003, 17 (04): : 381 - 402
[49] Spike-based feature extraction for noise robust speech recognition using phase synchrony coding
Uysal, Ismail
Sathyendra, Harsha
Harris, John G.
2007 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS, VOLS 1-11, 2007, : 1529 - 1532
[50] HMM-Based strategies for enhancement of speech signals embedded in nonstationary noise
Sameti, H
Sheikhzadeh, H
Deng, L
Brennan, RL
IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 1998, 6 (05): : 445 - 455

← 1 2 3 4 5 →