Binaural bark subband preprocessing of nonstationary signals for noise robust speech feature extraction

被引:0
|
作者
Peters, M [1 ]
机构
[1] BMW AG, Ctr Res & Dev, D-80788 Munich, Germany
来源
PROCEEDINGS OF THE IEEE-SP INTERNATIONAL SYMPOSIUM ON TIME-FREQUENCY AND TIME-SCALE ANALYSIS | 1998年
关键词
D O I
10.1109/TFSA.1998.721498
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
A two channel approach to noise robust feature extraction for speech recognition in the car is proposed. The coherence function within the Bark subbands of the Mel-Frequency-Cepstral-Transform is calculated to estimate the spectral similarity of two statistic processes. It is illustrated how the coherence of speech in binaural signals is used to increase the robustness against incoherent noise. The introduced preprocessing method of nonstationary signals in two microphones results in an additive correction term of the Mel-Frequency-Cepstral-Coefficients.
引用
收藏
页码:609 / 612
页数:4
相关论文
共 50 条
  • [41] Employing Robust Principal Component Analysis for Noise-Robust Speech Feature Extraction in Automatic Speech Recognition with the Structure of a Deep Neural Network
    Hung, Jeih-weih
    Lin, Jung-Shan
    Wu, Po-Jen
    APPLIED SYSTEM INNOVATION, 2018, 1 (03) : 1 - 14
  • [42] Distinctive phonetic feature extraction for robust speech recognition
    Fukuda, T
    Yamamoto, W
    Nitta, T
    2003 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL II, PROCEEDINGS: SPEECH II; INDUSTRY TECHNOLOGY TRACKS; DESIGN & IMPLEMENTATION OF SIGNAL PROCESSING SYSTEMS; NEURAL NETWORKS FOR SIGNAL PROCESSING, 2003, : 25 - 28
  • [43] Nonstationary Noise PSD Matrix Estimation for Multichannel Blind Speech Extraction
    Taseska, Maja
    Habets, Emanuel A. P.
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2017, 25 (11) : 2223 - 2236
  • [44] Adaptable noise reduction of ECG signals for feature extraction
    Kim, Hyun Dong
    Min, Chul Hong
    Kim, Tae Seon
    ADVANCES IN NEURAL NETWORKS - ISNN 2006, PT 3, PROCEEDINGS, 2006, 3973 : 586 - 591
  • [45] A Noise Tolerant Method for ECG Signals Feature Extraction and Noise Reduction
    Ayari, Emna Zoghlami
    Tielert, Reinhard
    Wehn, Norbert
    2009 3RD INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICAL ENGINEERING, VOLS 1-11, 2009, : 2399 - 2402
  • [46] Robust speech features extraction in convolutional noise environment
    Lü, Zhao
    Wu, Xiaopei
    Zhang, Chao
    Li, Mi
    Shengxue Xuebao/Acta Acustica, 2010, 35 (04): : 465 - 470
  • [47] Improvements in intelligibility of noisy reverberant speech using a binaural subband adaptive noise-cancellation processing scheme
    Shields, PW
    Campbell, DR
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2001, 110 (06): : 3232 - 3242
  • [48] Non-linear feature extraction for robust speech recognition in stationary and non-stationary noise
    Zhu, QF
    Alwan, A
    COMPUTER SPEECH AND LANGUAGE, 2003, 17 (04): : 381 - 402
  • [49] Spike-based feature extraction for noise robust speech recognition using phase synchrony coding
    Uysal, Ismail
    Sathyendra, Harsha
    Harris, John G.
    2007 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS, VOLS 1-11, 2007, : 1529 - 1532
  • [50] HMM-Based strategies for enhancement of speech signals embedded in nonstationary noise
    Sameti, H
    Sheikhzadeh, H
    Deng, L
    Brennan, RL
    IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 1998, 6 (05): : 445 - 455