Binaural bark subband preprocessing of nonstationary signals for noise robust speech feature extraction

被引:0
|
作者
Peters, M [1 ]
机构
[1] BMW AG, Ctr Res & Dev, D-80788 Munich, Germany
来源
PROCEEDINGS OF THE IEEE-SP INTERNATIONAL SYMPOSIUM ON TIME-FREQUENCY AND TIME-SCALE ANALYSIS | 1998年
关键词
D O I
10.1109/TFSA.1998.721498
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
A two channel approach to noise robust feature extraction for speech recognition in the car is proposed. The coherence function within the Bark subbands of the Mel-Frequency-Cepstral-Transform is calculated to estimate the spectral similarity of two statistic processes. It is illustrated how the coherence of speech in binaural signals is used to increase the robustness against incoherent noise. The introduced preprocessing method of nonstationary signals in two microphones results in an additive correction term of the Mel-Frequency-Cepstral-Coefficients.
引用
收藏
页码:609 / 612
页数:4
相关论文
共 50 条
  • [31] Enhanced Face Preprocessing and Feature Extraction Methods Robust to Illumination Variation
    Kim, Dong-Ju
    Sohn, Myoung-Kyu
    Kim, Hyunduk
    Ryu, Nuri
    COMPUTATIONAL COLLECTIVE INTELLIGENCE: TECHNOLOGIES AND APPLICATIONS, ICCCI 2014, 2014, 8733 : 223 - 232
  • [32] Syntactic and Semantic Feature Extraction and Preprocessing to Reduce Noise in Bug Classification
    Agrawal, Ruchi
    Reddy, G. Ram Mohan
    WIRELESS NETWORKS AND COMPUTATIONAL INTELLIGENCE, ICIP 2012, 2012, 292 : 329 - 339
  • [33] Robust treatment of impulsive noise in speech and audio signals
    Godsill, SJ
    Rayner, PJW
    BAYESIAN ROBUSTNESS, 1996, 29 : 331 - 342
  • [34] A Noise Robust Technique for Detecting Vowels in Speech Signals
    Kumar, Avinash
    Shahnawazuddin, S.
    Ahmad, Waquar
    INTERSPEECH 2020, 2020, : 3680 - 3684
  • [35] A posterior union model for improved robust speech recognition in nonstationary noise
    Ming, J
    Smith, FJ
    2003 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL I, PROCEEDINGS: SPEECH PROCESSING I, 2003, : 420 - 423
  • [36] HIERARCHICAL CLASSIFICATION TREE MODELING OF NONSTATIONARY NOISE FOR ROBUST SPEECH RECOGNITION
    Zelinka, Petr
    Sigmund, Milan
    INFORMATION TECHNOLOGY AND CONTROL, 2010, 39 (03): : 202 - 210
  • [37] Combining speech enhancement and auditory feature extraction for robust speech recognition
    Kleinschmidt, M
    Tchorz, J
    Kollmeier, B
    SPEECH COMMUNICATION, 2001, 34 (1-2) : 75 - 91
  • [38] MVDR based feature extraction for robust speech recognition
    Dharanipragada, S
    Rao, BD
    2001 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-VI, PROCEEDINGS: VOL I: SPEECH PROCESSING 1; VOL II: SPEECH PROCESSING 2 IND TECHNOL TRACK DESIGN & IMPLEMENTATION OF SIGNAL PROCESSING SYSTEMS NEURALNETWORKS FOR SIGNAL PROCESSING; VOL III: IMAGE & MULTIDIMENSIONAL SIGNAL PROCESSING MULTIMEDIA SIGNAL PROCESSING, 2001, : 309 - 312
  • [39] Modified feature extraction methods in robust speech recognition
    Rajnoha, Josef
    Pollak, Petr
    2007 17TH INTERNATIONAL CONFERENCE RADIOELEKTRONIKA, VOLS 1 AND 2, 2007, : 337 - +
  • [40] Discriminative temporal feature extraction for robust speech recognition
    Shen, JL
    ELECTRONICS LETTERS, 1997, 33 (19) : 1598 - 1600