Comparison of Feature Extraction Methods for Speech Recognition in Noise-Free and in Traffic Noise Environment

被引:0
|
作者
Sarosi, Gellert [1 ]
Mozsary, Mihaly [1 ]
Mihajlik, Peter [1 ,2 ]
Fegyo, Tibor [1 ,3 ]
机构
[1] Budapest Univ Technol & Econ, Dept Telecommun & Media Informat, Budapest, Hungary
[2] THINKTech Res Ctr Nonprofit LLC, Budapest, Hungary
[3] Aitia Int Inc, Budapest, Hungary
关键词
feature extraction; multiple languages; multiple sample rates; real-life and white noise; varied SNR;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
A crucial part of a speech recognizer is the acoustic feature extraction, especially when the application is intended to be used in noisy environment. In this paper we investigate several novel front-end techniques and compare them to multiple baselines. Recognition tests were performed on studio quality wide band recordings on Hungarian as well as on narrow band telephone speech including real-life noises collected in six languages: English, German, French, Italian, Spanish and Hungarian. The following baseline feature types were used with several settings: Mel Frequency Cepstral Coefficients (MFCC), Perceptual Linear Prediction (PLP) features implemented in HTK, SPHINX, or by ourselves. Novel methods include Perceptual Minimum Variance Distortionless Response (PMVDR) and multiple variations of the Power-Normalized Cepstral Coefficients (PNCC). Also, adaptive techniques are applied to reduce convolutive distortions. We have experienced a significant difference between the MFCC implementations, and there were major differences in the PNCC variations useful in the different bandwidths and noise conditions.
引用
收藏
页数:8
相关论文
共 50 条
  • [1] Noise-free
    Jaeger, A
    KUNSTSTOFFE-PLAST EUROPE, 2000, 90 (09): : 40 - +
  • [2] Noise-free images
    Horiuchi, Noriaki
    NATURE PHOTONICS, 2016, 10 (11) : 692 - 692
  • [3] Noise-free images
    Noriaki Horiuchi
    Nature Photonics, 2016, 10 : 693 - 693
  • [4] Nonlinear noise compensation in feature domain for speech recognition with numerical methods
    Jiang, H
    Wang, Q
    2004 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL I, PROCEEDINGS: SPEECH PROCESSING, 2004, : 985 - 988
  • [5] Speech Recognition in High Noise Environment
    Tang, Chunling
    Li, Min
    EKOLOJI, 2019, 28 (107): : 1561 - 1565
  • [6] COMPARISON OF NOISE ROBUST METHODS IN LARGE VOCABULARY SPEECH RECOGNITION
    Keronen, Sami
    Remes, Ulpu
    Palomaki, Kalle J.
    Virtanen, Tuomas
    Kurimo, Mikko
    18TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO-2010), 2010, : 1973 - 1977
  • [7] Comparison of Feature Extraction Methods for Automated Target Recognition by Reducing Speckle Noise in SAR Data
    El Hasnaouy, Hasna
    Kasapoglu, Necip Gokhan
    2023 10TH INTERNATIONAL CONFERENCE ON RECENT ADVANCES IN AIR AND SPACE TECHNOLOGIES, RAST, 2023,
  • [8] Evaluation of different feature extraction methods for speech recognition in car environment
    Wolf, Martin
    Nadeu, Climent
    PROCEEDINGS OF IWSSIP 2008: 15TH INTERNATIONAL CONFERENCE ON SYSTEMS, SIGNALS AND IMAGE PROCESSING, 2008, : 359 - 362
  • [9] System for noise-free driving
    Verfürth, Gerd
    2001, Springer Nature (103) : 13 - 14
  • [10] The mathematics of noise-free SPSA
    Gerencsér, L
    Vágó, Z
    PROCEEDINGS OF THE 40TH IEEE CONFERENCE ON DECISION AND CONTROL, VOLS 1-5, 2001, : 4400 - 4405