Comparison of Feature Extraction Methods for Speech Recognition in Noise-Free and in Traffic Noise Environment

被引:0
|
作者
Sarosi, Gellert [1 ]
Mozsary, Mihaly [1 ]
Mihajlik, Peter [1 ,2 ]
Fegyo, Tibor [1 ,3 ]
机构
[1] Budapest Univ Technol & Econ, Dept Telecommun & Media Informat, Budapest, Hungary
[2] THINKTech Res Ctr Nonprofit LLC, Budapest, Hungary
[3] Aitia Int Inc, Budapest, Hungary
关键词
feature extraction; multiple languages; multiple sample rates; real-life and white noise; varied SNR;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
A crucial part of a speech recognizer is the acoustic feature extraction, especially when the application is intended to be used in noisy environment. In this paper we investigate several novel front-end techniques and compare them to multiple baselines. Recognition tests were performed on studio quality wide band recordings on Hungarian as well as on narrow band telephone speech including real-life noises collected in six languages: English, German, French, Italian, Spanish and Hungarian. The following baseline feature types were used with several settings: Mel Frequency Cepstral Coefficients (MFCC), Perceptual Linear Prediction (PLP) features implemented in HTK, SPHINX, or by ourselves. Novel methods include Perceptual Minimum Variance Distortionless Response (PMVDR) and multiple variations of the Power-Normalized Cepstral Coefficients (PNCC). Also, adaptive techniques are applied to reduce convolutive distortions. We have experienced a significant difference between the MFCC implementations, and there were major differences in the PNCC variations useful in the different bandwidths and noise conditions.
引用
收藏
页数:8
相关论文
共 50 条
  • [21] Acoustical feature extraction from aircraft and traffic noise
    Univ of Sydney
    Acoust Aust, 1 (5-10):
  • [22] Robust noise suppression methods in speech recognition
    Cui, Yi
    Zhang, Dong
    Shi, Liangping
    Chen, Liyuan
    Beijing Youdian Xueyuan Xuebao/Journal of Beijing University of Posts And Telecommunications, 1998, 21 (02): : 10 - 14
  • [23] Noise-free logical stochastic resonance
    Gupta, Animesh
    Sohane, Aman
    Kohar, Vivek
    Murali, K.
    Sinha, Sudeshna
    PHYSICAL REVIEW E, 2011, 84 (05):
  • [24] Quasi noise-free digital holography
    Vittorio Bianco
    Pasquale Memmolo
    Melania Paturzo
    Andrea Finizio
    Bahram Javidi
    Pietro Ferraro
    Light: Science & Applications, 2016, 5 : e16142 - e16142
  • [25] Congruent Visual Speech Enhances Cortical Entrainment to Continuous Auditory Speech in Noise-Free Conditions
    Crosse, Michael J.
    Butler, John S.
    Lalor, Edmund C.
    JOURNAL OF NEUROSCIENCE, 2015, 35 (42): : 14195 - 14204
  • [26] On the Speech Properties and Feature Extraction Methods in Speech Emotion Recognition
    Kacur, Juraj
    Puterka, Boris
    Pavlovicova, Jarmila
    Oravec, Milos
    SENSORS, 2021, 21 (05) : 1 - 27
  • [27] A NOVEL NOISE-FREE PIXELS BASED IMPULSE NOISE FILTERING
    Majid, Abdul
    Mahmood, Muhammad Tariq
    Choi, Tae-Sun
    2010 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, 2010, : 125 - 128
  • [28] Suppressing aliasing noise in the speech feature domain for automatic speech recognition
    Deng, Huiqun
    O'Shaughnessy, Douglas
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2008, 124 (01): : EL8 - EL13
  • [29] Noise Robust Speech Recognition Based on Noise-Adapted HMMs Using Speech Feature Compensation
    Chung, Yong-Joo
    2013 INTERNATIONAL CONFERENCE ON ADVANCED COMPUTER SCIENCE APPLICATIONS AND TECHNOLOGIES (ACSAT), 2014, : 132 - 135
  • [30] Recovering noise-free quantum observables
    Otten, Matthew
    Gray, Stephen K.
    PHYSICAL REVIEW A, 2019, 99 (01)