Comparison of Feature Extraction Methods for Speech Recognition in Noise-Free and in Traffic Noise Environment

被引:0
|
作者
Sarosi, Gellert [1 ]
Mozsary, Mihaly [1 ]
Mihajlik, Peter [1 ,2 ]
Fegyo, Tibor [1 ,3 ]
机构
[1] Budapest Univ Technol & Econ, Dept Telecommun & Media Informat, Budapest, Hungary
[2] THINKTech Res Ctr Nonprofit LLC, Budapest, Hungary
[3] Aitia Int Inc, Budapest, Hungary
关键词
feature extraction; multiple languages; multiple sample rates; real-life and white noise; varied SNR;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
A crucial part of a speech recognizer is the acoustic feature extraction, especially when the application is intended to be used in noisy environment. In this paper we investigate several novel front-end techniques and compare them to multiple baselines. Recognition tests were performed on studio quality wide band recordings on Hungarian as well as on narrow band telephone speech including real-life noises collected in six languages: English, German, French, Italian, Spanish and Hungarian. The following baseline feature types were used with several settings: Mel Frequency Cepstral Coefficients (MFCC), Perceptual Linear Prediction (PLP) features implemented in HTK, SPHINX, or by ourselves. Novel methods include Perceptual Minimum Variance Distortionless Response (PMVDR) and multiple variations of the Power-Normalized Cepstral Coefficients (PNCC). Also, adaptive techniques are applied to reduce convolutive distortions. We have experienced a significant difference between the MFCC implementations, and there were major differences in the PNCC variations useful in the different bandwidths and noise conditions.
引用
收藏
页数:8
相关论文
共 50 条
  • [41] Traffic noise in Europe: A comparison of calculation methods, noise indices and noise standards for road and railroad traffic in Europe
    Nijland, HA
    Van Wee, GP
    TRANSPORT REVIEWS, 2005, 25 (05) : 591 - 612
  • [42] Noise-free normalized fringe patterns and local pixel transforms for strain extraction
    Yu, Qifeng
    Andresen, Klaus
    Osten, Wolfgang
    Jueptner, Werner
    Applied Optics, 1996, 35 (20): : 3783 - 3790
  • [43] Comparison of two methods for predicting traffic noise
    Gabillet, Y
    Dulau, B
    Laporte, JC
    Zouboff, JC
    Soulage, D
    Besnard, F
    INTER-NOISE 96 - THE 1996 INTERNATIONAL CONGRESS ON NOISE CONTROL ENGINEERING, 25TH ANNIVERSARY CONGRESS - LIVERPOOL, PROCEEDINGS, BOOKS 1-6: NOISE CONTROL - THE NEXT 25 YEARS, 1996, : 3133 - 3138
  • [44] Comparison of Two Methods for Predicting Traffic Noise
    Proc Int Conf Noise Control Eng, 6 (3133):
  • [45] Modified feature extraction methods in robust speech recognition
    Rajnoha, Josef
    Pollak, Petr
    2007 17TH INTERNATIONAL CONFERENCE RADIOELEKTRONIKA, VOLS 1 AND 2, 2007, : 337 - +
  • [46] Fractional optical cryptographic protocol for data containers in a noise-free multiuser environment
    Jaramillo, Alexis
    Fredy Barrera, John
    Velez Zea, Alejandro
    Torroba, Roberto
    OPTICS AND LASERS IN ENGINEERING, 2018, 102 : 119 - 125
  • [47] Noise robust speech parameterization using multiresolution feature extraction
    Hariharan, R
    Kiss, I
    Viikki, O
    IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 2001, 9 (08): : 856 - 865
  • [48] Feature extraction of speech signals of exoskeleton devices in noise environments
    Chen, Wen-Jie
    Su, Zhen-Xing
    Sun, Xian-Tao
    Liu, Yuan-Yuan
    Hu, Xiang-Tao
    Zhi, Ya-Li
    Jilin Daxue Xuebao (Gongxueban)/Journal of Jilin University (Engineering and Technology Edition), 2024, 54 (10): : 3050 - 3057
  • [49] Research on the Continuous Speech Feature Extraction Method for Different Noise
    Liu Wei
    Sun Yiming
    Liu Yanxiu
    APPLIED SCIENCE, MATERIALS SCIENCE AND INFORMATION TECHNOLOGIES IN INDUSTRY, 2014, 513-517 : 3589 - 3592
  • [50] Bayesian Feature Enhancement for Reverberation and Noise Robust Speech Recognition
    Leutnant, Volker
    Krueger, Alexander
    Haeb-Umbach, Reinhold
    IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2013, 21 (08): : 1640 - 1652