Factorial Speech Processing Models for Noise-Robust Automatic Speech Recognition

被引:0
|
作者
Khademian, Mahdi [1 ]
Homayounpour, Mohammad Mehdi [1 ]
机构
[1] Amirkabir Univ Technol, LIMP, Tehran, Iran
关键词
factorial models of speech processing; state-conditional observation distribution; weighted stereo sampling; two-dimensional Viterbi algorithm;
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
This paper presents an introduction of factorial speech processing models for noise-robust automatic speech processing tasks. Factorial models try to use more noise information rather than other robustness techniques for better generative modeling of speech and noise and the way they are combine together. Since factorial models were not completely successful in noise-robust speech processing applications while they have significant achievements in other speech processing areas in the past, we decide to reconsider them and evaluate their effects in the Aurora 2 task. In addition to Aurora noises, two more regular noises are examined in our experiments including Helicopter and Locomotive engine noises. Experiments show that these models are successful when we faced with destructive noises in addition to their unexpected improvements for non-regular non-stationary noises like Babble.
引用
收藏
页码:637 / 642
页数:6
相关论文
共 50 条
  • [31] An FFT-Based Companding Front End for Noise-Robust Automatic Speech Recognition
    Raj, Bhiksha
    Turicchia, Lorenzo
    Schmidt-Nielsen, Bent
    Sarpeshkar, Rahul
    EURASIP JOURNAL ON AUDIO SPEECH AND MUSIC PROCESSING, 2007,
  • [32] Noise-robust speech recognition based on difference of power spectrum
    Xu, JF
    Wei, G
    ELECTRONICS LETTERS, 2000, 36 (14) : 1247 - 1248
  • [33] On the temporal decorrelation of feature parameters for noise-robust speech recognition
    Jung, HY
    Lee, SY
    IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 2000, 8 (04): : 407 - 416
  • [34] Deep Maxout Networks Applied to Noise-Robust Speech Recognition
    de-la-Calle-Silos, F.
    Gallardo-Antolin, A.
    Pelaez-Moreno, C.
    ADVANCES IN SPEECH AND LANGUAGE TECHNOLOGIES FOR IBERIAN LANGUAGES, IBERSPEECH 2014, 2014, 8854 : 109 - 118
  • [35] Employing Robust Principal Component Analysis for Noise-Robust Speech Feature Extraction in Automatic Speech Recognition with the Structure of a Deep Neural Network
    Hung, Jeih-weih
    Lin, Jung-Shan
    Wu, Po-Jen
    APPLIED SYSTEM INNOVATION, 2018, 1 (03) : 1 - 14
  • [36] MULTI-TASK AUTOENCODER FOR NOISE-ROBUST SPEECH RECOGNITION
    Zhang, Haoyi
    Liu, Conggui
    Inoue, Nakamasa
    Shinoda, Koichi
    2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2018, : 5599 - 5603
  • [37] An Efficient and Noise-Robust Audiovisual Encoder for Audiovisual Speech Recognition
    Li, Zhengyang
    Liang, Chenwei
    Lohrenz, Timo
    Sach, Marvin
    Moeller, Bjoern
    Fingscheidt, Tim
    INTERSPEECH 2023, 2023, : 1583 - 1587
  • [38] Noise-Robust Speech Recognition Based on RBF Neural Network
    Hou, Xuemei
    HIGH PERFORMANCE STRUCTURES AND MATERIALS ENGINEERING, PTS 1 AND 2, 2011, 217-218 : 413 - 418
  • [39] Noise-robust speech feature processing with empirical mode decomposition
    Kuo-Hau Wu
    Chia-Ping Chen
    Bing-Feng Yeh
    EURASIP Journal on Audio, Speech, and Music Processing, 2011
  • [40] Unsupervised modulation filter learning for noise-robust speech recognition
    Agrawal, Purvi
    Ganapathy, Sriram
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2017, 142 (03): : 1686 - 1692