Improved robustness of noisy speech HMMs based on weighted variance expansion

Cited by: 0
Authors
Kanno, S [1]
Funada, T [1]
Affiliations
[1] Kanazawa Univ, Ind Res Inst Ishikawa, Kanazawa, Ishikawa 9200223, Japan
Keywords
DOI
Not available
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
The noise spectrum and SNR often vary abruptly under field conditions because the noise is non-stationary. Speech recognition performance degrades rapidly when the noise conditions during recognition differ from those during training or adaptation, so HMMs must be made robust to abrupt variations in noise. In this paper, we propose a method that modifies the output probability at noise-sensitive states by using weighted variance expansion based on the power of the state or the probability distribution, in order to improve performance. The effectiveness of this method was examined for two types of noisy speech HMMs (one trained at a specific SNR, the other trained at five different SNRs) through speaker-independent word recognition experiments using noise from two factories. As a result, the method improved the robustness of the HMMs against variations in noise conditions (noise type and SNR).
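The abstract gives no formulas, so the following is only a rough sketch of what a weighted variance expansion could look like for a diagonal-covariance Gaussian output density. The weighting rule (low-power states are treated as more noise-sensitive and get a larger expansion), the function names, and the parameter `alpha` are all assumptions made for illustration, not the authors' method.

```python
import numpy as np

def log_gauss_diag(x, mean, var):
    """Log density of a diagonal-covariance Gaussian."""
    x, mean, var = (np.asarray(a, dtype=float) for a in (x, mean, var))
    return -0.5 * np.sum(np.log(2.0 * np.pi * var) + (x - mean) ** 2 / var)

def expansion_factor(state_power, max_power, alpha=1.0):
    """Hypothetical weight: 1.0 for the highest-power state,
    up to 1.0 + alpha for a state with zero power."""
    weight = 1.0 - state_power / max_power  # assumed to lie in [0, 1]
    return 1.0 + alpha * weight

def expanded_log_prob(x, mean, var, state_power, max_power, alpha=1.0):
    """Output log probability with the state's variances scaled
    by its (assumed) power-dependent expansion factor."""
    k = expansion_factor(state_power, max_power, alpha)
    return log_gauss_diag(x, mean, np.asarray(var, dtype=float) * k)

# An observation far from the state mean, as might occur under mismatched noise.
x = [3.0, 3.0]
mean, var = [0.0, 0.0], [1.0, 1.0]
plain = log_gauss_diag(x, mean, var)
low_power = expanded_log_prob(x, mean, var, state_power=0.0, max_power=10.0)
high_power = expanded_log_prob(x, mean, var, state_power=10.0, max_power=10.0)
```

In this sketch the low-power state penalizes the outlying observation less than the unmodified density does, while the highest-power state is left unchanged. How the paper actually selects noise-sensitive states and sets the weights is not stated in the abstract.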
Pages: 556 - 559
Number of pages: 4
Related papers
50 in total
  • [31] Improved weighted spatial smoothing algorithm based on virtual array expansion
    Sun Z.
    Bai Q.
    Bai Y.
    Sun R.
    Xi Tong Gong Cheng Yu Dian Zi Ji Shu/Systems Engineering and Electronics, 2023, 45 (01): : 250 - 256
  • [32] Improved Noisy Student Training for Automatic Speech Recognition
    Park, Daniel S.
    Zhang, Yu
    Jia, Ye
    Han, Wei
    Chiu, Chung-Cheng
    Li, Bo
    Wu, Yonghui
    Le, Quoc, V
    INTERSPEECH 2020, 2020, : 2817 - 2821
  • [33] Improved Laplacian Factor Estimation for Noisy Speech Enhancement
    Ou, Shifeng
    Zhao, Xiaohui
    Gao, Ying
    2007 INTERNATIONAL CONFERENCE ON WIRELESS COMMUNICATIONS, NETWORKING AND MOBILE COMPUTING, VOLS 1-15, 2007, : 2911 - +
  • [34] Very low bit rate speech coding based on HMMs
    Hiroi, J., 1600, John Wiley and Sons Inc. (32):
  • [35] Landmark-based Approach to Speech Recognition: An Alternative to HMMs
    Espy-Wilson, Carol Y.
    Pruthi, Tarun
    Juneja, Amit
    Deshmukh, Om
    INTERSPEECH 2007: 8TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION, VOLS 1-4, 2007, : 2516 - +
  • [36] Improved Robustness through Population Variance in Ant Colony Optimization
    Matthews, David C.
    Sutton, Andrew M.
    Hains, Doug
    Whitley, L. Darrell
    ENGINEERING STOCHASTIC LOCAL SEARCH ALGORITHMS: DESIGNING, IMPLEMENTING AND ANALYZING EFFECTIVE HEURISTICS, 2009, 5752 : 145 - 149
  • [37] Robustness of LSTM Neural Networks for the Enhancement of Spectral Parameters in Noisy Speech Signals
    Coto-Jimenez, Marvin
    ADVANCES IN COMPUTATIONAL INTELLIGENCE, MICAI 2018, PT II, 2018, 11289 : 227 - 238
  • [38] An Algorithm for Intelligibility Prediction of Time-Frequency Weighted Noisy Speech
    Taal, Cees H.
    Hendriks, Richard C.
    Heusdens, Richard
    Jensen, Jesper
    IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2011, 19 (07): : 2125 - 2136
  • [39] NOISY SPEECH RECOGNITION BY USING VARIANCE ADAPTED HIDDEN MARKOV-MODELS
    CHIEN, JT
    LEE, LM
    WANG, HC
    ELECTRONICS LETTERS, 1995, 31 (18) : 1555 - 1556
  • [40] DENL: Diverse Ensemble and Noisy Logits for Improved Robustness of Neural Networks
    Yazdani, Mina
    Karimi, Hamed
    Samavi, Reza
    ASIAN CONFERENCE ON MACHINE LEARNING, VOL 222, 2023, 222