A discriminative and robust training algorithm for noisy speech recognition

被引:0
|
作者
Hong, WT
机构
关键词
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
A combined technique of discriminative and robust training algorithms, referred to as the D-REST (Discriminative and Robust Environment-effects Suppression Training), is proposed for noisy speech recognition. The D-REST technique can separately model the environmental characteristics and phonetic information and thus it can train speech models discriminatively on phonetic variability by eliminating the disturbance of environment-specific effects. According to the experimental results of Taiwan stock name recognition task over wireless network, the proposed D-REST algorithm has the potential to improve performance not only on diverse training data but also on noise-type unmatched environments between training and testing. Furthermore, the usage of the D-REST algorithm amounted to a 60% reduction in average word error rate over the performance by the conventional MCE/GPD-based training approach without environment-effects suppression training technique.
引用
收藏
页码:8 / 11
页数:4
相关论文
共 50 条
  • [31] A GENERAL DISCRIMINATIVE TRAINING ALGORITHM FOR SPEECH RECOGNITION USING WEIGHTED FINITE-STATE TRANSDUCERS
    Zhao, Yong
    Ljolje, Andrej
    Caseiro, Diamantino
    Juang, Biing-Hwang
    2012 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2012, : 4217 - 4220
  • [32] Discriminative auditory-based features for robust speech recognition
    Mak, BKW
    Tam, YC
    Li, PQ
    IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 2004, 12 (01): : 27 - 36
  • [33] STRUCTURED DISCRIMINATIVE MODELS FOR NOISE ROBUST CONTINUOUS SPEECH RECOGNITION
    Ragni, A.
    Gales, M. J. F.
    2011 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2011, : 4788 - 4791
  • [34] Discriminative classifiers with adaptive kernels for noise robust speech recognition
    Gales, M. J. F.
    Flego, F.
    COMPUTER SPEECH AND LANGUAGE, 2010, 24 (04): : 648 - 662
  • [35] Speech Emotion Recognition Based on Robust Discriminative Sparse Regression
    Song, Peng
    Zheng, Wenming
    Yu, Yanwei
    Ou, Shifeng
    IEEE TRANSACTIONS ON COGNITIVE AND DEVELOPMENTAL SYSTEMS, 2021, 13 (02) : 343 - 353
  • [36] Robust speech recognition based on discriminative learning of environmental features
    Han, J.Q.
    Gao, W.
    Tien Tzu Hsueh Pao/Acta Electronica Sinica, 2001, 29 (02): : 196 - 198
  • [37] A robust endpoint detection of speech for noisy environments with application to automatic speech recognition
    Bou-Ghazale, SE
    Assaleh, K
    2002 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-IV, PROCEEDINGS, 2002, : 3808 - 3811
  • [38] A robust speech recognition system for communication robots in noisy environments
    Ishi, Carlos Toshinori
    Matsuda, Shigeki
    Kanda, Takayuki
    Jitsuhiro, Takatoshi
    Ishiguro, Hiroshi
    Nakamura, Satoshi
    Hagita, Norihiro
    IEEE TRANSACTIONS ON ROBOTICS, 2008, 24 (03) : 759 - 763
  • [39] Linearized distortion model for robust speech recognition in noisy environments
    He, Yong-Jun
    Han, Ji-Qing
    Tongxin Xuebao/Journal on Communications, 2010, 31 (09): : 8 - 14
  • [40] Robust noisy speech recognition with adaptive frequency bank selection
    Tian, Y
    Wu, J
    Wang, ZY
    Lu, DJ
    FOURTH IEEE INTERNATIONAL CONFERENCE ON MULTIMODAL INTERFACES, PROCEEDINGS, 2002, : 75 - 80