Simultaneous Discriminative Training and Mixture Splitting of HMMs for Speech Recognition

Authors
Tahir, Muhammad Ali [1]
Nussbaum-Thom, Markus [1]
Schlueter, Ralf [1]
Ney, Hermann [1]
Affiliation
[1] Rhein Westfal TH Aachen, Dept Comp Sci, Lehrstuhl Informat 6, Aachen, Germany
Keywords
speech recognition; log linear modelling; discriminative training; MODELS;
DOI
Not available
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Subject classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
A method is proposed to incorporate mixture density splitting into discriminative training of the acoustic model for speech recognition. The standard approach is to obtain a high-resolution acoustic model by maximum likelihood training and density splitting, and then to improve this model by discriminative training. We choose a log-linear form of the acoustic model because, with a single Gaussian density per triphone state, log-linear maximum mutual information (MMI) optimization is a convex optimization problem; further splitting and discriminative training of this model then yield a model of higher complexity. Previous work showed that this approach achieves large gains in the objective function and corresponding moderate gains in word error rate on a large-vocabulary corpus. This paper incorporates the state-of-the-art minimum phone error (MPE) training criterion into the framework and shows that, after discriminative splitting, subsequent log-linear MPE training achieves better results than Gaussian mixture model MPE optimization alone.
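The link between a single Gaussian density per state and a log-linear model can be illustrated with a small sketch. The Python below is not the authors' implementation: it assumes diagonal covariances, a toy two-state setup, and hypothetical names (`gaussian_to_loglinear`, `STATES`). It only checks that a Gaussian with a state prior can be rewritten as a log-linear model over first- and second-order features with the same state posteriors; it does not perform MMI or MPE training or density splitting.

```python
import math

def gaussian_to_loglinear(mean, var, prior):
    """Map a diagonal-covariance Gaussian plus prior to log-linear parameters.

    Returns (quad, lin, bias) such that
    bias + sum(quad[d]*x[d]**2 + lin[d]*x[d]) == log(prior) + log N(x; mean, var).
    """
    quad = [-0.5 / v for v in var]                 # weights of second-order features
    lin = [m / v for m, v in zip(mean, var)]       # weights of first-order features
    bias = math.log(prior) + sum(
        -0.5 * m * m / v - 0.5 * math.log(2.0 * math.pi * v)
        for m, v in zip(mean, var)
    )
    return quad, lin, bias

def loglinear_score(x, params):
    """Unnormalized log-linear score of one state for feature vector x."""
    quad, lin, bias = params
    return bias + sum(q * xi * xi + l * xi for q, l, xi in zip(quad, lin, x))

def gaussian_log_joint(x, mean, var, prior):
    """log(prior) + log density of a diagonal-covariance Gaussian."""
    return math.log(prior) + sum(
        -0.5 * math.log(2.0 * math.pi * v) - 0.5 * (xi - m) ** 2 / v
        for xi, m, v in zip(x, mean, var)
    )

def softmax(scores):
    """Normalize log scores into posteriors (shifted by the max for stability)."""
    top = max(scores)
    exps = [math.exp(s - top) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]

# Hypothetical toy states: (mean, variance, prior), chosen only for illustration.
STATES = [
    ([0.0, 1.0], [1.0, 2.0], 0.6),
    ([2.0, -1.0], [0.5, 1.5], 0.4),
]

def posteriors_gaussian(x):
    return softmax([gaussian_log_joint(x, m, v, p) for m, v, p in STATES])

def posteriors_loglinear(x):
    params = [gaussian_to_loglinear(m, v, p) for m, v, p in STATES]
    return softmax([loglinear_score(x, prm) for prm in params])

if __name__ == "__main__":
    x = [0.5, 0.2]
    print(posteriors_gaussian(x))
    print(posteriors_loglinear(x))
```

Both functions should return the same posteriors for any input vector, because the log-linear parameters are an exact reparameterization of the Gaussian and its prior.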
Pages: 570-573
Number of pages: 4