Simultaneous Discriminative Training and Mixture Splitting of HMMs for Speech Recognition

被引:0
|
作者
Tahir, Muhammad Ali [1 ]
Nussbaum-Thom, Markus [1 ]
Schlueter, Ralf [1 ]
Ney, Hermann [1 ]
机构
[1] Rhein Westfal TH Aachen, Dept Comp Sci, Lehrstuhl Informat 6, Aachen, Germany
关键词
speech recognition; log linear modelling; discriminative training; MODELS;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
A method is proposed to incorporate mixture density splitting into the acoustic model discriminative training for speech recognition. The standard method is to obtain a high resolution acoustic model by maximum likelihood training and density splitting, and then improving this model by discriminative training. We choose a log-linear form of acoustic model because for a single Gaussian density per triphone state the log-linear MMI optimization is a convex optimization problem, and by further splitting and discriminative training of this model we can get a higher complexity model. Previously it was shown that we achieve large gains in the objective function and corresponding moderate gains in the word error rate on a large vocabulary corpus. This paper incorporates the state of the art minimum phone error training criterion into the framework, and shows that after discriminative splitting, a subsequent log-linear MPE training achieves better results than Gaussian mixture model MPE optimization alone.
引用
收藏
页码:570 / 573
页数:4
相关论文
共 50 条
  • [1] Discriminative training of HMMs for automatic speech recognition: A survey
    Jiang, Hui
    COMPUTER SPEECH AND LANGUAGE, 2010, 24 (04): : 589 - 608
  • [2] A BOUNDED TRUST REGION OPTIMIZATION FOR DISCRIMINATIVE TRAINING OF HMMS IN SPEECH RECOGNITION
    Liu, Cong
    Hu, Yu
    Jiang, Hui
    Dai, Li-Rong
    2010 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2010, : 4914 - 4917
  • [3] Discriminative training of tied mixture density HMMs for online handwritten digit recognition
    Nopsuwanchai, R
    Biem, A
    2003 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL II, PROCEEDINGS: SPEECH II; INDUSTRY TECHNOLOGY TRACKS; DESIGN & IMPLEMENTATION OF SIGNAL PROCESSING SYSTEMS; NEURAL NETWORKS FOR SIGNAL PROCESSING, 2003, : 817 - 820
  • [4] Discriminative Training of Variable-Parameter HMMs for Noise Robust Speech Recognition
    Yu, Dong
    Deng, Li
    Gong, Yifan
    Acero, Alex
    INTERSPEECH 2008: 9TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2008, VOLS 1-5, 2008, : 285 - 288
  • [5] Generalized mixture of HMMs for continuous speech recognition
    Korkmazskiy, F
    Juang, BH
    Soong, F
    1997 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I - V: VOL I: PLENARY, EXPERT SUMMARIES, SPECIAL, AUDIO, UNDERWATER ACOUSTICS, VLSI; VOL II: SPEECH PROCESSING; VOL III: SPEECH PROCESSING, DIGITAL SIGNAL PROCESSING; VOL IV: MULTIDIMENSIONAL SIGNAL PROCESSING, NEURAL NETWORKS - VOL V: STATISTICAL SIGNAL AND ARRAY PROCESSING, APPLICATIONS, 1997, : 1443 - 1446
  • [6] Boosted Mixture Learning of Gaussian Mixture HMMs for Speech Recognition
    Du, Jun
    Hu, Yu
    Jiang, Hui
    11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 3 AND 4, 2010, : 2942 - +
  • [7] Discriminative training of Gaussian mixture models for large vocabulary speech recognition systems
    Bahl, LR
    Padmanabhan, M
    Nahamoo, D
    Gopalakrishnan, PS
    1996 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, CONFERENCE PROCEEDINGS, VOLS 1-6, 1996, : 613 - 616
  • [8] Lecture Speech Recognition Using Discrete-Mixture HMMs
    Kosaka, Tetsuo
    Yamamoto, Akiyoshi
    Kumakura, Takuya
    Kato, Masaharu
    Kohda, Masaki
    IEEJ TRANSACTIONS ON ELECTRICAL AND ELECTRONIC ENGINEERING, 2011, 6 (01) : 23 - 29
  • [9] Robust speech recognition using discrete-mixture HMMs
    Kosaka, T
    Katoh, M
    Kohda, M
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2005, E88D (12): : 2811 - 2818
  • [10] Discriminative Training for Automatic Speech Recognition
    Heigold, Georg
    Ney, Hermann
    Schlueter, Ralf
    Wiesler, Simon
    IEEE SIGNAL PROCESSING MAGAZINE, 2012, 29 (06) : 58 - 69