Tone model integration based on discriminative weight training for Putonghua speech recognition

被引:0
|
作者
HUANG Hao ZHU Jie (Department of Electronic Engineering
机构
关键词
mode; MPE; Tone model integration based on discriminative weight training for Putonghua speech recognition; FMD; TSD; SFM; MCD; HMM;
D O I
10.15949/j.cnki.0217-9776.2008.03.007
中图分类号
TN912.34 [语音识别与设备];
学科分类号
摘要
A discriminative framework of tone model integration in continuous speech recog- nition was proposed.The method uses model dependent weights to scale probabilities of the hidden Markov models based on spectral features and tone models based on tonal features. The weights are discriminatively trained by minimum phone error criterion.Update equation of the model weights based on extended Baum-Welch algorithm is derived.Various schemes of model weight combination are evaluated and a smoothing technique is introduced to make training robust to over fitting.The proposed method is evaluated on tonal syllable output and character output speech recognition tasks.The experimental results show the proposed method has obtained 9.5% and 4.7% relative error reduction than global weight on the two tasks due to a better interpolation of the given models.This proves the effectiveness of discriminative trained model weights for tone model integration.
引用
收藏
页码:193 / 202
页数:10
相关论文
共 50 条
  • [1] Tone modeling based on hidden conditional random fields and discriminative model weight training
    Department of Electronic Engineering, Shanghai Jiaotong University, Shanghai 200240, China
    Trans. Nanjing Univ. Aero. Astro., 2008, 1 (43-49): : 43 - 49
  • [2] Discriminative Training for Automatic Speech Recognition
    Heigold, Georg
    Ney, Hermann
    Schlueter, Ralf
    Wiesler, Simon
    IEEE SIGNAL PROCESSING MAGAZINE, 2012, 29 (06) : 58 - 69
  • [3] Automatic context induction for tone model integration in mandarin speech recognition
    HUANG HaoLI Binghu Department of Information Science and EngineeringXinjiang UniversityUrumqi China Laboratory of MultiLingual Information TechnologyXinjiang UniversityUrumqi China
    The Journal of China Universities of Posts and Telecommunications, 2012, 19 (01) : 94 - 100
  • [5] Discriminative training of language models for speech recognition
    Kuo, KHJ
    Fosler-Lussier, E
    Jiang, H
    Lee, CH
    2002 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-IV, PROCEEDINGS, 2002, : 325 - 328
  • [6] DISCRIMINATIVE STREAM-WEIGHT TRAINING FOR MANDARIN AUDIO-VISUAL SPEECH RECOGNITION
    Wu, Guanyong
    Zhu, Jie
    JOURNAL OF THE CHINESE INSTITUTE OF ENGINEERS, 2010, 33 (05) : 775 - 780
  • [7] Lattice-based discriminative training for large vocabulary speech recognition
    Valtchev, V
    Odell, JJ
    Woodland, PC
    Young, SJ
    1996 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, CONFERENCE PROCEEDINGS, VOLS 1-6, 1996, : 605 - 608
  • [8] On a Generalization of Margin-Based Discriminative Training to Robust Speech Recognition
    Li, Jinyu
    Lee, Chin-Hui
    INTERSPEECH 2008: 9TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2008, VOLS 1-5, 2008, : 1992 - 1995
  • [9] COMBINING DISCRIMINATIVE FEATURE, TRANSFORM, AND MODEL TRAINING FOR LARGE VOCABULARY SPEECH RECOGNITION
    Zheng, Jing
    Cetin, Ozgur
    Hwang, Mei-Yuh
    Lei, Xin
    Stolcke, Andreas
    Morgan, Nelson
    2007 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL IV, PTS 1-3, 2007, : 633 - +
  • [10] Discriminative training of HMMs for automatic speech recognition: A survey
    Jiang, Hui
    COMPUTER SPEECH AND LANGUAGE, 2010, 24 (04): : 589 - 608