Tone model integration based on discriminative weight training for Putonghua speech recognition

被引:0
|
作者
HUANG Hao ZHU Jie (Department of Electronic Engineering
机构
关键词
mode; MPE; Tone model integration based on discriminative weight training for Putonghua speech recognition; FMD; TSD; SFM; MCD; HMM;
D O I
10.15949/j.cnki.0217-9776.2008.03.007
中图分类号
TN912.34 [语音识别与设备];
学科分类号
摘要
A discriminative framework of tone model integration in continuous speech recog- nition was proposed.The method uses model dependent weights to scale probabilities of the hidden Markov models based on spectral features and tone models based on tonal features. The weights are discriminatively trained by minimum phone error criterion.Update equation of the model weights based on extended Baum-Welch algorithm is derived.Various schemes of model weight combination are evaluated and a smoothing technique is introduced to make training robust to over fitting.The proposed method is evaluated on tonal syllable output and character output speech recognition tasks.The experimental results show the proposed method has obtained 9.5% and 4.7% relative error reduction than global weight on the two tasks due to a better interpolation of the given models.This proves the effectiveness of discriminative trained model weights for tone model integration.
引用
收藏
页码:193 / 202
页数:10
相关论文
共 50 条
  • [21] STEREO-BASED STOCHASTIC MAPPING WITH DISCRIMINATIVE TRAINING FOR NOISE ROBUST SPEECH RECOGNITION
    Cui, Xiaodong
    Afify, Mohamed
    Gao, Yuqing
    2009 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1- 8, PROCEEDINGS, 2009, : 3933 - +
  • [22] Towards discriminative training estimators for HMM speech recognition system
    Frikha, Mondher
    Messaoud, Z. Ben
    Hamida, A. Ben
    Journal of Applied Sciences, 2007, 7 (24) : 3891 - 3899
  • [23] Simultaneous Discriminative Training and Mixture Splitting of HMMs for Speech Recognition
    Tahir, Muhammad Ali
    Nussbaum-Thom, Markus
    Schlueter, Ralf
    Ney, Hermann
    13TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2012 (INTERSPEECH 2012), VOLS 1-3, 2012, : 570 - 573
  • [24] Comparison of discriminative training criteria and optimization methods for speech recognition
    Schlüter, R
    Macherey, W
    Müller, B
    Ney, H
    SPEECH COMMUNICATION, 2001, 34 (03) : 287 - 310
  • [25] OVERVIEW OF LARGE SCALE OPTIMIZATION FOR DISCRIMINATIVE TRAINING IN SPEECH RECOGNITION
    Kanevsky, Dimitri
    Heigold, Georg
    Wright, Stephen
    Ney, Hermann
    2012 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2012, : 5233 - 5236
  • [26] An N-Best Candidates-Based Discriminative Training for Speech Recognition Applications
    Chen, Jung-Kuei
    Soong, Frank K.
    IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 1994, 2 (01): : 206 - 216
  • [27] Recognition of Putonghua voiceless stop like initials based on speech main periods
    OU Guiwen(Institute of Computer Software. Zhongshan University. Guangzhou 510275)
    Chinese Journal of Acoustics, 1994, (01) : 83 - 86
  • [28] A DISCRIMINATIVE MODEL FOR CONTINUOUS SPEECH RECOGNITION BASED ON WEIGHTED FINITE STATE TRANSDUCERS
    Watanabe, Shinji
    Hori, Takaaki
    McDermott, Erik
    Nakamura, Atsushi
    2010 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2010, : 4922 - 4925
  • [29] A New Method for Discriminative Model Combination in Speech Recognition
    Wu Yahui
    Liu Gang
    Guo Jun
    2008 INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND SECURITY, VOLS 1 AND 2, PROCEEDINGS, 2008, : 200 - 203
  • [30] ON LANGUAGE MODEL INTEGRATION FOR RNN TRANSDUCER BASED SPEECH RECOGNITION
    Zhou, Wei
    Zheng, Zuoyun
    Schlueter, Ralf
    Ney, Hermann
    2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 8407 - 8411