Tone model integration based on discriminative weight training for Putonghua speech recognition

Authors
HUANG Hao, ZHU Jie (Department of Electronic Engineering
Affiliation
Keywords
mode; MPE; Tone model integration based on discriminative weight training for Putonghua speech recognition; FMD; TSD; SFM; MCD; HMM;
DOI
10.15949/j.cnki.0217-9776.2008.03.007
Chinese Library Classification number
TN912.34 [Speech recognition and equipment];
Discipline classification code
Abstract
A discriminative framework for tone model integration in continuous speech recognition is proposed. The method uses model-dependent weights to scale the probabilities of the hidden Markov models based on spectral features and of the tone models based on tonal features. The weights are discriminatively trained using the minimum phone error criterion. An update equation for the model weights, based on the extended Baum-Welch algorithm, is derived. Various schemes of model weight combination are evaluated, and a smoothing technique is introduced to make training robust to overfitting. The proposed method is evaluated on tonal syllable output and character output speech recognition tasks. The experimental results show that the proposed method achieves 9.5% and 4.7% relative error reductions over a global weight on the two tasks, owing to a better interpolation of the given models. This demonstrates the effectiveness of discriminatively trained model weights for tone model integration.
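The weighted scaling described in the abstract can be illustrated with a minimal Python sketch. It assumes a log-linear combination, in which each model's log-probability is multiplied by a weight looked up by model name; the abstract does not give the exact formula, so this form, the function name `combined_log_score`, the segment layout, and all numeric values are hypothetical and are not taken from the paper. The weights here are fixed by hand; MPE training of the weights is not shown.

```python
def combined_log_score(segments, spectral_weights, tone_weights, default_weight=1.0):
    """Sum weighted log-probabilities over the segments of one hypothesis.

    Each segment is a tuple (hmm_name, hmm_logp, tone_name, tone_logp).
    Weights are looked up per model name (model-dependent weights);
    names missing from a weight table fall back to default_weight.
    This log-linear form is an assumption made for illustration.
    """
    total = 0.0
    for hmm_name, hmm_logp, tone_name, tone_logp in segments:
        total += spectral_weights.get(hmm_name, default_weight) * hmm_logp
        total += tone_weights.get(tone_name, default_weight) * tone_logp
    return total


# Hypothetical two-segment hypothesis with made-up log-probabilities.
segments = [
    ("a", -10.0, "tone1", -2.0),
    ("b", -8.0, "tone4", -3.0),
]

# Global weight: every tone model shares one scale factor.
global_score = combined_log_score(
    segments, spectral_weights={}, tone_weights={"tone1": 0.5, "tone4": 0.5}
)

# Model-dependent weights: each tone model has its own scale factor.
dependent_score = combined_log_score(
    segments, spectral_weights={}, tone_weights={"tone1": 0.8, "tone4": 0.2}
)
```

In this sketch a single global weight is just the special case in which every entry of a weight table holds the same value, so the two schemes can be compared with the same scoring function.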
Pages: 193-202
Number of pages: 10