A STUDY ON SPEAKER ADAPTATION FOR MANDARINE SYLLABLE RECOGNITION WITH MINIMUM ERROR DISCRIMINATIVE TRAINING

被引:0
|
作者
LIN, CH
WU, CH
CHANG, PC
机构
关键词
SPEAKER-ADAPTATION; DISCRIMINATIVE TRAINING; MANDARINE SYLLABLE RECOGNITION; CONFUSION SET;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper investigates a different method of speaker adaptation for Mandarin syllable recognition. Based on the minimum classification error (MCE) criterion, we use the generalized probabilistic decent (GPD) algorithm to adjust iteratively the parameters of the hidden Markov models (HMM). The experiments on the multi-speaker Mandarin syllable database of Telecommunication Laboratories (T.L.) yield the following results: 1) Efficient speaker adaptation can be achieved through discriminative training using the MCE criterion and the GPD algorithm. 2) The computations required can be reduced through the use of the confusion sets in Mandarin base syllables. 3) For the discriminative training, the adjustment on the mean values of the Gaussian mixtures has the most prominent effect on speaker adaptation. 4) The discriminative training approach can be used to enhance the speaker adaptation capability of the maximum a posteriori (MAP) approach.
引用
收藏
页码:712 / 718
页数:7
相关论文
共 50 条
  • [31] Study of a fast discriminative training algorithm for pattern recognition
    Li, Qi
    Juang, Biing-Hwang
    IEEE TRANSACTIONS ON NEURAL NETWORKS, 2006, 17 (05): : 1212 - 1221
  • [32] Discriminative Learning of Filterbank Layer within Deep Neural Network Based Speech Recognition for Speaker Adaptation
    Seki, Hiroshi
    Yamamoto, Kazumasa
    Akiba, Tomoyosi
    Nakagawa, Seiichi
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2019, E102D (02) : 364 - 374
  • [33] Minimum error rate training for PHMM-based text recognition
    Yen, CC
    Kuo, SS
    Lee, CH
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 1999, 8 (08) : 1120 - 1124
  • [34] Minimum word classification error training of HMMS for automatic speech recognition
    Yan, Zhi-Jie
    Zhu, Bo
    Hu, Yu
    Wang, Ren-Hua
    2008 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-12, 2008, : 4521 - 4524
  • [35] Minimum Classification Error training of Hidden Markov Models for handwriting recognition
    Biem, AE
    2001 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-VI, PROCEEDINGS: VOL I: SPEECH PROCESSING 1; VOL II: SPEECH PROCESSING 2 IND TECHNOL TRACK DESIGN & IMPLEMENTATION OF SIGNAL PROCESSING SYSTEMS NEURALNETWORKS FOR SIGNAL PROCESSING; VOL III: IMAGE & MULTIDIMENSIONAL SIGNAL PROCESSING MULTIMEDIA SIGNAL PROCESSING - VOL IV: SIGNAL PROCESSING FOR COMMUNICATIONS; VOL V: SIGNAL PROCESSING EDUCATION SENSOR ARRAY & MULTICHANNEL SIGNAL PROCESSING AUDIO & ELECTROACOUSTICS; VOL VI: SIGNAL PROCESSING THEORY & METHODS STUDENT FORUM, 2001, : 1529 - 1532
  • [36] Improving the characterization of the alternative hypothesis via minimum verification error training with applications to speaker verification
    Chao, Yi-Hsiang
    Tsai, Wei-Ho
    Wang, Hsin-Min
    Chang, Ruei-Chuan
    PATTERN RECOGNITION, 2009, 42 (07) : 1351 - 1360
  • [37] A Study on the Search of the Most Discriminative Speech Features in the Speaker Dependent Speech Emotion Recognition
    Pao, Tsang-Long
    Wang, Chun-Hsiang
    Li, Yu-Ji
    2012 FIFTH INTERNATIONAL SYMPOSIUM ON PARALLEL ARCHITECTURES, ALGORITHMS AND PROGRAMMING (PAAP), 2012, : 157 - 162
  • [38] Lattice segmentation and minimum Bayes risk discriminative training for large vocabulary continuous speech recognition
    Doumpiotis, V
    Byrne, W
    SPEECH COMMUNICATION, 2006, 48 (02) : 142 - 160
  • [39] Audio-visual speech recognition using minimum classification error training
    Miyajima, C
    Tokuda, K
    Kitamura, T
    NEURAL NETWORKS FOR SIGNAL PROCESSING X, VOLS 1 AND 2, PROCEEDINGS, 2000, : 3 - 12
  • [40] MINIMUM CLASSIFICATION ERROR TRAINING WITH GEOMETRIC MARGIN ENHANCEMENT FOR ROBUST PATTERN RECOGNITION
    Watanabe, Hideyuki
    Katagiri, Shigeru
    Ohsaki, Miho
    2011 IEEE INTERNATIONAL WORKSHOP ON MACHINE LEARNING FOR SIGNAL PROCESSING (MLSP), 2011,