A STUDY ON SPEAKER ADAPTATION FOR MANDARIN SYLLABLE RECOGNITION WITH MINIMUM ERROR DISCRIMINATIVE TRAINING

Cited by: 0

Authors:

LIN, CH

WU, CH

CHANG, PC

Affiliations:

Source:

IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS | 1995 / Vol. E78D / No. 06

Keywords:

SPEAKER-ADAPTATION; DISCRIMINATIVE TRAINING; MANDARIN SYLLABLE RECOGNITION; CONFUSION SET;

DOI:

Not available

Chinese Library Classification (CLC) number:

TP [Automation Technology, Computer Technology];

Discipline classification code:

0812 ;

Abstract:

This paper investigates a different method of speaker adaptation for Mandarin syllable recognition. Based on the minimum classification error (MCE) criterion, we use the generalized probabilistic descent (GPD) algorithm to iteratively adjust the parameters of the hidden Markov models (HMMs). Experiments on the multi-speaker Mandarin syllable database of Telecommunication Laboratories (T.L.) yield the following results: 1) Efficient speaker adaptation can be achieved through discriminative training using the MCE criterion and the GPD algorithm. 2) The computation required can be reduced by using the confusion sets of Mandarin base syllables. 3) In discriminative training, adjusting the mean values of the Gaussian mixtures has the most prominent effect on speaker adaptation. 4) The discriminative training approach can be used to enhance the speaker adaptation capability of the maximum a posteriori (MAP) approach.
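
To make results 1) and 2) more concrete, the following is a minimal sketch of a smoothed MCE loss for a single training token, with the competing classes restricted to a confusion set. It follows the commonly used textbook form of the MCE misclassification measure and is not taken from the paper itself. The function name `mce_loss`, the smoothing constants `eta` and `gamma`, and the syllable labels and scores are all hypothetical, chosen only for illustration.

```python
import math


def mce_loss(scores, correct, confusion_set, eta=2.0, gamma=1.0):
    """Smoothed MCE loss for one token (illustrative sketch only).

    scores        -- dict mapping class label to a discriminant score,
                     e.g. an HMM log-likelihood (hypothetical values)
    correct       -- label of the true class
    confusion_set -- labels allowed to compete with the true class
    eta, gamma    -- assumed smoothing constants, not values from the paper
    """
    competitors = [scores[c] for c in confusion_set if c != correct]
    if not competitors:
        raise ValueError("confusion set has no competitor for this class")
    # Smoothed maximum over competitor scores, computed with the
    # log-sum-exp trick for numerical stability.
    m = max(competitors)
    anti = m + math.log(
        sum(math.exp(eta * (s - m)) for s in competitors) / len(competitors)
    ) / eta
    # Misclassification measure: positive when a competitor outscores
    # the correct class, negative otherwise.
    d = -scores[correct] + anti
    # Sigmoid turns the measure into a differentiable 0-to-1 error count.
    return 1.0 / (1.0 + math.exp(-gamma * d))


# Hypothetical log-likelihood scores for three base syllables.
example_scores = {"ba": -10.0, "pa": -12.0, "da": -15.0}
print(mce_loss(example_scores, "ba", {"ba", "pa"}))
```

Restricting the competitors to a confusion set means only a few models are scored per token instead of all base syllables, which is the kind of saving result 2) refers to. A GPD step would then move the model parameters, such as the Gaussian means, a small distance along the negative gradient of this loss.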

Pages: 712-718

Page count: 7
