A speaker adaptive Chinese syllable recognition system based on discriminative training

被引:0
|
作者
Zhou, L
Imai, S
机构
关键词
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
In this paper, we present two speaker adaptation methods to implement a MSVQ-based adaptive Chinese syllable recognition system. The first proposed method is feature normalization in which we model the inter-speaker variability as a linear transformation. By applying the feature normalization, the target speaker speech is normalized to reduce the inter-speaker acoustic variability. In the second adaptation method, we first present an implementation of the MCE/GPD algorithm for discriminatively training MSVQ-based speech recognizer. It is expected that this method can separate the confusion classes and can enhance speaker adaptation capability. We carried out recognition experiments to assess the performance by using standard Chinese syllable database CRDB in China, the results show that when both adaptation methods are combined, the error rate reduction on open data is over 62% with a single set of adaptation training data. When increasing training data, the capability of speaker adaptation is improved using the MCE/GPD training only. After using 5 sets of training data, the average recognition rate for two new speakers was improved from 72.87% to 97.31% which is best performance reported in this database.
引用
收藏
页码:31 / 36
页数:6
相关论文
共 50 条
  • [1] MSVQ-based speaker-adaptive Chinese syllable recognition based on discriminative training
    Zhou, L
    Imai, S
    INTERNATIONAL JOURNAL OF ADAPTIVE CONTROL AND SIGNAL PROCESSING, 1997, 11 (07) : 569 - 583
  • [2] A STUDY ON SPEAKER ADAPTATION FOR MANDARINE SYLLABLE RECOGNITION WITH MINIMUM ERROR DISCRIMINATIVE TRAINING
    LIN, CH
    WU, CH
    CHANG, PC
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 1995, E78D (06) : 712 - 718
  • [3] A STUDY ON MINIMUM ERROR DISCRIMINATIVE TRAINING FOR SPEAKER RECOGNITION
    LIU, CS
    LEE, CH
    CHOU, W
    JUANG, BH
    ROSENBERG, AE
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1995, 97 (01): : 637 - 648
  • [4] Real-time speaker-dependent syllable recognition system of complete vocabulary of Chinese
    Chen, Tao
    Li, Changli
    Mo, Fuyuan
    Shengxue Xuebao/Acta Acustica, 1993, 18 (03): : 161 - 171
  • [5] A discriminative training approach for text-independent speaker recognition
    Hong, QY
    Kwong, S
    SIGNAL PROCESSING, 2005, 85 (07) : 1449 - 1463
  • [6] Fast Speaker Adaptive Training for Speech Recognition
    Povey, Daniel
    Kuo, Hong-Kwang J.
    Soltau, Hagen
    INTERSPEECH 2008: 9TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2008, VOLS 1-5, 2008, : 1245 - 1248
  • [7] Speaker identification based on adaptive discriminative vector quantisation
    Zhou, G.
    Mikhael, W. B.
    IEE PROCEEDINGS-VISION IMAGE AND SIGNAL PROCESSING, 2006, 153 (06): : 754 - 760
  • [8] Discriminative training for speaker identification
    Hong, QY
    Kwong, S
    ELECTRONICS LETTERS, 2004, 40 (04) : 280 - 281
  • [9] Minimum Phone Error Discriminative Training For Mandarin Chinese Speaker Adaptation
    Chen, Liang-Yu
    Lee, Chun-Jen
    Jang, Jyh-Shing Roger
    INTERSPEECH 2008: 9TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2008, VOLS 1-5, 2008, : 1241 - +
  • [10] Similar Handwritten Chinese Character Recognition based on Adaptive Discriminative Locality Alignment
    Qu, Xiwen
    Xu, Ning
    Wang, Weiqiang
    Lu, Ke
    2015 14TH IAPR INTERNATIONAL CONFERENCE ON MACHINE VISION APPLICATIONS (MVA), 2015, : 130 - 133