Investigations of Issues for Using Multiple Acoustic Models to Improve Continuous Speech Recognition

被引:0
|
作者
Zhang, Rong [1 ]
Rudnicky, Alexander I. [1 ]
机构
[1] Carnegie Mellon Univ, Sch Comp Sci, Language Technol Inst, Pittsburgh, PA 15213 USA
关键词
Boosting; ROVER;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper investigates two important issues in constructing and combining ensembles of acoustic models for reducing recognition errors. First, we investigate the applicability of the AnyBoost algorithm for acoustic model training. AnyBoost is a generalized Boosting method that allows the use of an arbitrary loss function as the training criterion to construct ensemble of classifiers. We choose the MCE discriminative objective function for our experiments. Initial test results on a real-world meeting recognition corpus show that AnyBoost is a competitive alternate to the standard AdaBoost algorithm. Second, we investigate ROVER-based combination, focusing on the technique for selecting correct hypothesized words from aligned WTN. We propose a neural network based insertion detection and word scoring scheme for this. Our approach consistently outperforms the current voting technique used by ROVER in the experiments.
引用
收藏
页码:529 / 532
页数:4
相关论文
共 50 条
  • [31] Compact Acoustic Models for Embedded Speech Recognition
    Christophe Lévy
    Georges Linarès
    Jean-François Bonastre
    EURASIP Journal on Audio, Speech, and Music Processing, 2009
  • [32] Speech recognition on an FPA using discrete and continuous hidden Markov models
    Melnikoff, SJ
    Quigley, SF
    Russell, MJ
    FIELD-PROGRAMMABLE LOGIC AND APPLICATIONS, PROCEEDINGS: RECONFIGURABLE COMPUTING IS GOING MAINSTREAM, 2002, 2438 : 202 - 211
  • [33] Rejection techniques in continuous speech recognition using hidden Markov models
    1600, Publ by Elsevier Science Publishers B.V., Amsterdam, Neth
  • [34] Accent Issues in Large Vocabulary Continuous Speech Recognition
    Chao Huang
    Tao Chen
    Eric Chang
    International Journal of Speech Technology, 2004, 7 (2-3) : 141 - 153
  • [36] CONTINUOUS SPEECH RECOGNITION VIA CENTISECOND ACOUSTIC STATES
    BAKIS, R
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1976, 59 : S97 - S97
  • [37] Acoustic Model Merging Using Acoustic Models from Multilingual Speakers for Automatic Speech Recognition
    Tan, Tien-Ping
    Besacier, Laurent
    Lecouteux, Benjamin
    PROCEEDINGS OF THE 2014 INTERNATIONAL CONFERENCE ON ASIAN LANGUAGE PROCESSING (IALP 2014), 2014, : 42 - 45
  • [38] MODELS OF CONTINUOUS SPEECH RECOGNITION AND THE CONTENTS OF THE VOCABULARY
    MCQUEEN, JM
    CUTLER, A
    BRISCOE, T
    NORRIS, D
    LANGUAGE AND COGNITIVE PROCESSES, 1995, 10 (3-4): : 309 - 331
  • [39] End-to-End Training of Acoustic Models for Large Vocabulary Continuous Speech Recognition with TensorFlow
    Variani, Ehsan
    Bagby, Tom
    McDermott, Erik
    Bacchiani, Michiel
    18TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2017), VOLS 1-6: SITUATED INTERACTION, 2017, : 1641 - 1645
  • [40] SPEECH RECOGNITION OF MULTIPLE ACCENTED ENGLISH DATA USING ACOUSTIC MODEL INTERPOLATION
    Fraga-Silva, Thiago
    Gauvain, Jean-Luc
    Lamel, Lori
    2014 PROCEEDINGS OF THE 22ND EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2014, : 1781 - 1785