Discriminative training of acoustic models applied to domains with unreliable transcripts

被引:0
|
作者
Mathias, L [1 ]
Yegnanarayanan, G [1 ]
Fritsch, J [1 ]
机构
[1] Johns Hopkins Univ, Ctr Language & Speech Proc, Baltimore, MD 21218 USA
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Training Automatic Speech Recognition (ASR) systems require availability of training transcripts for the speech data. Obtaining these transcripts is a time consuming and costly process, especially for the medical domain. On the other hand, medical reports which are generated as a by-product of the normal medical transcription workflow are available easily. However, they only partially represent the acoustic data. In this paper, we present a method for the automatic Generation of transcripts from these medical reports (1). In particular, we identify "reliable" regions in the transcript that can be used for training acoustic models. Experiments based on maximum likelihood (ML) and lattice-based discriminative training with frame filtering are presented. It is shown that discriminative training gives us word error rate (WER) reductions of 8-15% relative to the baseline.
引用
收藏
页码:109 / 112
页数:4
相关论文
共 50 条
  • [1] Discriminative training of acoustic models for system combination
    Tachioka, Yuuki
    Watanabe, Shinji
    14TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2013), VOLS 1-5, 2013, : 2354 - 2358
  • [2] Training data selection for improving discriminative training of acoustic models
    Liu, Shih-Hung
    Chu, Fang-Hui
    Lin, Shih-Hsiang
    Lee, Hung-Shin
    Chen, Berlin
    2007 IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING, VOLS 1 AND 2, 2007, : 284 - 289
  • [3] Training data selection for improving discriminative training of acoustic models
    Chen, Berlin
    Liu, Shih-Hung
    Chu, Fang-Hui
    PATTERN RECOGNITION LETTERS, 2009, 30 (13) : 1228 - 1235
  • [4] Discriminative Training of Gender-Dependent Acoustic Models
    Vanek, Jan
    Psutka, Josef V.
    Zelinka, Jan
    Prazak, Ales
    Psutka, Josef
    TEXT, SPEECH AND DIALOGUE, PROCEEDINGS, 2009, 5729 : 331 - 338
  • [5] Leveraging Unlabeled Speech for Sequence Discriminative Training of Acoustic Models
    Sapru, Ashtosh
    Garimella, Sri
    INTERSPEECH 2020, 2020, : 3585 - 3589
  • [6] A variable weighting based training data selection method for discriminative training of acoustic models
    Chen, Bin
    Niu, Tong
    Zhang, Lian-Hai
    Li, Bi-Cheng
    Qu, Dan
    Zidonghua Xuebao/Acta Automatica Sinica, 2014, 40 (12): : 2899 - 2907
  • [7] DISCRIMINATIVE FEATURE DOMAINS FOR REVERBERANT ACOUSTIC ENVIRONMENTS
    Papayiannis, Constantinos
    Evers, Christine
    Naylor, Patrick A.
    2017 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2017, : 756 - 760
  • [8] DISCRIMINATIVE TRAINING OF HIERARCHICAL ACOUSTIC MODELS FOR LARGE VOCABULARY CONTINUOUS SPEECH RECOGNITION
    Chang, Hung-An
    Glass, James R.
    2009 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1- 8, PROCEEDINGS, 2009, : 4481 - 4484
  • [9] A DISTRIBUTED ARCHITECTURE FOR FAST SGD SEQUENCE DISCRIMINATIVE TRAINING OF DNN ACOUSTIC MODELS
    Saon, George
    2014 IEEE WORKSHOP ON SPOKEN LANGUAGE TECHNOLOGY SLT 2014, 2014, : 183 - 188
  • [10] A Regularized Discriminative Training Method of Acoustic Models Derived by Minimum Relative Entropy Discrimination
    Kubo, Yotaro
    Watanabe, Shinji
    Nakamura, Atsushi
    Kobayashi, Tetsunori
    11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 3 AND 4, 2010, : 2954 - +