Discriminative training of acoustic models applied to domains with unreliable transcripts

Cited by: 0

Authors:

Mathias, L [1]

Yegnanarayanan, G [1]

Fritsch, J [1]

Affiliation:

[1] Johns Hopkins Univ, Ctr Language & Speech Proc, Baltimore, MD 21218 USA

Source:

2005 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1-5: SPEECH PROCESSING | 2005

Keywords:

DOI:

Not available

Chinese Library Classification number:

TP18 [Artificial intelligence theory];

Subject classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Training Automatic Speech Recognition (ASR) systems requires transcripts of the speech data. Obtaining these transcripts is a time-consuming and costly process, especially in the medical domain. On the other hand, medical reports, which are generated as a by-product of the normal medical transcription workflow, are readily available. However, they only partially represent the acoustic data. In this paper, we present a method for the automatic generation of transcripts from these medical reports. In particular, we identify "reliable" regions in the transcript that can be used for training acoustic models. Experiments based on maximum likelihood (ML) and lattice-based discriminative training with frame filtering are presented. It is shown that discriminative training gives word error rate (WER) reductions of 8-15% relative to the baseline.
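The abstract does not say how the "reliable" regions are identified, so the following is only an illustrative sketch of one plausible approach, not the authors' method. It assumes that a first-pass recognizer hypothesis is available and treats runs of words on which the report and the hypothesis agree as reliable. The function names, the `min_run` threshold, and the sample sentences are hypothetical. A small helper also shows how a relative WER reduction such as the reported 8-15% is computed.

```python
from difflib import SequenceMatcher


def reliable_regions(report_words, hypothesis_words, min_run=3):
    """Return (start, end) index pairs into hypothesis_words.

    Assumption: a region counts as "reliable" when at least `min_run`
    consecutive words match between the medical report and a first-pass
    recognizer hypothesis. This is an illustrative criterion only.
    """
    matcher = SequenceMatcher(a=report_words, b=hypothesis_words, autojunk=False)
    regions = []
    for block in matcher.get_matching_blocks():
        # The final block returned by difflib is a zero-length sentinel,
        # which the size check below skips automatically.
        if block.size >= min_run:
            regions.append((block.b, block.b + block.size))
    return regions


def relative_wer_reduction(baseline_wer, new_wer):
    """Relative WER reduction, e.g. 20.0 -> 17.0 gives 0.15 (15%)."""
    return (baseline_wer - new_wer) / baseline_wer


# Hypothetical example: the report omits a filler and a dictated "period".
report = "the patient was admitted with chest pain and discharged".split()
hypothesis = "uh the patient was admitted with a chest pain period".split()

regions = reliable_regions(report, hypothesis, min_run=3)
print(regions)
print(relative_wer_reduction(20.0, 17.0))
```

In a setup like this, only the frames aligned to words inside the returned regions would contribute to acoustic model training, which is one way to read "frame filtering"; the actual filtering criterion used in the paper may differ.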


Pages: 109-112

Number of pages: 4
