Improving Unsupervised Language Model Adaptation with Discriminative Data Filtering

被引：0

作者：

Chang, Shuangyu ^{[1
]}

Levit, Michael ^{[1
]}

Parthasarathy, Partha ^{[1
]}

Dumoulin, Benoit ^{[1
]}

机构：

[1] Microsoft Corp, Sunnyvale, CA 94089 USA

来源：

14TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2013), VOLS 1-5 | 2013年

关键词：

unsupervised; discriminative; language model adaptation;

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

In this paper we propose a method for improving unsupervised language model (LM) adaptation by discriminatively filtering the adaptation training material. Two main issues are addressed in this solution: first, how to automatically identify recognition errors and more correct alternatives without manual transcription; second, how to update the model parameters based on the recognition error cues. Within the adaptation framework, we address the first issue by predicting regression pairs between recognition results from the baseline LM and an initial adapted LM, using features such as language model score difference. For the second issue, we adopted a data filtering approach to penalize potent error attractors introduced by the unsupervised adaptation data, using Ngram set difference statistics computed on the predicted regression pairs. Experimental results on a large real-world application of voice catalog search demonstrated that the proposed solution provides significant recognition error reduction over an initial adapted LM.

引用

页码：1207 / 1211

页数：5

共 50 条

[41] Unsupervised cross-adaptation approach for speech recognition by combined language model and acoustic model adaptation
School of Science and Engineering, Yamagata University, Yonezawa, Japan
APSIPA ASC - Asia-Pac. Signal Inf. Process. Assoc. Annu. Summit Conf., (943-946):
[42] Phrasal Cohort Based Unsupervised Discriminative Language Modeling
Xu, Puyang
Khudanpur, Sanjeev
Roark, Brian
13TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2012 (INTERSPEECH 2012), VOLS 1-3, 2012, : 198 - 201
[43] An Adversarial Training Method for Improving Model Robustness in Unsupervised Domain Adaptation
Nie, Zhishen
Lin, Ying
Yan, Meng
Cao, Yifan
Ning, Shengfu
KNOWLEDGE SCIENCE, ENGINEERING AND MANAGEMENT, PT III, 2021, 12817 : 3 - 13
[44] Improving Spoken Document Retrieval by. Unsupervised Language Model Adaptation Using Utterance-based Web Search
Herms, Robert
Ritter, Marc
Wilhelm-Stein, Thomas
Eibl, Maximilian
15TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2014), VOLS 1-4, 2014, : 1430 - 1433
[45] An unsupervised Web-based topic language model adaptation method
Lecorve, Gwenole
Gravier, Guillaume
Sebillot, Pascale
2008 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-12, 2008, : 5081 - 5084
[46] Unsupervised adaptation of a stochastic Language Model using a Japanese raw corpus
Kurata, Gakuto
Mori, Shinsuke
Nishimura, Masafumi
2006 IEEE International Conference on Acoustics, Speech and Signal Processing, Vols 1-13, 2006, : 1037 - 1040
[47] UNSUPERVISED LANGUAGE MODEL ADAPTATION USING N-GRAM WEIGHTING
Haidar, Md. Akmal
O'Shaughnessy, Douglas
2011 24TH CANADIAN CONFERENCE ON ELECTRICAL AND COMPUTER ENGINEERING (CCECE), 2011, : 857 - 860
[48] Supervised and unsupervised Web-based language model domain adaptation
Lecorve, Gwenole
Dines, John
Hain, Thomas
Motlicek, Petr
13TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2012 (INTERSPEECH 2012), VOLS 1-3, 2012, : 182 - 185
[49] Unsupervised Domain Adaptation of a Pretrained Cross-Lingual Language Model
Li, Juntao
He, Ruidan
Ye, Hai
Ng, Hwee Tou
Bing, Lidong
Yan, Rui
PROCEEDINGS OF THE TWENTY-NINTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2020, : 3672 - 3678
[50] Drop to Adapt: Learning Discriminative Features for Unsupervised Domain Adaptation
Lee, Seungmin
Kim, Dongwan
Kim, Namil
Jeong, Seong-Gyun
2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 91 - 100

← 1 2 3 4 5 →