Improving Unsupervised Language Model Adaptation with Discriminative Data Filtering

被引:0
|
作者
Chang, Shuangyu [1 ]
Levit, Michael [1 ]
Parthasarathy, Partha [1 ]
Dumoulin, Benoit [1 ]
机构
[1] Microsoft Corp, Sunnyvale, CA 94089 USA
来源
14TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2013), VOLS 1-5 | 2013年
关键词
unsupervised; discriminative; language model adaptation;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper we propose a method for improving unsupervised language model (LM) adaptation by discriminatively filtering the adaptation training material. Two main issues are addressed in this solution: first, how to automatically identify recognition errors and more correct alternatives without manual transcription; second, how to update the model parameters based on the recognition error cues. Within the adaptation framework, we address the first issue by predicting regression pairs between recognition results from the baseline LM and an initial adapted LM, using features such as language model score difference. For the second issue, we adopted a data filtering approach to penalize potent error attractors introduced by the unsupervised adaptation data, using Ngram set difference statistics computed on the predicted regression pairs. Experimental results on a large real-world application of voice catalog search demonstrated that the proposed solution provides significant recognition error reduction over an initial adapted LM.
引用
收藏
页码:1207 / 1211
页数:5
相关论文
共 50 条
  • [41] Unsupervised cross-adaptation approach for speech recognition by combined language model and acoustic model adaptation
    School of Science and Engineering, Yamagata University, Yonezawa, Japan
    APSIPA ASC - Asia-Pac. Signal Inf. Process. Assoc. Annu. Summit Conf., (943-946):
  • [42] Phrasal Cohort Based Unsupervised Discriminative Language Modeling
    Xu, Puyang
    Khudanpur, Sanjeev
    Roark, Brian
    13TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2012 (INTERSPEECH 2012), VOLS 1-3, 2012, : 198 - 201
  • [43] An Adversarial Training Method for Improving Model Robustness in Unsupervised Domain Adaptation
    Nie, Zhishen
    Lin, Ying
    Yan, Meng
    Cao, Yifan
    Ning, Shengfu
    KNOWLEDGE SCIENCE, ENGINEERING AND MANAGEMENT, PT III, 2021, 12817 : 3 - 13
  • [44] Improving Spoken Document Retrieval by. Unsupervised Language Model Adaptation Using Utterance-based Web Search
    Herms, Robert
    Ritter, Marc
    Wilhelm-Stein, Thomas
    Eibl, Maximilian
    15TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2014), VOLS 1-4, 2014, : 1430 - 1433
  • [45] An unsupervised Web-based topic language model adaptation method
    Lecorve, Gwenole
    Gravier, Guillaume
    Sebillot, Pascale
    2008 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-12, 2008, : 5081 - 5084
  • [46] Unsupervised adaptation of a stochastic Language Model using a Japanese raw corpus
    Kurata, Gakuto
    Mori, Shinsuke
    Nishimura, Masafumi
    2006 IEEE International Conference on Acoustics, Speech and Signal Processing, Vols 1-13, 2006, : 1037 - 1040
  • [47] UNSUPERVISED LANGUAGE MODEL ADAPTATION USING N-GRAM WEIGHTING
    Haidar, Md. Akmal
    O'Shaughnessy, Douglas
    2011 24TH CANADIAN CONFERENCE ON ELECTRICAL AND COMPUTER ENGINEERING (CCECE), 2011, : 857 - 860
  • [48] Supervised and unsupervised Web-based language model domain adaptation
    Lecorve, Gwenole
    Dines, John
    Hain, Thomas
    Motlicek, Petr
    13TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2012 (INTERSPEECH 2012), VOLS 1-3, 2012, : 182 - 185
  • [49] Unsupervised Domain Adaptation of a Pretrained Cross-Lingual Language Model
    Li, Juntao
    He, Ruidan
    Ye, Hai
    Ng, Hwee Tou
    Bing, Lidong
    Yan, Rui
    PROCEEDINGS OF THE TWENTY-NINTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2020, : 3672 - 3678
  • [50] Drop to Adapt: Learning Discriminative Features for Unsupervised Domain Adaptation
    Lee, Seungmin
    Kim, Dongwan
    Kim, Namil
    Jeong, Seong-Gyun
    2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 91 - 100