Improving Unsupervised Language Model Adaptation with Discriminative Data Filtering

被引:0
|
作者
Chang, Shuangyu [1 ]
Levit, Michael [1 ]
Parthasarathy, Partha [1 ]
Dumoulin, Benoit [1 ]
机构
[1] Microsoft Corp, Sunnyvale, CA 94089 USA
来源
14TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2013), VOLS 1-5 | 2013年
关键词
unsupervised; discriminative; language model adaptation;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper we propose a method for improving unsupervised language model (LM) adaptation by discriminatively filtering the adaptation training material. Two main issues are addressed in this solution: first, how to automatically identify recognition errors and more correct alternatives without manual transcription; second, how to update the model parameters based on the recognition error cues. Within the adaptation framework, we address the first issue by predicting regression pairs between recognition results from the baseline LM and an initial adapted LM, using features such as language model score difference. For the second issue, we adopted a data filtering approach to penalize potent error attractors introduced by the unsupervised adaptation data, using Ngram set difference statistics computed on the predicted regression pairs. Experimental results on a large real-world application of voice catalog search demonstrated that the proposed solution provides significant recognition error reduction over an initial adapted LM.
引用
收藏
页码:1207 / 1211
页数:5
相关论文
共 50 条
  • [31] Class Discriminative Adversarial Learning for Unsupervised Domain Adaptation
    Zhou, Lihua
    Ye, Mao
    Zhu, Xiatian
    Li, Shuaifeng
    Liu, Yiguang
    PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2022, 2022, : 4318 - 4326
  • [32] Enhancing unsupervised domain adaptation by discriminative relevance regularization
    Wenju Zhang
    Xiang Zhang
    Long Lan
    Zhigang Luo
    Knowledge and Information Systems, 2020, 62 : 3641 - 3664
  • [33] TRANSFERABLE DISCRIMINATIVE FEATURE MINING FOR UNSUPERVISED DOMAIN ADAPTATION
    Zhao, Lingjun
    Deng, Wanxia
    Kuang, Gangyao
    Hu, Dewen
    Liu, Li
    2021 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2021, : 1259 - 1263
  • [34] Discriminative Subspace Alignment for Unsupervised Visual Domain Adaptation
    Sun, Hao
    Liu, Shuai
    Zhou, Shilin
    NEURAL PROCESSING LETTERS, 2016, 44 (03) : 779 - 793
  • [35] CONSTRAINED DISCRIMINATIVE MAPPING TRANSFORMS FOR UNSUPERVISED SPEAKER ADAPTATION
    Chen, Langzhou
    Gales, Mark J. F.
    Chin, K. K.
    2011 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2011, : 5344 - 5347
  • [36] Enhancing unsupervised domain adaptation by discriminative relevance regularization
    Zhang, Wenju
    Zhang, Xiang
    Lan, Long
    Luo, Zhigang
    KNOWLEDGE AND INFORMATION SYSTEMS, 2020, 62 (09) : 3641 - 3664
  • [37] Discriminative feature alignment: Improving transferability of unsupervised domain adaptation by Gaussian-guided latent alignment
    Wang, Jing
    Chen, Jiahong
    Lin, Jianzhe
    Sigal, Leonid
    Silva, Clarence W. de
    PATTERN RECOGNITION, 2021, 116
  • [38] Discriminative language model adaptation for Mandarin broadcast speech transcription and translation
    Liu, X. A.
    Byrne, W. J.
    Gales, M. J. F.
    de Gispert, A.
    Tomalin, M.
    Woodland, P. C.
    Yu, K.
    2007 IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING, VOLS 1 AND 2, 2007, : 153 - 158
  • [39] Data augmentation and language model adaptation
    Janiszek, D
    De Mori, R
    Bechet, E
    2001 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-VI, PROCEEDINGS: VOL I: SPEECH PROCESSING 1; VOL II: SPEECH PROCESSING 2 IND TECHNOL TRACK DESIGN & IMPLEMENTATION OF SIGNAL PROCESSING SYSTEMS NEURALNETWORKS FOR SIGNAL PROCESSING; VOL III: IMAGE & MULTIDIMENSIONAL SIGNAL PROCESSING MULTIMEDIA SIGNAL PROCESSING - VOL IV: SIGNAL PROCESSING FOR COMMUNICATIONS; VOL V: SIGNAL PROCESSING EDUCATION SENSOR ARRAY & MULTICHANNEL SIGNAL PROCESSING AUDIO & ELECTROACOUSTICS; VOL VI: SIGNAL PROCESSING THEORY & METHODS STUDENT FORUM, 2001, : 549 - 552
  • [40] Semi-supervised and unsupervised discriminative language model training for automatic speech recognition
    Dikici, Erinc
    Saraclar, Murat
    SPEECH COMMUNICATION, 2016, 83 : 54 - 63