Language model adaptation using mixtures and an exponentially decaying cache

被引:0
|
作者
Clarkson, PR
Robinson, AJ
机构
关键词
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
This paper presents two techniques for language model adaptation. The first is based on the use of mixtures of language models: the training text is partitioned according to topic, a language model is constructed for each component, and at recognition time appropriate weightings are assigned to each component to model the observed style of language. The second technique is based on augmenting the standard trigram model with a cache component in which words recurrence probabilities decay exponentially over time. Both techniques yield a significant reduction in perplexity over the baseline trigram language model when faced with multi-domain test text, the mixture-based model giving a 24% reduction and the cache-based model giving a 14% reduction. The two techniques attack the problem of adaptation at different scales, and as a result can be used in parallel to give a total perplexity reduction of 30%.
引用
收藏
页码:799 / 802
页数:4
相关论文
共 50 条
  • [41] Analysis and realization of an exponentially-decaying impulse response model for frequency-selective fading channels
    Morgan, Dennis R.
    IEEE SIGNAL PROCESSING LETTERS, 2008, 15 (441-444) : 441 - 444
  • [42] Visual Comparison of Language Model Adaptation
    Sevastjanova R.
    Cakmak E.
    Ravfogel S.
    Cotterell R.
    El-Assady M.
    IEEE Transactions on Visualization and Computer Graphics, 2023, 29 (01) : 1178 - 1188
  • [43] Data augmentation and language model adaptation
    Janiszek, D
    De Mori, R
    Bechet, E
    2001 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-VI, PROCEEDINGS: VOL I: SPEECH PROCESSING 1; VOL II: SPEECH PROCESSING 2 IND TECHNOL TRACK DESIGN & IMPLEMENTATION OF SIGNAL PROCESSING SYSTEMS NEURALNETWORKS FOR SIGNAL PROCESSING; VOL III: IMAGE & MULTIDIMENSIONAL SIGNAL PROCESSING MULTIMEDIA SIGNAL PROCESSING - VOL IV: SIGNAL PROCESSING FOR COMMUNICATIONS; VOL V: SIGNAL PROCESSING EDUCATION SENSOR ARRAY & MULTICHANNEL SIGNAL PROCESSING AUDIO & ELECTROACOUSTICS; VOL VI: SIGNAL PROCESSING THEORY & METHODS STUDENT FORUM, 2001, : 549 - 552
  • [44] Modeling long distance dependence in language: Topic mixtures versus dynamic cache models
    Iyer, RM
    Ostendorf, M
    IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 1999, 7 (01): : 30 - 39
  • [45] Model adaptation for spoken language understanding
    Tur, G
    2005 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1-5: SPEECH PROCESSING, 2005, : 41 - 44
  • [46] Context Dependent Language Model Adaptation
    Liu, X.
    Gales, M. J. F.
    Woodland, P. C.
    INTERSPEECH 2008: 9TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2008, VOLS 1-5, 2008, : 837 - 840
  • [47] Selective use of gaze information to improve ASR performance in noisy environments by cache-based class language model adaptation
    Shen, Ao
    Cooke, Neil
    Russell, Martin
    14TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2013), VOLS 1-5, 2013, : 1843 - 1847
  • [48] CACHECA: A Cache Language Model Based Code Suggestion Tool
    Franks, Christine
    Tu, Zhaopeng
    Devanbu, Premkumar
    Hellendoorn, Vincent
    2015 IEEE/ACM 37TH IEEE INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING, VOL 2, 2015, : 705 - 708
  • [49] Unbounded cache model for online language modeling with open vocabulary
    Grave, Edouard
    Cisse, Moustapha
    Joulin, Armand
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 30 (NIPS 2017), 2017, 30
  • [50] Language Model Adaptation for a Speech to Sign Language Translation System using Web Frequencies and a MAP Framework
    Fernando D'Haro, Luis
    San-Segundo, Ruben
    de Cordoba, Ricardo
    Bungeroth, Jan
    Stein, Daniel
    Ney, Hermann
    INTERSPEECH 2008: 9TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2008, VOLS 1-5, 2008, : 2199 - +