Language model adaptation using mixtures and an exponentially decaying cache

被引:0
|
作者
Clarkson, PR
Robinson, AJ
机构
关键词
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
This paper presents two techniques for language model adaptation. The first is based on the use of mixtures of language models: the training text is partitioned according to topic, a language model is constructed for each component, and at recognition time appropriate weightings are assigned to each component to model the observed style of language. The second technique is based on augmenting the standard trigram model with a cache component in which words recurrence probabilities decay exponentially over time. Both techniques yield a significant reduction in perplexity over the baseline trigram language model when faced with multi-domain test text, the mixture-based model giving a 24% reduction and the cache-based model giving a 14% reduction. The two techniques attack the problem of adaptation at different scales, and as a result can be used in parallel to give a total perplexity reduction of 30%.
引用
收藏
页码:799 / 802
页数:4
相关论文
共 50 条
  • [21] Unsupervised language model adaptation
    Bacchiani, M
    Roark, B
    2003 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL I, PROCEEDINGS: SPEECH PROCESSING I, 2003, : 224 - 227
  • [22] PROBABILISTIC TIME-SCHEDULING MODEL FOR AN EXPONENTIALLY DECAYING INVENTORY WHEN DELAYS IN PAYMENTS ARE PERMISSIBLE
    SHAH, NH
    INTERNATIONAL JOURNAL OF PRODUCTION ECONOMICS, 1993, 32 (01) : 77 - 82
  • [23] ANALYTICAL MODEL FOR THE OUT-DIFFUSION OF AN EXPONENTIALLY DECAYING IMPURITY PROFILE - APPLICATION TO NITROGEN IN SILICON
    WILLEMS, GJ
    MAES, HE
    JOURNAL OF APPLIED PHYSICS, 1993, 73 (07) : 3256 - 3260
  • [24] Exponentially time decaying susceptible-informed (SIT) model for information diffusion process on networks
    Bao, Wei
    Michailidis, George
    CHAOS, 2018, 28 (06)
  • [25] Lieb-Liniger model with exponentially decaying interactions: A continuous matrix product state study
    Rincon, Julian
    Ganahl, Martin
    Vidal, Guifre
    PHYSICAL REVIEW B, 2015, 92 (11):
  • [26] A Cache Language Model for Whole Document Handwriting Recognition
    Frinken, Volkmar
    Karatzas, Dimosthenis
    Fischer, Andreas
    2014 11TH IAPR INTERNATIONAL WORKSHOP ON DOCUMENT ANALYSIS SYSTEMS (DAS 2014), 2014, : 166 - 170
  • [27] A CACHE-BASED LANGUAGE MODEL FOR SPEECH RECOGNITION
    KUHN, R
    DEMORI, R
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 1992, 14 (06) : 691 - 692
  • [28] Data augmentation and language model adaptation using singular value decomposition
    Béchet, F.
    De Mori, R.
    Janiszek, D.
    1600, Elsevier (25):
  • [29] Unsupervised adaptation of a stochastic Language Model using a Japanese raw corpus
    Kurata, Gakuto
    Mori, Shinsuke
    Nishimura, Masafumi
    2006 IEEE International Conference on Acoustics, Speech and Signal Processing, Vols 1-13, 2006, : 1037 - 1040
  • [30] Improved language model adaptation using existing and derived external resources
    Chang, PC
    Lee, LS
    ASRU'03: 2003 IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING ASRU '03, 2003, : 531 - 536