Distant bigram language modelling using maximum entropy

被引:0
|
作者
Simons, M
Ney, H
Martin, SC
机构
关键词
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
In this paper, we apply tile maximum entropy approach to so-called distant bigram language modelling. In addition to the usual unigram and bigram dependencies, we use distant bigram dependencies, where tile immediate predecessor word of the word position under consideration is skipped. The contributions of this paper are: (1) We analyze the computational complexity of the resulting training algorithm, i.e. the generalized iterative scaling (GIS) algorithm, and studs the details of its implementation. (2) We describe a method for handling unseen events in the maximum entropy approach; this is achieved by discounting the frequencies of observed events. (3) We study the effect of this discounting operation on the convergence of the GIS algorithm. (4) We give experimental perplexity results for a corpus from the WSJ task. By using the maximum entropy approach and the distant bigram dependencies, we are able to reduce the perplexity from 205.4 for our best conventional bigram model to 169.5.
引用
收藏
页码:787 / 790
页数:4
相关论文
共 50 条
  • [1] A maximum entropy approach to adaptive statistical language modelling
    Rosenfeld, R
    COMPUTER SPEECH AND LANGUAGE, 1996, 10 (03): : 187 - 228
  • [2] Maximum entropy approach to adaptive statistical language modelling
    Carnegie Mellon Univ, Pittsburgh, United States
    Comput Speech Lang, 3 (187-228):
  • [3] Modelling and Simulation of Seasonal Rainfall Using the Principle of Maximum Entropy
    Borwein, Jonathan
    Howlett, Phil
    Piantadosi, Julia
    ENTROPY, 2014, 16 (02) : 747 - 769
  • [4] Arabic Diacritics Restoration Using Maximum Entropy Language Models
    Shamardan, Hossam
    Hifny, Yasser
    IEEE SIGNAL PROCESSING LETTERS, 2023, 30 : 1227 - 1231
  • [5] Maximum entropy method in comminution modelling
    Otwinowski, Henryk
    GRANULAR MATTER, 2006, 8 (3-4) : 239 - 249
  • [6] Localized maximum entropy shape modelling
    Loog, Marco
    INFORMATION PROCESSING IN MEDICAL IMAGING, PROCEEDINGS, 2007, 4584 : 619 - 629
  • [7] Maximum Entropy Method in Comminution Modelling
    Henryk Otwinowski
    Granular Matter, 2006, 8 : 239 - 249
  • [8] An improved maximum entropy language model
    Fang, GL
    Wen, G
    2002 6TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING PROCEEDINGS, VOLS I AND II, 2002, : 1083 - 1086
  • [9] Opto-thermal inverse modelling using a maximum entropy approach
    Xiao, P
    Gull, SF
    Imhof, RE
    ANALYTICAL SCIENCES, 2001, 17 : S394 - S397
  • [10] Modelling Body Mass Index Distribution using Maximum Entropy Density
    Chan, F.
    Harris, M.
    Singh, R.
    21ST INTERNATIONAL CONGRESS ON MODELLING AND SIMULATION (MODSIM2015), 2015, : 1036 - 1042