Probabilistic context-free grammars estimated from infinite distributions

被引:5
|
作者
Corazza, Anna
Satta, Giorgio
机构
[1] Univ Naples Federico II, Dept Phys, I-80126 Naples, Italy
[2] Univ Padua, Dept Informat Engn, I-35131 Padua, Italy
关键词
D O I
10.1109/TPAMI.2007.1065
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we consider probabilistic context-free grammars, a class of generative devices that has been successfully exploited in several applications of syntactic pattern matching, especially in statistical natural language parsing. We investigate the problem of training probabilistic context-free grammars on the basis of distributions defined over an infinite set of trees or an infinite set of sentences by minimizing the cross-entropy. This problem has applications in cases of context-free approximation of distributions generated by more expressive statistical models. We show several interesting theoretical properties of probabilistic context-free grammars that are estimated in this way, including the previously unknown equivalence between the grammar cross-entropy with the input distribution and the so-called derivational entropy of the grammar itself. We discuss important consequences of these results involving the standard application of the maximum-likelihood estimator on finite tree and sentence samples, as well as other finite-state models such as Hidden Markov Models and probabilistic finite automata.
引用
收藏
页码:1379 / 1393
页数:15
相关论文
共 50 条
  • [31] ON CONTEXT-FREE PROGRAMMED GRAMMARS
    SEBESTA, RW
    COMPUTER LANGUAGES, 1989, 14 (02): : 99 - 108
  • [32] MINIMIZATION OF CONTEXT-FREE GRAMMARS
    Ryazanov, Yu D.
    Nazina, S., V
    PRIKLADNAYA DISKRETNAYA MATEMATIKA, 2019, (45): : 90 - 96
  • [33] On Muller Context-Free Grammars
    Esik, Zoltan
    Ivan, Szabolcs
    DEVELOPMENTS IN LANGUAGE THEORY, 2010, 6224 : 173 - 184
  • [34] CONTEXT-FREE TEXT GRAMMARS
    EHRENFEUCHT, A
    TENPAS, P
    ROZENBERG, G
    ACTA INFORMATICA, 1994, 31 (02) : 161 - 206
  • [35] Binary Context-Free Grammars
    Turaev, Sherzod
    Abdulghafor, Rawad
    Alwan, Ali Amer
    Abd Almisreb, Ali
    Gulzar, Yonis
    SYMMETRY-BASEL, 2020, 12 (08):
  • [36] Cooperation in context-free grammars
    Dassow, J
    Mitrana, V
    THEORETICAL COMPUTER SCIENCE, 1997, 180 (1-2) : 353 - 361
  • [37] Pullback Grammars Are Context-Free
    Bauderon, Michel
    Chen, Rui
    Ly, Olivier
    GRAPH TRANSFORMATIONS, ICGT 2008, 2008, 5214 : 366 - +
  • [38] Evolving context-free grammars
    Cyre, W
    PROCEEDINGS OF THE 6TH JOINT CONFERENCE ON INFORMATION SCIENCES, 2002, : 643 - 646
  • [39] ON MULTIPLE CONTEXT-FREE GRAMMARS
    SEKI, H
    MATSUMURA, T
    FUJII, M
    KASAMI, T
    THEORETICAL COMPUTER SCIENCE, 1991, 88 (02) : 191 - 229
  • [40] INDEXED GRAMMARS - AN EXTENSION OF CONTEXT-FREE GRAMMARS
    AHO, AV
    JOURNAL OF THE ACM, 1968, 15 (04) : 647 - &