Probabilistic context-free grammars estimated from infinite distributions

被引:5
|
作者
Corazza, Anna
Satta, Giorgio
机构
[1] Univ Naples Federico II, Dept Phys, I-80126 Naples, Italy
[2] Univ Padua, Dept Informat Engn, I-35131 Padua, Italy
关键词
D O I
10.1109/TPAMI.2007.1065
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we consider probabilistic context-free grammars, a class of generative devices that has been successfully exploited in several applications of syntactic pattern matching, especially in statistical natural language parsing. We investigate the problem of training probabilistic context-free grammars on the basis of distributions defined over an infinite set of trees or an infinite set of sentences by minimizing the cross-entropy. This problem has applications in cases of context-free approximation of distributions generated by more expressive statistical models. We show several interesting theoretical properties of probabilistic context-free grammars that are estimated in this way, including the previously unknown equivalence between the grammar cross-entropy with the input distribution and the so-called derivational entropy of the grammar itself. We discuss important consequences of these results involving the standard application of the maximum-likelihood estimator on finite tree and sentence samples, as well as other finite-state models such as Hidden Markov Models and probabilistic finite automata.
引用
收藏
页码:1379 / 1393
页数:15
相关论文
共 50 条
  • [21] REDUCTION OF CONTEXT-FREE GRAMMARS
    TANIGUCHI, K
    KASAMI, T
    INFORMATION AND CONTROL, 1970, 17 (01): : 92 - +
  • [22] RELATEDNESS OF CONTEXT-FREE GRAMMARS
    WALTER, HKG
    COMPUTING, 1979, 22 (01) : 31 - 58
  • [23] On restricted context-free grammars
    Dassow, Juergen
    Masopust, Tomas
    JOURNAL OF COMPUTER AND SYSTEM SCIENCES, 2012, 78 (01) : 293 - 304
  • [24] Ordered Context-Free Grammars
    van der Merwe, Brink
    Berglund, Martin
    IMPLEMENTATION AND APPLICATION OF AUTOMATA (CIAA 2022), 2022, 13266 : 53 - 66
  • [25] On Restricted Context-Free Grammars
    Dassow, Juergen
    Masopust, Tomas
    DEVELOPMENTS IN LANGUAGE THEORY, 2010, 6224 : 434 - +
  • [26] PREDICTORS OF CONTEXT-FREE GRAMMARS
    TAI, KC
    SIAM JOURNAL ON COMPUTING, 1980, 9 (03) : 653 - 664
  • [27] CONTEXT-FREE GRAPH GRAMMARS
    DELLAVIGNA, P
    GHEZZI, C
    INFORMATION AND CONTROL, 1978, 37 (02): : 207 - 233
  • [28] ORDERED CONTEXT-FREE GRAMMARS
    LEPISTO, T
    INFORMATION AND CONTROL, 1973, 22 (01): : 56 - 68
  • [29] On Muller context-free grammars
    Esik, Zoltan
    Ivan, Szabolcs
    THEORETICAL COMPUTER SCIENCE, 2012, 416 : 17 - 32
  • [30] On a construction of context-free grammars
    Martinek, Pavel
    Fundamenta Informaticae, 2000, 44 (03) : 245 - 264