A formal language model for parsing SGML

Cited by: 1
Authors
Matzen, RW [1]
George, KM [1]
Hedrick, GE [1]
Affiliation
[1] OKLAHOMA STATE UNIV, DEPT COMP SCI, STILLWATER, OK 74078
DOI
10.1016/0164-1212(95)00199-9
Chinese Library Classification number
TP31 [Computer software]
Discipline classification code
081202; 0835
Abstract
The Standard Generalized Markup Language (SGML) is an international standard for document definition (ISO 8879) that was adopted in 1986 and is rapidly gaining acceptance in industry and government. It is a meta-language system for document design rather than a specific scheme for document processing; almost any kind of document can be described using SGML. Productions called element declarations are used to define arbitrary elements of documents and the contexts in which they can occur. A finite set of element declarations, called a document type definition (DTD), defines the high-level syntax of a set of documents. DTDs are similar to context-free grammars, but the productions are more complex. The standard does not describe a formal language model for SGML, and there is little work in the literature on this topic. This article defines a formal language model for SGML based on systems of finite automata constructed from systems of regular expressions. This model is applied in two ways: a parser is constructed for DTDs, and methods are shown for automatically constructing parsers for the documents defined by a DTD. These methods for parsing SGML are new, and they include features of DTDs that have not previously been included in a static language model. The model applies directly to the syntactic constructs of SGML, and thus the methods shown in this article have distinct advantages for parsing SGML over traditional context-free parsing methods. © 1997 by Elsevier Science Inc.
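As a rough illustration of the idea that a DTD behaves like a system of regular expressions (one expression per element, constraining the sequence of its children), the following minimal Python sketch may help. It is not the construction described in the article: the toy DTD, the `(name, children)` tree encoding, and the function `is_valid` are all hypothetical, character data and SGML features such as tag omission are not modelled, and Python's `re` module is a backtracking matcher rather than an explicit finite automaton.

```python
import re

# Hypothetical toy DTD (an assumption for illustration only): each element
# name maps to a regular expression over the names of its child elements.
# Every child name is followed by a single space so names cannot run together.
TOY_DTD = {
    "memo": r"(to )+from (para )*",  # one or more <to>, one <from>, any <para>
    "to": r"",                       # leaf elements: no child elements allowed
    "from": r"",
    "para": r"",
}


def is_valid(element, dtd=TOY_DTD):
    """Check a document tree against the toy DTD.

    `element` is a (name, children) tuple, where `children` is a list of
    tuples of the same shape. Text content is ignored in this sketch.
    """
    name, children = element
    if name not in dtd:
        return False  # element is not declared in the DTD
    # Encode the sequence of child element names as one string.
    child_names = "".join(child[0] + " " for child in children)
    # The whole child sequence must match this element's content model.
    if re.fullmatch(dtd[name], child_names) is None:
        return False
    # Each child must in turn satisfy its own declaration.
    return all(is_valid(child, dtd) for child in children)
```

For example, `is_valid(("memo", [("to", []), ("from", []), ("para", [])]))` accepts the tree, while a memo whose first child is `from` is rejected because the child sequence does not match the content model for `memo`.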
Pages: 147-166
Number of pages: 20