A formal language model for parsing SGML

被引:1
|
作者
Matzen, RW [1 ]
George, KM [1 ]
Hedrick, GE [1 ]
机构
[1] OKLAHOMA STATE UNIV,DEPT COMP SCI,STILLWATER,OK 74078
关键词
D O I
10.1016/0164-1212(95)00199-9
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
The Standard Generalized Markup Language (SGML) is an international standard for document definition (ISO 8879) that was adopted in 1986 and is rapidly gaining acceptance in industry and government. It is a meta-language system for document design rather than a specific scheme for document processing; almost any kind of document can be described using SGML. Productions called element declarations are used to define arbitrary elements of documents and the context in which they can occur. A finite set of element declarations called a document type definition (DTD) defines the high-level syntax of a set of documents. DTDs are similar to context-free grammars, but the productions are more complex. The standard does not describe a formal language model for SGML, and there is little work in the literature on this topic. This article defines a formal language model for SGML; systems of finite automata from systems of regular expressions. This model is applied in two ways: a parser is constructed for DTDs, and methods are shown for automatically constructing parsers for the documents defined by a DTD. These methods for parsing SGML are new, and they include features of DTDs that have not previously been included in a static language model. The model applies directly to the syntactic constructs of SGML, and thus. the methods shown in this article have distinct advantages for parsing SGML over traditional context-free parsing methods. (C) 1997 by Elsevier Science Inc.
引用
收藏
页码:147 / 166
页数:20
相关论文
共 50 条
  • [31] Generic information knowledge model and formal natural language
    Ota, Y
    Ohba, H
    ISAS/CITSA 2004: INTERNATIONAL CONFERENCE ON CYBERNETICS AND INFORMATION TECHNOLOGIES, SYSTEMS AND APPLICATIONS AND 10TH INTERNATIONAL CONFERENCE ON INFORMATION SYSTEMS ANALYSIS AND SYNTHESIS, VOL 4, PROCEEDINGS, 2004, : 58 - 63
  • [32] Formal Language Model for Transcriptome and Proteome Data Integration
    da Silva Filho, Reginaldo Inojosa
    de Azevedo da Rocha, Ricardo Luis
    Oliveira, Claudio Santos
    COMPUTATIONAL SCIENCE AND ITS APPLICATIONS - ICCSA 2020, PT V, 2020, 12253 : 727 - 735
  • [33] Algebraic Model and Formal Description Language of Software Architecture
    Chen, Wei
    Li, Tong
    Li, Jinglei
    PROCEEDINGS OF THE FIRST INTERNATIONAL WORKSHOP ON EDUCATION TECHNOLOGY AND COMPUTER SCIENCE, VOL II, 2009, : 659 - 665
  • [34] Spatiotemporal database formal semantic model and algebraic language
    Kim, DH
    Lee, YJ
    Kim, JS
    Ryu, KH
    CIC'2000: PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON COMMUNICATIONS IN COMPUTING, 2000, : 9 - 15
  • [35] A formal model for parameterized solids in a visual design language
    Cox, PT
    Smedley, TJ
    JOURNAL OF VISUAL LANGUAGES AND COMPUTING, 2000, 11 (06): : 687 - 710
  • [36] A FORMAL SPECIFICATION LANGUAGE FOR DYNAMIC STRAND SPACE MODEL
    刘东喜
    李晓勇
    白英彩
    JournalofShanghaiJiaotongUniversity, 2002, (01) : 23 - 25
  • [37] VeriFormal: An Executable Formal Model of a Hardware Description Language
    Khan, Wilayat
    Tiu, Alwen
    Sanan, David
    SYSTEMS APPROACH TO CYBER SECURITY, 2017, 15 : 19 - 36
  • [38] Application of Standard General Markup Language (SGML) in a library environment
    Tresorukov, VN
    Terekhov, AA
    Sobolev, VA
    INTERNATIONAL CONFERENCE ON LIBRARY AUTOMATION IN CENTRAL AND EASTERN EUROPE, CONFERENCE PROCEEDINGS, 1997, : 153 - 155
  • [39] CONCURRENT, OBJECT-ORIENTED NATURAL-LANGUAGE PARSING - THE PARSETALK MODEL
    HAHN, U
    SCHACHT, S
    BROKER, N
    INTERNATIONAL JOURNAL OF HUMAN-COMPUTER STUDIES, 1994, 41 (1-2) : 179 - 222
  • [40] Natural Language Parsing of Patient Complaints in Indonesian Language
    Ratnasari, Chanifah Indah
    Kusumadewi, Sri
    Rosita, Linda
    2015 INTERNATIONAL CONFERENCE ON SCIENCE AND TECHNOLOGY (TICST), 2015, : 292 - 297