Efficient parsing for Information Extraction

被引:0
|
作者
Basili, R [1 ]
Pazienza, MT [1 ]
Zanzotto, FM [1 ]
机构
[1] Univ Roma Tor Vergata, Dipartimento Informat Sist & Prod, I-00173 Rome, Italy
来源
ECAI 1998: 13TH EUROPEAN CONFERENCE ON ARTIFICIAL INTELLIGENCE, PROCEEDINGS | 1998年
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Several (and successfull) Information Extraction systems have recently replaced the core parsing components with shallow but more efficient recognizers. In this paper we argue that the absence of an underlying grammatical recognizer, given the complex nature of several (non-english) languages, is a strong limitation for text processing functionalities, like those an IE system needs. We propose a robust and efficient syntactic recognizer mainly aimed to capture grammatical information crucial for several linguistic and non linguistic inferences. The proposed system is based on a novel architecture exploiting two major principles: lexicalization and stratification of the parsing process. As several linguistic theories (e.g. HPSG) and parsing frameworks (e.g. LTAG, SLTAG, lexicalized probabilistic parsing) suggest, lexicon-driven systems ensure the suitable forms of grammatical control for many complex phenomena. In our system an analysis guided by information on typical verb projections (e.g. verb subcategorization structures) is coupled with extended locality constraints (i.e. recognition of clause boundaries). Futhermore, stratification is also employed. A cascade of processing steps starts from chunk recognition and proceeds through clause analysis to dependency detection. Recognition of chunks allows to minimize the input ambiguity to the remaining phases. The resulting system is thus robust against ungrammatical phenomena (e.g. complex clause embedding, misspellings, unknown words). Efficiency is also retained, although ambiguous phenomena (multiple PP attachments) are recognized.
引用
收藏
页码:135 / 139
页数:5
相关论文
共 50 条
  • [21] A framework for parsing heritable information
    Jose, Antony M.
    JOURNAL OF THE ROYAL SOCIETY INTERFACE, 2020, 17 (165)
  • [22] PARSING TEMPORAL AND SPATIAL INFORMATION
    Maranduc, Catalina
    Bobicev, Victoria
    Perez, Cenel Augusto
    PROCEEDINGS OF THE 15TH INTERNATIONAL CONFERENCE LINGUISTIC RESOURCES AND TOOLS FOR NATURAL LANGUAGE PROCESSING, 2020, : 17 - 28
  • [23] Efficient and flexible incremental parsing
    Wagner, TA
    Graham, SL
    ACM TRANSACTIONS ON PROGRAMMING LANGUAGES AND SYSTEMS, 1998, 20 (05): : 980 - 1013
  • [24] Efficient Constituency Parsing by Pointing
    Thanh-Tung Nguyen
    Xuan-Phi Nguyen
    Joty, Shafiq
    Li, Xiaoli
    58TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2020), 2020, : 3284 - 3294
  • [25] Community Structure for Efficient Information Flow in 'ToS;DR', a Social Machine for Parsing Legalese
    Binns, Reuben
    Matthews, David
    WWW'14 COMPANION: PROCEEDINGS OF THE 23RD INTERNATIONAL CONFERENCE ON WORLD WIDE WEB, 2014, : 881 - 883
  • [26] Parsing with principles and classes of information
    Satta, G
    COMPUTATIONAL LINGUISTICS, 1998, 24 (01) : 167 - 172
  • [27] Road and Car Extraction Using UAV Images via Efficient Dual Contextual Parsing Network
    Sun, Yueming
    Shao, Zhenfeng
    Cheng, Gui
    Huang, Xiao
    Wang, Zhongyuan
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60
  • [28] Efficient parsing with parser combinators
    Kurs, Jan
    Vrany, Jan
    Ghafari, Mohammad
    Lungu, Mircea
    Nierstrasz, Oscar
    SCIENCE OF COMPUTER PROGRAMMING, 2018, 161 : 57 - 88
  • [29] Efficient and flexible incremental parsing
    Univ of California, Berkeley, United States
    ACM Trans Program Lang Syst, 5 (980-1013):
  • [30] Program extraction applied to monadic parsing
    Berger, Ulrich
    Jones, Alison
    Seisenberger, Monika
    JOURNAL OF LOGIC AND COMPUTATION, 2019, 29 (04) : 487 - 518