Mining information extraction rules from datasheets without linguistic parsing

被引:0
|
作者
Agrawal, R
Ho, H
Jacquenet, F
Jacquenet, M
机构
[1] IBM Corp, Almaden Res Ctr, San Jose, CA 95120 USA
[2] Univ St Etienne, F-42023 St Etienne, France
来源
INNOVATIONS IN APPLIED ARTIFICIAL INTELLIGENCE | 2005年 / 3533卷
关键词
text mining; information extraction;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In the context of the Pangea project at IBM, we needed to design an information extraction module in order to extract some information from datasheets. Contrary to several information extraction systems based on some machine learning techniques that need some linguistic parsing of the documents, we propose an hybrid approach based on association rules mining and decision tree learning that does not require any linguistic processing. The system may be parameterized in various ways that influence the efficiency of the information extraction rules we discovered. The experiments show the system does not need a large training set to perform well.
引用
收藏
页码:510 / 520
页数:11
相关论文
共 50 条
  • [1] Parsing without grammar rules
    Matsumoto, Yuji
    GRAMMATICAL INFERENCE: ALGORITHMS AND APPLICATIONS, PROCEEDINGS, 2006, 4201 : 1 - 6
  • [2] Automatic information extraction from texts with inference and linguistic knowledge acquisition rules
    de Araujo, Denis A.
    Rigo, Sandro J.
    Muller, Carolina
    Chishman, Rove
    2013 IEEE/WIC/ACM INTERNATIONAL JOINT CONFERENCE ON WEB INTELLIGENCE AND INTELLIGENT AGENT TECHNOLOGY - WORKSHOPS (WI-IAT), VOL 3, 2013, : 151 - 154
  • [3] Full text parsing using cascades of rules: an information extraction perspective
    Ciravegna, F
    Lavelli, A
    NINTH CONFERENCE OF THE EUROPEAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, PROCEEDINGS, 1999, : 102 - 109
  • [4] Medication information extraction with linguistic pattern matching and semantic rules
    Spasic, Irene
    Sarafraz, Farzaneh
    Keane, John A.
    Nenadic, Goran
    JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 2010, 17 (05) : 532 - 535
  • [5] Efficient parsing for Information Extraction
    Basili, R
    Pazienza, MT
    Zanzotto, FM
    ECAI 1998: 13TH EUROPEAN CONFERENCE ON ARTIFICIAL INTELLIGENCE, PROCEEDINGS, 1998, : 135 - 139
  • [6] Mining linguistic valued association rules
    Zou, Xiao-Feng
    Lu, Jian-Jiang
    Song, Zi-Lin
    Xitong Fangzhen Xuebao / Journal of System Simulation, 2002, 14 (09):
  • [7] Mining association rules with linguistic terms
    Lu, JJ
    Xu, BW
    Xu, L
    Kang, DZ
    Chen, HW
    Yang, HJ
    15TH IEEE INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE, PROCEEDINGS, 2003, : 129 - 133
  • [8] MINING RULES FROM INCOMPLETE INFORMATION SYSTEM
    Yin, Xuri
    INTERNATIONAL SYMPOSIUM ON COMPUTER SCIENCE & TECHNOLOGY, PROCEEDINGS, 2009, : 435 - 437
  • [9] Unified parsing and information extraction language
    Bednar, Peter
    2016 IEEE 14TH INTERNATIONAL SYMPOSIUM ON APPLIED MACHINE INTELLIGENCE AND INFORMATICS (SAMI), 2016, : 131 - 135
  • [10] Building intelligent systems for mining information extraction rules from Web pages by using domain knowledge
    Seo, H
    Yang, J
    Choi, J
    ISIE 2001: IEEE INTERNATIONAL SYMPOSIUM ON INDUSTRIAL ELECTRONICS PROCEEDINGS, VOLS I-III, 2001, : 322 - 327