Mining information extraction rules from datasheets without linguistic parsing

Cited by: 0

Authors:

Agrawal, R

Ho, H

Jacquenet, F

Jacquenet, M

Affiliations:

[1] IBM Corp, Almaden Res Ctr, San Jose, CA 95120 USA

[2] Univ St Etienne, F-42023 St Etienne, France

Source:

INNOVATIONS IN APPLIED ARTIFICIAL INTELLIGENCE | 2005 / Vol. 3533

Keywords:

text mining; information extraction;

DOI:

Not available

Chinese Library Classification number:

TP18 [Artificial intelligence theory];

Subject classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

In the context of the Pangea project at IBM, we needed to design an information extraction module to extract information from datasheets. Unlike several information extraction systems that rely on machine learning techniques and require linguistic parsing of the documents, we propose a hybrid approach based on association rule mining and decision tree learning that does not require any linguistic processing. The system can be parameterized in various ways that influence the efficiency of the information extraction rules it discovers. The experiments show that the system does not need a large training set to perform well.


Pages: 510-520

Page count: 11

50 items in total

[1] Parsing without grammar rules
Matsumoto, Yuji
GRAMMATICAL INFERENCE: ALGORITHMS AND APPLICATIONS, PROCEEDINGS, 2006, 4201 : 1 - 6
[2] Automatic information extraction from texts with inference and linguistic knowledge acquisition rules
de Araujo, Denis A.
Rigo, Sandro J.
Muller, Carolina
Chishman, Rove
2013 IEEE/WIC/ACM INTERNATIONAL JOINT CONFERENCE ON WEB INTELLIGENCE AND INTELLIGENT AGENT TECHNOLOGY - WORKSHOPS (WI-IAT), VOL 3, 2013, : 151 - 154
[3] Full text parsing using cascades of rules: an information extraction perspective
Ciravegna, F
Lavelli, A
NINTH CONFERENCE OF THE EUROPEAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, PROCEEDINGS, 1999, : 102 - 109
[4] Medication information extraction with linguistic pattern matching and semantic rules
Spasic, Irene
Sarafraz, Farzaneh
Keane, John A.
Nenadic, Goran
JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 2010, 17 (05) : 532 - 535
[5] Efficient parsing for Information Extraction
Basili, R
Pazienza, MT
Zanzotto, FM
ECAI 1998: 13TH EUROPEAN CONFERENCE ON ARTIFICIAL INTELLIGENCE, PROCEEDINGS, 1998, : 135 - 139
[6] Mining linguistic valued association rules
Zou, Xiao-Feng
Lu, Jian-Jiang
Song, Zi-Lin
Xitong Fangzhen Xuebao / Journal of System Simulation, 2002, 14 (09):
[7] Mining association rules with linguistic terms
Lu, JJ
Xu, BW
Xu, L
Kang, DZ
Chen, HW
Yang, HJ
15TH IEEE INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE, PROCEEDINGS, 2003, : 129 - 133
[8] MINING RULES FROM INCOMPLETE INFORMATION SYSTEM
Yin, Xuri
INTERNATIONAL SYMPOSIUM ON COMPUTER SCIENCE & TECHNOLOGY, PROCEEDINGS, 2009, : 435 - 437
[9] Unified parsing and information extraction language
Bednar, Peter
2016 IEEE 14TH INTERNATIONAL SYMPOSIUM ON APPLIED MACHINE INTELLIGENCE AND INFORMATICS (SAMI), 2016, : 131 - 135
[10] Building intelligent systems for mining information extraction rules from Web pages by using domain knowledge
Seo, H
Yang, J
Choi, J
ISIE 2001: IEEE INTERNATIONAL SYMPOSIUM ON INDUSTRIAL ELECTRONICS PROCEEDINGS, VOLS I-III, 2001, : 322 - 327
