Data Acquisition and Information Extraction for Scientific Knowledge Base Building

Cited by: 2

Authors:

Andruszkiewicz, Piotr [1]

Rybinski, Henryk [1]

Affiliations:

[1] Warsaw Univ Technol, Inst Comp Sci, Warsaw, Poland

Source:

2018 IEEE 12TH INTERNATIONAL CONFERENCE ON SEMANTIC COMPUTING (ICSC) | 2018

Keywords:

DOI:

10.1109/ICSC.2018.00045

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Subject classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Here we present the process of data acquisition and information extraction for building a comprehensive and accurate scientific knowledge base that covers conferences, publications and scientists. We use two kinds of data sources. First, we gather data from structured and reliable, but incomplete and not always up-to-date, sources such as digital libraries. We then enrich the information extracted from those sources with unstructured data obtained from the Internet, filtering websites with an SVM classifier to identify potentially useful web pages. There are two potential sources of error in the enrichment process. The first is the unstructured origin of the data; the second is the limited accuracy of the machine learning methods used for data acquisition and information extraction. We address both problems by proposing a new information extraction method and by using crowdsourcing to correct information. Our methods are currently used in a scientific platform, namely the Omega-psi(R) university knowledge base, which contains lists of researchers, publications, events, etc.
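The web page filtering step mentioned in the abstract can be illustrated with a minimal sketch. The abstract does not state which features, kernel, or training procedure the authors use, so everything below is an assumption made for illustration: bag-of-words term counts as features, a linear SVM trained by subgradient descent on the regularised hinge loss, and a hypothetical four-page training set (`TRAINING_PAGES`). It is not the authors' implementation.

```python
from collections import Counter

# Hypothetical toy training set: (page text, label); +1 = potentially useful page.
TRAINING_PAGES = [
    ("call for papers conference deadline", +1),
    ("buy cheap shoes online sale", -1),
    ("professor publications research group", +1),
    ("football match results tonight", -1),
]

def featurize(text):
    """Bag-of-words term counts (an assumed feature choice)."""
    return Counter(text.lower().split())

def score(weights, bias, features):
    """Linear decision value w.x + b over sparse features."""
    return sum(weights.get(t, 0.0) * c for t, c in features.items()) + bias

def train_linear_svm(samples, epochs=20, lr=0.1, lam=0.01):
    """Linear SVM fitted by subgradient descent on the L2-regularised hinge loss."""
    weights, bias = {}, 0.0
    for _ in range(epochs):
        for text, label in samples:
            feats = featurize(text)
            margin = label * score(weights, bias, feats)
            # The L2 penalty shrinks every weight slightly on each step.
            for term in weights:
                weights[term] *= 1.0 - lr * lam
            # The hinge loss contributes only when the margin is violated.
            if margin < 1.0:
                for term, count in feats.items():
                    weights[term] = weights.get(term, 0.0) + lr * label * count
                bias += lr * label
    return weights, bias

def is_useful(weights, bias, text):
    """Keep a page when it falls on the positive side of the hyperplane."""
    return score(weights, bias, featurize(text)) > 0.0

weights, bias = train_linear_svm(TRAINING_PAGES)
print(is_useful(weights, bias, "conference publications deadline"))
print(is_useful(weights, bias, "cheap football sale"))
```

In a real pipeline the classifier would be trained on a much larger labelled sample of crawled pages, and only pages it accepts would be passed on to information extraction.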

Pages: 256-259

Page count: 4
