Knowledge Extraction and Semantic Annotation of Text from the Encyclopedia of Life

被引:16
|
作者
Thessen, Anne E. [1 ]
Parr, Cynthia Sims [2 ]
机构
[1] Arizona State Univ, Sch Life Sci, Tempe, AZ 85283 USA
[2] Smithsonian Inst, Natl Museum Nat Hist, Washington, DC 20560 USA
来源
PLOS ONE | 2014年 / 9卷 / 03期
关键词
D O I
10.1371/journal.pone.0089550
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Numerous digitization and ontological initiatives have focused on translating biological knowledge from narrative text to machine-readable formats. In this paper, we describe two workflows for knowledge extraction and semantic annotation of text data objects featured in an online biodiversity aggregator, the Encyclopedia of Life. One workflow tags text with DBpedia URIs based on keywords. Another workflow finds taxon names in text using GNRD for the purpose of building a species association network. Both workflows work well: the annotation workflow has an F1 Score of 0.941 and the association algorithm has an F1 Score of 0.885. Existing text annotators such as Terminizer and DBpedia Spotlight performed well, but require some optimization to be useful in the ecology and evolution domain. Important future work includes scaling up and improving accuracy through the use of distributional semantics.
引用
收藏
页数:10
相关论文
共 50 条
  • [1] Semantic role labeling for knowledge graph extraction from text
    Mehwish Alam
    Aldo Gangemi
    Valentina Presutti
    Diego Reforgiato Recupero
    Progress in Artificial Intelligence, 2021, 10 : 309 - 320
  • [2] Semantic property grammars for knowledge extraction from biomedical text
    Dahl, Veronica
    Gu, Baohua
    LOGIC PROGRAMMING, PROCEEDINGS, 2006, 4079 : 442 - 443
  • [3] Semantic role labeling for knowledge graph extraction from text
    Alam, Mehwish
    Gangemi, Aldo
    Presutti, Valentina
    Reforgiato Recupero, Diego
    PROGRESS IN ARTIFICIAL INTELLIGENCE, 2021, 10 (03) : 309 - 320
  • [4] Semantic annotation of a natural language corpus for knowledge extraction
    Navarro, B
    Martínez-Barco, P
    Palomar, M
    NATURAL LANGUAGE PROCESSING AND INFORMATION SYSTEMS, PROCEEDINGS, 2005, 3513 : 365 - 368
  • [5] QuickGraph: A Rapid Annotation Tool for Knowledge Graph Extraction from Technical Text
    Bikaun, Tyler
    Stewart, Michael
    Liu, Wei
    PROCEEDINGS OF THE 60TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2022): PROCEEDINGS OF SYSTEM DEMONSTRATIONS, 2022, : 270 - 278
  • [6] Modeling user knowledge and semantic structure for information extraction from text
    Moertl, PM
    ICCM - 2001: PROCEEDINGS OF THE 2001 FOURTH INTERNATIONAL CONFERENCE ON COGNITIVE MODELING, 2001, : 283 - 284
  • [7] ENVIRONMENTS and EOL: identification of Environment Ontology terms in text and the annotation of the Encyclopedia of Life
    Pafilis, Evangelos
    Frankild, Sune P.
    Schnetzer, Julia
    Fanini, Lucia
    Faulwetter, Sarah
    Pavloudi, Christina
    Vasileiadou, Katerina
    Leary, Patrick
    Hammock, Jennifer
    Schulz, Katja
    Parr, Cynthia Sims
    Arvanitidis, Christos
    Jensen, Lars Juhl
    BIOINFORMATICS, 2015, 31 (11) : 1872 - 1874
  • [8] Linguistic Extraction for Semantic Annotation
    Dedek, Jan
    Vojtas, Peter
    INTELLIGENT DISTRIBUTED COMPUTING, SYSTEMS AND APPLICATIONS, 2008, 162 : 85 - +
  • [9] Web Knowledge Discovery Trends: From Semantic Annotation to Semantic Apis
    Dotsika, Fefie
    PROCEEDINGS OF THE 6TH INTERNATIONAL CONFERENCE ON INTELLECTUAL CAPITAL, KNOWLEDGE MANAGEMENT & ORGANISATIONAL LEARNING, 2009, : 312 - 318
  • [10] Semantic annotation: Mapping text to ontologies
    Laboratoire d'Informatique de Paris-Nord, CNRS, Universiteá Paris 13, 99, Avenue J-B. Cleáment, F-93430 Villetaneuse, France
    Int. J. Metadata Semant. Ontol., 2007, 2 (67-78):