Knowledge Extraction and Semantic Annotation of Text from the Encyclopedia of Life

Cited by: 16
Authors
Thessen, Anne E. [1]
Parr, Cynthia Sims [2]
Affiliations
[1] Arizona State Univ, Sch Life Sci, Tempe, AZ 85283 USA
[2] Smithsonian Inst, Natl Museum Nat Hist, Washington, DC 20560 USA
Source
PLOS ONE | 2014 / Vol. 9 / Issue 03
DOI
10.1371/journal.pone.0089550
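Chinese Library Classification (CLC) number
O [Mathematical Sciences and Chemistry]; P [Astronomy and Earth Sciences]; Q [Biological Sciences]; N [General Natural Sciences];
Discipline classification code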
07 ; 0710 ; 09 ;
Abstract
Numerous digitization and ontological initiatives have focused on translating biological knowledge from narrative text to machine-readable formats. In this paper, we describe two workflows for knowledge extraction and semantic annotation of text data objects featured in an online biodiversity aggregator, the Encyclopedia of Life. One workflow tags text with DBpedia URIs based on keywords. Another workflow finds taxon names in text using GNRD for the purpose of building a species association network. Both workflows work well: the annotation workflow has an F1 Score of 0.941 and the association algorithm has an F1 Score of 0.885. Existing text annotators such as Terminizer and DBpedia Spotlight performed well, but require some optimization to be useful in the ecology and evolution domain. Important future work includes scaling up and improving accuracy through the use of distributional semantics.
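
As a rough illustration of the two ideas named in the abstract (keyword-based tagging with DBpedia URIs, and evaluation by F1 score), here is a minimal Python sketch. It is not the authors' workflow: the keyword table, the example sentence, the gold-standard tag set, and the function names tag_text and f1_score are all hypothetical, chosen only to show the mechanics. The one fact relied on is that DBpedia resource URIs take the form http://dbpedia.org/resource/<Name>.

```python
import re

# Hypothetical keyword-to-URI table; the paper's actual keyword list is not given here.
KEYWORD_URIS = {
    "predator": "http://dbpedia.org/resource/Predation",
    "nocturnal": "http://dbpedia.org/resource/Nocturnality",
    "herbivore": "http://dbpedia.org/resource/Herbivore",
}

def tag_text(text, keyword_uris=KEYWORD_URIS):
    """Return the set of URIs whose keyword occurs as a whole word in text."""
    tokens = set(re.findall(r"[a-z]+", text.lower()))
    return {uri for keyword, uri in keyword_uris.items() if keyword in tokens}

def f1_score(predicted, expected):
    """F1 is the harmonic mean of precision and recall over two tag sets."""
    true_pos = len(predicted & expected)
    if true_pos == 0:
        return 0.0
    precision = true_pos / len(predicted)
    recall = true_pos / len(expected)
    return 2 * precision * recall / (precision + recall)

# Illustrative input and a made-up gold standard (assumptions, not data from the paper).
tags = tag_text("This nocturnal predator hunts small rodents.")
gold = {
    "http://dbpedia.org/resource/Predation",
    "http://dbpedia.org/resource/Nocturnality",
    "http://dbpedia.org/resource/Rodent",
}
score = f1_score(tags, gold)
print(sorted(tags), round(score, 3))
```

In this toy case both predicted tags are correct but one gold tag is missed, so precision is 1.0 and recall is 2/3. A real annotator would need stemming, multi-word terms, and disambiguation, which this sketch deliberately omits.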
Pages: 10