Leveraging Gene Ontology Annotations to Improve a Memory-Based Language Understanding System

被引:3
|
作者
Livingston, Kevin M. [1 ]
Johnson, Helen L. [1 ]
Verspoor, Karin [1 ]
Hunter, Lawrence E. [1 ]
机构
[1] Univ Colorado Denver, Ctr Computat Pharmacol, Aurora, CO 80045 USA
关键词
natural langugage processing (NLP); direct memory access parsing (DMAP); OpenDMAP; memory; Gene Ontology annotations; biological event extraction;
D O I
10.1109/ICSC.2010.62
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This work evaluates how detailed knowledge about proteins can be leveraged for language understanding and disambiguation by OpenDMAP. OpenDMAP is a memory-based language understanding system that uses patterns to identify concepts in text. These patterns match not only lexical elements, such as words, but also semantic elements, such as references to proteins. This work started with an existing pattern set used to extract biological activation events from a corpus of GeneRIFs (sentences or phrases that each describe one of many of the functions of a gene). This is a challenging task because many distinct activation concepts, in addition to being semantically similar, are described using very similar language. We augment the previous approach with additional semantic knowledge about proteins, in the form of associated Gene Ontology annotations, and a small corresponding modification to the ontology used by OpenDMAP. By incorporating additional background knowledge we demonstrate that performance can be significantly improved without modifying the pattern set being used. Specifically precision is improved by 20%, at a modest 6% cost to recall. The additional semantic knowledge allows for more specificity in the ontology used by OpenDMAP, which in turn automatically improves the specificity of the patterns being used to extract knowledge from text reducing false positives by 75%.
引用
收藏
页码:40 / 45
页数:6
相关论文
共 50 条
  • [41] A Memory-Based Decision-Making Model for Multilingual Alternatives: The Role of Memory, Emotion and Language
    Djouamai, Zineb
    Ying, Li
    INTELLIGENT SYSTEMS AND APPLICATIONS, VOL 1, 2020, 1037 : 1121 - 1137
  • [42] Using Bert Embedding to improve memory-based collaborative filtering recommender systems
    Bui Nguyen Minh Hoang
    Ho Thi Hoang Vy
    Tiet Gia Hong
    Vu Thi My Hang
    Ho Le Thi Kim Nhung
    Le Nguyen Hoai Nam
    2021 RIVF INTERNATIONAL CONFERENCE ON COMPUTING AND COMMUNICATION TECHNOLOGIES (RIVF 2021), 2021, : 150 - 155
  • [43] Hybrid model-based and memory-based traffic prediction system
    Alecsandru, C
    Ishak, S
    INFORMATION SYSTEMS AND TECHNOLOGY, 2004, (1879): : 59 - 70
  • [44] Double Hopf Bifurcation Analysis in the Memory-based Diffusion System
    Song, Yongli
    Peng, Yahong
    Zhang, Tonghua
    JOURNAL OF DYNAMICS AND DIFFERENTIAL EQUATIONS, 2024, 36 (02) : 1635 - 1676
  • [45] A Novel Memory-based ARQ System with its Analysis of Throughput
    Guan, Sheng-Yong
    Li, Yong
    Zhang, Yi
    2012 INTERNATIONAL WORKSHOP ON INFORMATION AND ELECTRONICS ENGINEERING, 2012, 29 : 1176 - 1180
  • [46] A MASSIVELY-PARALLEL MEMORY-BASED STORY SYSTEM FOR PSYCHOTHERAPY
    SMITH, RN
    CHEN, CC
    FENG, FF
    GOMEZGAUCHIA, H
    COMPUTERS AND BIOMEDICAL RESEARCH, 1993, 26 (05): : 415 - 423
  • [47] Ontology-based Architecture for Reusing and Learning Through Context-aware Annotations Memory
    Aloui, Nadia
    Gargouri, Faiez
    SIXTH INTERNATIONAL MULTI-CONFERENCE ON COMPUTING IN THE GLOBAL INFORMATION TECHNOLOGY (ICCGI 2011), 2011, : 154 - 159
  • [48] The spatially inhomogeneous Hopf bifurcation induced by memory delay in a memory-based diffusion system
    Song, Yongli
    Peng, Yahong
    Zhang, Tonghua
    JOURNAL OF DIFFERENTIAL EQUATIONS, 2021, 300 : 597 - 624
  • [49] Analysis of natural language understanding technology based on Semantic Web ontology
    Wang, Yi
    Zhang, Jianming
    Cao, Zhenjie
    PROCEEDINGS OF THE 2015 INTERNATIONAL CONFERENCE ON MECHATRONICS, ELECTRONIC, INDUSTRIAL AND CONTROL ENGINEERING, 2015, 8 : 889 - 893
  • [50] GOChase-II: correcting semantic inconsistencies from Gene Ontology-based annotations for gene products
    Park, Yu Rang
    Kim, Jihun
    Lee, Hye Won
    Yoon, Young Jo
    Kim, Ju Han
    BMC BIOINFORMATICS, 2011, 12