PPInterFinder- a mining tool for extracting causal relations on human proteins from literature

被引:46
|
作者
Raja, Kalpana [1 ]
Subramani, Suresh [1 ]
Natarajan, Jeyakumar [1 ]
机构
[1] Bharathiar Univ, Dept Bioinformat, Data Min & Text Min Lab, Coimbatore 641046, Tamil Nadu, India
关键词
EVENT EXTRACTION; DISCOVERING PATTERNS; INFORMATION; CORPUS; TEXT;
D O I
10.1093/database/bas052
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
One of the most common and challenging problem in biomedical text mining is to mine protein-protein interactions (PPIs) from MEDLINE abstracts and full-text research articles because PPIs play a major role in understanding the various biological processes and the impact of proteins in diseases. We implemented, PPInterFinder-a web-based text mining tool to extract human PPIs from biomedical literature. PPInterFinder uses relation keyword co-occurrences with protein names to extract information on PPIs from MEDLINE abstracts and consists of three phases. First, it identifies the relation keyword using a parser with Tregex and a relation keyword dictionary. Next, it automatically identifies the candidate PPI pairs with a set of rules related to PPI recognition. Finally, it extracts the relations by matching the sentence with a set of 11 specific patterns based on the syntactic nature of PPI pair. We find that PPInterFinder is capable of predicting PPIs with the accuracy of 66.05% on AIMED corpus and outperforms most of the existing systems.
引用
收藏
页数:11
相关论文
共 45 条
  • [41] BioLitMine: Advanced Mining of Biomedical and Biological Literature About Human Genes and Genes from Major Model Organisms
    Hu, Yanhui
    Chung, Verena
    Comjean, Aram
    Rodiger, Jonathan
    Nipun, Fnu
    Perrimon, Norbert
    Mohr, Stephanie E.
    G3-GENES GENOMES GENETICS, 2020, 10 (12): : 4531 - 4539
  • [42] Prediction of human phosphorylated proteins by extracting multi-perspective discriminative features from the evolutionary profile and physicochemical properties through LFDA
    Ahmed, Saeed
    Kabir, Muhammad
    Arif, Muhammad
    Ali, Zakir
    Swati, Zar Nawab Khan
    CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS, 2020, 203 (203)
  • [43] Answering Binary Causal Questions Through Large-Scale Text Mining: An Evaluation Using Cause-Effect Pairs from Human Experts
    Hassanzadeh, Oktie
    Bhattacharjya, Debarun
    Feblowitz, Mark
    Srinivas, Kavitha
    Perrone, Michael
    Sohrabi, Shirin
    Katz, Michael
    PROCEEDINGS OF THE TWENTY-EIGHTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2019, : 5003 - 5009
  • [44] Fluorescence difference gel electrophoresis saturation labelling as a novel tool to analyze glomerular proteins from smallest scale murine and human samples
    Potthoff, Sebastian
    Sitek, Barbara
    Schulenborg, Thomas
    Stebauer, Johannes
    Vinke, Tobias
    Rump, Lars-Christian
    Meyer, Helmut E.
    Stuehler, Kai
    Vonend, Oliver
    NEPHROLOGY DIALYSIS TRANSPLANTATION, 2006, 21 : 35 - 36
  • [45] Simultaneous speculation of 401 monomeric or homo-oligomeric subunit structures of human cellular proteins, mining the information in 1901 native 2D protein maps reconstructed from one nondenaturing 2DE gel
    Jin, Ya
    Manabe, Takashi
    JOURNAL OF CHROMATOGRAPHY B-ANALYTICAL TECHNOLOGIES IN THE BIOMEDICAL AND LIFE SCIENCES, 2020, 1144 (1144):