Positome: A Method for Improving Protein-Protein Interaction Quality and Prediction Accuracy

Authors
Dick, Kevin [1]
Dehne, Frank [2]
Golshani, Ashkan [3]
Green, James R. [1]
Affiliations
[1] Carleton Univ, Dept Syst & Comp Engn, Ottawa, ON, Canada
[2] Carleton Univ, Sch Comp Sci, Ottawa, ON, Canada
[3] Carleton Univ, Inst Biochem, Dept Biol, Ottawa, ON, Canada
Keywords
protein-protein interaction prediction; data quality; datasets; data provenance; machine learning; INTERACTION DATABASE; NETWORK; INTACT
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
As wet-lab techniques continue to improve in both throughput and precision, the progressive elucidation of positive protein-protein interactions (PPIs) has increased the number and quality of known PPIs across the spectrum of life. Creating high-quality datasets of positive PPIs is critical for training PPI prediction algorithms and for assessing the performance of PPI detection efforts. We present the Positome, a web service for acquiring sets of positive PPIs based on user-defined data-provenance criteria, including interaction type, throughput level, and detection method, in addition to filtration by multiple lines of evidence (i.e., PPIs reported by independent research groups). The Positome provides a tunable interface to obtain a specified subset of PPIs from the BioGRID database. Both intra- and inter-species PPIs are supported. Using a number of model organisms, we demonstrate the trade-off between data quality and quantity, and the benefit of higher data quality for PPI prediction precision and recall. A web interface and REST web service are available at http://bioinf.sce.carleton.ca/POSITOME/.
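The provenance-based filtering described in the abstract can be illustrated with a minimal sketch. It is not the Positome implementation and does not call its REST service: the `Evidence` record layout, the `filter_positive_ppis` function, its parameters, and the sample records are all hypothetical, and "independent research groups" is approximated here by counting distinct publications per protein pair.

```python
from collections import defaultdict
from typing import NamedTuple, Optional


class Evidence(NamedTuple):
    """One reported interaction (hypothetical layout, not the BioGRID schema)."""
    protein_a: str
    protein_b: str
    interaction_type: str  # e.g. "physical" or "genetic"
    throughput: str        # e.g. "low" or "high"
    method: str            # detection method label
    publication: str       # identifier of the reporting publication


def filter_positive_ppis(records, interaction_type="physical",
                         throughputs=("low", "high"),
                         methods: Optional[set] = None,
                         min_publications=2):
    """Return sorted protein pairs that pass every provenance criterion."""
    support = defaultdict(set)
    for r in records:
        if r.interaction_type != interaction_type:
            continue
        if r.throughput not in throughputs:
            continue
        if methods is not None and r.method not in methods:
            continue
        # Treat (A, B) and (B, A) as the same undirected pair.
        pair = tuple(sorted((r.protein_a, r.protein_b)))
        # Assumption: distinct publications stand in for independent groups.
        support[pair].add(r.publication)
    return sorted(p for p, pubs in support.items()
                  if len(pubs) >= min_publications)


# Made-up sample records for illustration only.
records = [
    Evidence("P1", "P2", "physical", "low", "Two-hybrid", "pub1"),
    Evidence("P2", "P1", "physical", "high", "Affinity Capture-MS", "pub2"),
    Evidence("P1", "P3", "physical", "high", "Two-hybrid", "pub1"),
    Evidence("P3", "P4", "genetic", "low", "Synthetic Lethality", "pub3"),
    Evidence("P3", "P4", "genetic", "low", "Synthetic Lethality", "pub4"),
]

strict = filter_positive_ppis(records)                       # two or more publications
relaxed = filter_positive_ppis(records, min_publications=1)  # any single report
```

Raising `min_publications` shrinks the returned set while keeping only better-supported pairs, which is the quality versus quantity trade-off the abstract refers to.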
Pages: 162-169
Page count: 8