FunPred 3.0: improved protein function prediction using protein interaction network

被引:12
|
作者
Saha, Sovan [1 ]
Chatterjee, Piyali [2 ]
Basu, Subhadip [3 ]
Nasipuri, Mita [3 ]
Plewczynski, Dariusz [4 ,5 ]
机构
[1] Dr Sudhir Chandra Degree Engn Coll, Dept Comp Sci & Engn, Kolkata, W Bengal, India
[2] Netaji Subhash Engn Coll, Dept Comp Sci & Engn, Kolkata, India
[3] Jadavpur Univ, Dept Comp Sci & Engn, Kolkata, W Bengal, India
[4] Univ Warsaw, Ctr New Technol, Lab Funct & Struct Genom, Warsaw, Poland
[5] Warsaw Univ Technol, Fac Math & Informat Sci, Warsaw, Poland
来源
PEERJ | 2019年 / 7卷
基金
欧盟地平线“2020”;
关键词
Protein-protein interactions; Protein interaction networks; Neighborhood approach; MIPS Database; Protein function prediction; Physico-chemical properties; SEQUENCE;
D O I
10.7717/peerj.6830
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Proteins are the most versatile macromolecules in living systems and perform crucial biological functions. In the advent of the post-genomic era, the next generation sequencing is done routinely at the population scale for a variety of species. The challenging problem is to massively determine the functions of proteins that are yet not characterized by detailed experimental studies. Identification of protein functions experimentally is a laborious and time-consuming task involving many resources. We therefore propose the automated protein function prediction methodology using in silico algorithms trained on carefully curated experimental datasets. We present the improved protein function prediction tool FunPred 3.0, an extended version of our previous methodology FunPred 2, which exploits neighborhood properties in protein-protein interaction network (PPIN) and physicochemical properties of amino acids. Our method is validated using the available functional annotations in the PPIN network of Saccharomyces cerevisiae in the latest Munich information center for protein (MIPS) dataset. The PPIN data of S. cerevisiae in MIPS dataset includes 4,554 unique proteins in 13,528 protein-protein interactions after the elimination of the self-replicating and the self-interacting protein pairs. Using the developed FunPred 3.0 tool, we are able to achieve the mean precision, the recall and the F-score values of 0.55, 0.82 and 0.66, respectively. FunPred 3.0 is then used to predict the functions of unpredicted protein pairs (incomplete and missing functional annotations) in MIPS dataset of S. cerevisiae. The method is also capable of predicting the subcellular localization of proteins along with its corresponding functions. The code and the complete prediction results are available freely at: https://github.com/SovanSaha/FunPred-3.0.git.
引用
收藏
页数:26
相关论文
共 50 条
  • [1] FunPred-1: Protein function prediction from a protein interaction network using neighborhood analysis
    Saha, Sovan
    Chatterjee, Piyali
    Basu, Subhadip
    Kundu, Mahantapas
    Nasipuri, Mita
    CELLULAR & MOLECULAR BIOLOGY LETTERS, 2014, 19 (04) : 675 - 691
  • [2] FunPred-1: Protein function prediction from a protein interaction network using neighborhood analysis
    Sovan Saha
    Piyali Chatterjee
    Subhadip Basu
    Mahantapas Kundu
    Mita Nasipuri
    Cellular & Molecular Biology Letters, 2014, 19 : 675 - 691
  • [3] Protein Function Prediction Using Function Associations in Protein-Protein Interaction Network
    Sun, Pingping
    Tan, Xian
    Guo, Sijia
    Zhang, Jingbo
    Sun, Bojian
    Du, Ning
    Wang, Han
    Sun, Hui
    IEEE ACCESS, 2018, 6 : 30892 - 30902
  • [4] Protein function prediction using neighbor relativity in protein-protein interaction network
    Moosavi, Sobhan
    Rahgozar, Masoud
    Rahimi, Amir
    COMPUTATIONAL BIOLOGY AND CHEMISTRY, 2013, 43 : 11 - 16
  • [5] Protein Function Prediction by Clustering of Protein-Protein Interaction Network
    Cingovska, Ivana
    Bogojeska, Aleksandra
    Trivodaliev, Kire
    Kalajdziski, Slobodan
    ICT INNOVATIONS 2011, 2011, 150 : 39 - 49
  • [6] Protein Function Prediction by Spectral Clustering of Protein Interaction Network
    Trivodaliev, Kire
    Cingovska, Ivana
    Kalajdziski, Slobodan
    DATABASE THEORY AND APPLICATION, BIO-SCIENCE AND BIO-TECHNOLOGY, 2011, 258 : 108 - 117
  • [7] Prediction of protein function using protein-protein interaction data
    Deng, MH
    Zhang, K
    Mehta, S
    Chen, T
    Sun, FZ
    CSB2002: IEEE COMPUTER SOCIETY BIOINFORMATICS CONFERENCE, 2002, : 197 - 206
  • [8] Prediction of protein function using protein-protein interaction data
    Deng, MH
    Zhang, K
    Mehta, S
    Chen, T
    Sun, FZ
    JOURNAL OF COMPUTATIONAL BIOLOGY, 2003, 10 (06) : 947 - 960
  • [9] Protein function prediction from dynamic protein interaction network using gene expression data
    Saha, Sovan
    Prasad, Abhimanyu
    Chatterjee, Piyali
    Basu, Subhadip
    Nasipuri, Mita
    JOURNAL OF BIOINFORMATICS AND COMPUTATIONAL BIOLOGY, 2019, 17 (04)
  • [10] Improving prediction of Protein Function from Protein Interaction Network using Intelligent Neighborhood Approach
    Saha, Sovan
    Chatterjee, Piyali
    Basu, Subhadip
    Kundu, Mahantapas
    Nasipuri, Mita
    PROCEEDINGS OF THE 2012 INTERNATIONAL CONFERENCE ON COMMUNICATIONS, DEVICES AND INTELLIGENT SYSTEMS (CODLS), 2012, : 584 - 587