Protein Function Prediction Using Kernal Logistic Regresssion with ROC Curves

被引:0
|
作者
Liu, Jingwei [1 ]
Qian, Minping [2 ]
机构
[1] Beihang Univ, Sch Math & Syst Sci, LMIB, Minist Educ, Beijing 100191, Peoples R China
[2] Peking Univ, Ctr Theoret Biol, Sch Math Sci, LMAM, Beijing 100871, Peoples R China
来源
关键词
protein-protein interaction; logistic regression; kernel logistic regression; receiver operating characteristic; optimal operating point; NETWORK;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
To avoid the "over-fitting" problem in protein function prediction based on protein-protein interactions (PPI), we propose a pattern recognition strategy that all the features of PPI observation data are divided into three sets, training set, learning set and testing set. The employed classifiers are trained on training sets, the receiver operating characteristic (ROC) curve and optimal operating point (OOP) is calculated on learning set, and the accuracy rate is reported on the testing set with OOP. Under this framework, we compare the performances of logistic regression (LR) model with kernel logistic regression (KLR) model on two different feature selection sets, 1-order feature and 2-order feature according to PPI data. The experiment results on a standard PPI data show that KLR model performs better than I-R model on training sets of both 1-order feature set and 2-order feature set, and the 2-order feature outperforms 1-order feature set with KLR model on training set. The predictive rates on testing set of both 1-order feature and 2-order feature with LR and KLR can achieve 95%.
引用
收藏
页码:491 / +
页数:3
相关论文
共 50 条
  • [41] Protein function prediction using domain families
    Robert Rentzsch
    Christine A Orengo
    BMC Bioinformatics, 14
  • [42] The Prediction Value of Left Ventricular Systolic Function by the Numbers of Noncampacted Segments in Patients with Left Ventricular Noncompaction by ROC Curves Analysis
    Li, Y. L.
    Xie, M. X.
    Lv, Q.
    Li, L.
    Yang, Y. L.
    He, L.
    Fang, L. Y.
    Lu, X. F.
    Li, Y.
    Wang, Q.
    CARDIOLOGY, 2010, 117 : 105 - 106
  • [43] Protein function prediction using ProMOL and PyMOL
    McKay, Talia
    Hart, Kaitlin
    Bernstein, Herbert
    Tedla-Boyd, Weinishet
    Craig, Paul
    FASEB JOURNAL, 2014, 28 (01):
  • [44] A Gene Prediction Function for Type 2 Diabetes Mellitus using Logistic Regression
    Alshamlan, Hala
    Bin Taleb, Hind
    Al Sahow, Areej
    2020 11TH INTERNATIONAL CONFERENCE ON INFORMATION AND COMMUNICATION SYSTEMS (ICICS), 2020, : 038 - 041
  • [45] Prediction of protein function using protein-protein interaction data
    Deng, MH
    Zhang, K
    Mehta, S
    Chen, T
    Sun, FZ
    CSB2002: IEEE COMPUTER SOCIETY BIOINFORMATICS CONFERENCE, 2002, : 197 - 206
  • [46] Prediction of protein function using protein-protein interaction data
    Deng, MH
    Zhang, K
    Mehta, S
    Chen, T
    Sun, FZ
    JOURNAL OF COMPUTATIONAL BIOLOGY, 2003, 10 (06) : 947 - 960
  • [47] Prediction of human-Streptococcus pneumoniae protein-protein interactions using logistic regression
    Prasasty, Vivitri Dewi
    Hutagalung, Rory Anthony
    Gunadi, Reinhart
    Sofia, Dewi Yustika
    Rosmalena, Rosmalena
    Yazid, Fatmawaty
    Sinaga, Ernawati
    COMPUTATIONAL BIOLOGY AND CHEMISTRY, 2021, 92
  • [48] Protein function prediction using the Protein Link EXplorer (PLEX)
    Date, SV
    Marcotte, EM
    BIOINFORMATICS, 2005, 21 (10) : 2558 - 2559
  • [49] Modeling compression curves of reconstituted clays based on logistic function
    Wang, Heng
    Zeng, Ling-Ling
    Hong, Zhen-Shun
    MARINE GEORESOURCES & GEOTECHNOLOGY, 2024, 42 (09) : 1293 - 1299
  • [50] Prediction of breakthrough curves for multicomponent adsorption in a fixed-bed column using logistic and Gompertz functions
    Hu, Qili
    Wang, Dan
    Pang, Shuyue
    Xu, Li
    ARABIAN JOURNAL OF CHEMISTRY, 2022, 15 (09)