Protein Function Prediction Using Kernal Logistic Regresssion with ROC Curves

被引:0
|
作者
Liu, Jingwei [1 ]
Qian, Minping [2 ]
机构
[1] Beihang Univ, Sch Math & Syst Sci, LMIB, Minist Educ, Beijing 100191, Peoples R China
[2] Peking Univ, Ctr Theoret Biol, Sch Math Sci, LMAM, Beijing 100871, Peoples R China
来源
关键词
protein-protein interaction; logistic regression; kernel logistic regression; receiver operating characteristic; optimal operating point; NETWORK;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
To avoid the "over-fitting" problem in protein function prediction based on protein-protein interactions (PPI), we propose a pattern recognition strategy that all the features of PPI observation data are divided into three sets, training set, learning set and testing set. The employed classifiers are trained on training sets, the receiver operating characteristic (ROC) curve and optimal operating point (OOP) is calculated on learning set, and the accuracy rate is reported on the testing set with OOP. Under this framework, we compare the performances of logistic regression (LR) model with kernel logistic regression (KLR) model on two different feature selection sets, 1-order feature and 2-order feature according to PPI data. The experiment results on a standard PPI data show that KLR model performs better than I-R model on training sets of both 1-order feature set and 2-order feature set, and the 2-order feature outperforms 1-order feature set with KLR model on training set. The predictive rates on testing set of both 1-order feature and 2-order feature with LR and KLR can achieve 95%.
引用
收藏
页码:491 / +
页数:3
相关论文
共 50 条
  • [31] Automatic Misclassification Rejection for LDA Classifier using ROC Curves
    Menon, Radhika
    Di Caterina, Gaetano
    Lakany, Heba
    Petropoulakis, Lykourgos
    Conway, Bernard A.
    Soraghan, John J.
    2015 37TH ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY (EMBC), 2015, : 482 - 485
  • [32] Image thresholding of historical documents using entropy and ROC curves
    Mello, CAB
    Costa, AHM
    PROGRESS IN PATTERN RECOGNITION, IMAGE ANALYSIS AND APPLICATIONS, PROCEEDINGS, 2005, 3773 : 905 - 916
  • [33] COMPARISON OF THE EFFICACY OF THE NEWER COAGULATION TESTS USING ROC CURVES
    CEMBROWSKI, GS
    MOSHER, DF
    GARBER, CC
    GRIFFIN, JH
    CLINICAL BIOCHEMISTRY, 1983, 16 (02) : 114 - 114
  • [34] PERFORMANCE EVALUATION OF MEDICAL EXPERT SYSTEMS USING ROC CURVES
    ADLASSNIG, KP
    SCHEITHAUER, W
    COMPUTERS AND BIOMEDICAL RESEARCH, 1989, 22 (04): : 297 - 313
  • [35] IMPROVEMENT BLOOD TESTING STRATEGY OF HBSAG BY USING OF ROC CURVES
    Li, Z.
    Ge, H. W.
    VOX SANGUINIS, 2012, 103 : 168 - 168
  • [36] Finding software metrics threshold values using ROC curves
    Shatnawi, Raed
    Li, Wei
    Swain, James
    Newman, Tim
    JOURNAL OF SOFTWARE MAINTENANCE AND EVOLUTION-RESEARCH AND PRACTICE, 2010, 22 (01): : 1 - 16
  • [37] Evaluation of Distribution Fault Diagnosis Algorithms using ROC Curves
    Cai, Yixin
    Chow, Mo-Yuen
    Lu, Wenbin
    Li, Lexin
    IEEE POWER AND ENERGY SOCIETY GENERAL MEETING 2010, 2010,
  • [38] Protein Function Prediction Using Decision Trees
    Yedida, Venkata Rama Kumar Swamy
    Chan, Chien-Chung
    Duan, Zhong-Hui
    2008 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE WORKSHOPS, PROCEEDINGS, 2008, : 193 - 199
  • [39] Protein function prediction using domain families
    Rentzsch, Robert
    Orengo, Christine A.
    BMC BIOINFORMATICS, 2013, 14
  • [40] Protein Function Prediction Using ProMOL and PyMOL
    Hart, Kaitlin
    McKay, Talia
    Tedla-Boyd, Weinishet
    Mills, Jeffrey
    Bernstein, Herbert
    Craig, Paul
    FASEB JOURNAL, 2015, 29