A Feature Selection Method for Prediction Essential Protein

被引:31
|
作者
Zhong, Jiancheng [1 ,2 ]
Wang, Jianxin [1 ]
Peng, Wei [3 ]
Zhang, Zhen [1 ]
Li, Min [1 ]
机构
[1] Cent S Univ, Sch Informat Sci & Engn, Changsha 410083, Peoples R China
[2] Hunan Normal Univ, Coll Polytech, Changsha 410083, Peoples R China
[3] Kunming Univ Sci & Technol, Ctr Comp, Kunming 650093, Peoples R China
基金
中国国家自然科学基金;
关键词
essential protein; feature selection; Protein-Protein Interaction (PPI); machine learning; centrality algorithm; ESSENTIAL GENES; SACCHAROMYCES-CEREVISIAE; IDENTIFICATION; CENTRALITY; NETWORKS; LOCALIZATION; INTEGRATION; ORTHOLOGY; IDENTIFY; DATABASE;
D O I
10.1109/TST.2015.7297748
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Essential proteins are vital to the survival of a cell. There are various features related to the essentiality of proteins, such as biological and topological features. Many computational methods have been developed to identify essential proteins by using these features. However, it is still a big challenge to design an effective method that is able to select suitable features and integrate them to predict essential proteins. In this work, we first collect 26 features, and use SVM-RFE to select some of them to create a feature space for predicting essential proteins, and then remove the features that share the biological meaning with other features in the feature space according to their Pearson Correlation Coefficients (PCC). The experiments are carried out on S. cerevisiae data. Six features are determined as the best subset of features. To assess the prediction performance of our method, we further compare it with some machine learning methods, such as SVM, Naive Bayes, Bayes Network, and NBTree when inputting the different number of features. The results show that those methods using the 6 features outperform that using other features, which confirms the effectiveness of our feature selection method for essential protein prediction.
引用
收藏
页码:491 / 499
页数:9
相关论文
共 50 条
  • [11] Prediction of protein amidation sites by feature selection and analysis
    Cui, Weiren
    Niu, Shen
    Zheng, Lulu
    Hu, Lele
    Huang, Tao
    Gu, Lei
    Feng, Kaiyan
    Zhang, Ning
    Cai, Yudong
    Li, Yixue
    MOLECULAR GENETICS AND GENOMICS, 2013, 288 (09) : 391 - 400
  • [12] Prediction of Protein Subcellular Locations with Feature Selection and Analysis
    Cai, Yudong
    He, Jianfeng
    Li, Xinlei
    Feng, Kaiyan
    Lu, Lin
    Feng, Kairui
    Kong, Xiangyin
    Lu, Wencong
    PROTEIN AND PEPTIDE LETTERS, 2010, 17 (04): : 464 - 472
  • [13] Prediction of Protein Domain with mRMR Feature Selection and Analysis
    Li, Bi-Qing
    Hu, Le-Le
    Chen, Lei
    Feng, Kai-Yan
    Cai, Yu-Dong
    Chou, Kuo-Chen
    PLOS ONE, 2012, 7 (06):
  • [14] Feature selection with genetic algorithm for protein function prediction
    Santos, Bruno C.
    Rodrigues, Marcos W.
    Pinto, Cristiano L. N.
    Nobre, Cristiane N.
    Zarate, Luis E.
    2019 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN AND CYBERNETICS (SMC), 2019, : 2434 - 2439
  • [15] LncRNA–protein interaction prediction with reweighted feature selection
    Guohao Lv
    Yingchun Xia
    Zhao Qi
    Zihao Zhao
    Lianggui Tang
    Cheng Chen
    Shuai Yang
    Qingyong Wang
    Lichuan Gu
    BMC Bioinformatics, 24
  • [16] Prediction of protein amidation sites by feature selection and analysis
    Weiren Cui
    Shen Niu
    Lulu Zheng
    Lele Hu
    Tao Huang
    Lei Gu
    Kaiyan Feng
    Ning Zhang
    Yudong Cai
    Yixue Li
    Molecular Genetics and Genomics, 2013, 288 : 391 - 400
  • [17] Prediction of Essential Proteins Using Genetic Algorithm as a Feature Selection Technique
    Inzamam-Ul-Hossain, Md.
    Islam, Md. Rafiqul
    IEEE ACCESS, 2024, 12 : 126200 - 126220
  • [18] A Hybrid Feature Selection Method for Software Fault Prediction
    Jian, Yiheng
    Yu, Xiao
    Xu, Zhou
    Ma, Ziyi
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2019, E102D (10) : 1966 - 1975
  • [19] Prediction of protein structural classes based on feature selection technique
    Hui Ding
    Hao Lin
    Wei Chen
    Zi-Qiang Li
    Feng-Biao Guo
    Jian Huang
    Nini Rao
    Interdisciplinary Sciences: Computational Life Sciences, 2014, 6 : 235 - 240
  • [20] Integrative approaches to the prediction of protein functions based on the feature selection
    Ko, Seokha
    Lee, Hyunju
    BMC BIOINFORMATICS, 2009, 10