Prediction of Lysine Ubiquitylation with Ensemble Classifier and Feature Selection

被引:47
|
作者
Zhao, Xiaowei [1 ,2 ]
Li, Xiangtao [1 ,2 ]
Ma, Zhiqiang [1 ,2 ]
Yin, Minghao [2 ]
机构
[1] NE Normal Univ, Coll Life Sci, Changchun 130024, Peoples R China
[2] NE Normal Univ, Coll Comp Sci, Changchun 130117, Peoples R China
基金
中国国家自然科学基金;
关键词
ubiquitylation; ensemble classifier; support vector machine; lysine ubiquitylation sites; UBIQUITIN-LIKE PROTEINS; PROTEOMICS APPROACH; INTRINSIC DISORDER; IDENTIFICATION; RELEVANCE; LOCATION;
D O I
10.3390/ijms12128347
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Ubiquitylation is an important process of post-translational modification. Correct identification of protein lysine ubiquitylation sites is of fundamental importance to understand the molecular mechanism of lysine ubiquitylation in biological systems. This paper develops a novel computational method to effectively identify the lysine ubiquitylation sites based on the ensemble approach. In the proposed method, 468 ubiquitylation sites from 323 proteins retrieved from the Swiss-Prot database were encoded into feature vectors by using four kinds of protein sequences information. An effective feature selection method was then applied to extract informative feature subsets. After different feature subsets were obtained by setting different starting points in the search procedure, they were used to train multiple random forests classifiers and then aggregated into a consensus classifier by majority voting. Evaluated by jackknife tests and independent tests respectively, the accuracy of the proposed predictor reached 76.82% for the training dataset and 79.16% for the test dataset, indicating that this predictor is a useful tool to predict lysine ubiquitylation sites. Furthermore, site-specific feature analysis was performed and it was shown that ubiquitylation is intimately correlated with the features of its surrounding sites in addition to features derived from the lysine site itself. The feature selection method is available upon request.
引用
收藏
页码:8347 / 8361
页数:15
相关论文
共 50 条
  • [1] Prediction of subcellular location apoptosis proteins with ensemble classifier and feature selection
    Quan Gu
    Yong-Sheng Ding
    Xiao-Ying Jiang
    Tong-Liang Zhang
    Amino Acids, 2010, 38 : 975 - 983
  • [2] Prediction of subcellular location apoptosis proteins with ensemble classifier and feature selection
    Gu, Quan
    Ding, Yong-Sheng
    Jiang, Xiao-Ying
    Zhang, Tong-Liang
    AMINO ACIDS, 2010, 38 (04) : 975 - 983
  • [3] Classifier ensemble methods in feature selection
    Kiziloz, Hakan Ezgi
    NEUROCOMPUTING, 2021, 419 : 97 - 107
  • [4] An ensemble svm classifier with feature selection
    Hu, Han
    En-en, Ren
    2007 INTERNATIONAL SYMPOSIUM ON COMPUTER SCIENCE & TECHNOLOGY, PROCEEDINGS, 2007, : 6 - 8
  • [5] Feature Selection Inspired Classifier Ensemble Reduction
    Diao, Ren
    Chao, Fei
    Peng, Taoxin
    Snooke, Neal
    Shen, Qiang
    IEEE TRANSACTIONS ON CYBERNETICS, 2014, 44 (08) : 1259 - 1268
  • [6] Ensemble Selection based on Classifier Prediction Confidence
    Tien Thanh Nguyen
    Anh Vu Luong
    Manh Truong Dang
    Liew, Alan Wee-Chung
    McCall, John
    PATTERN RECOGNITION, 2020, 100
  • [7] A Feature Selection Based Serial SVM Ensemble Classifier
    Cao, Jianjun
    Lv, Guojun
    Chang, Chen
    Li, Hongmei
    IEEE ACCESS, 2019, 7 : 144516 - 144523
  • [8] Integrate Classifier Diversity Evaluation to Feature Selection Based Classifier Ensemble Reduction
    Yao, Gang
    Chao, Fei
    Zeng, Hualin
    Shi, Minghui
    Jiang, Min
    Zhou, Changle
    2014 14TH UK WORKSHOP ON COMPUTATIONAL INTELLIGENCE (UKCI), 2014, : 37 - 43
  • [9] Classifier ensemble for mammography CAD system combining feature selection with ensemble learning
    Nemoto, M
    Shimizu, A
    Kobatake, H
    Takeo, H
    Nawano, S
    CARS 2005: Computer Assisted Radiology and Surgery, 2005, 1281 : 1047 - 1051
  • [10] Classifier Ensemble with Relevance-Based Feature Subset Selection
    Zhao, Junyang
    Zhang, Zhili
    Chang, Zhenjun
    Liu, Dianjian
    2017 2ND INTERNATIONAL CONFERENCE ON IMAGE, VISION AND COMPUTING (ICIVC 2017), 2017, : 1137 - 1141