Prediction of Lysine Ubiquitylation with Ensemble Classifier and Feature Selection

被引:47
|
作者
Zhao, Xiaowei [1 ,2 ]
Li, Xiangtao [1 ,2 ]
Ma, Zhiqiang [1 ,2 ]
Yin, Minghao [2 ]
机构
[1] NE Normal Univ, Coll Life Sci, Changchun 130024, Peoples R China
[2] NE Normal Univ, Coll Comp Sci, Changchun 130117, Peoples R China
基金
中国国家自然科学基金;
关键词
ubiquitylation; ensemble classifier; support vector machine; lysine ubiquitylation sites; UBIQUITIN-LIKE PROTEINS; PROTEOMICS APPROACH; INTRINSIC DISORDER; IDENTIFICATION; RELEVANCE; LOCATION;
D O I
10.3390/ijms12128347
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Ubiquitylation is an important process of post-translational modification. Correct identification of protein lysine ubiquitylation sites is of fundamental importance to understand the molecular mechanism of lysine ubiquitylation in biological systems. This paper develops a novel computational method to effectively identify the lysine ubiquitylation sites based on the ensemble approach. In the proposed method, 468 ubiquitylation sites from 323 proteins retrieved from the Swiss-Prot database were encoded into feature vectors by using four kinds of protein sequences information. An effective feature selection method was then applied to extract informative feature subsets. After different feature subsets were obtained by setting different starting points in the search procedure, they were used to train multiple random forests classifiers and then aggregated into a consensus classifier by majority voting. Evaluated by jackknife tests and independent tests respectively, the accuracy of the proposed predictor reached 76.82% for the training dataset and 79.16% for the test dataset, indicating that this predictor is a useful tool to predict lysine ubiquitylation sites. Furthermore, site-specific feature analysis was performed and it was shown that ubiquitylation is intimately correlated with the features of its surrounding sites in addition to features derived from the lysine site itself. The feature selection method is available upon request.
引用
收藏
页码:8347 / 8361
页数:15
相关论文
共 50 条
  • [21] Prediction of lysine ubiquitination with mRMR feature selection and analysis
    Yudong Cai
    Tao Huang
    Lele Hu
    Xiaohe Shi
    Lu Xie
    Yixue Li
    Amino Acids, 2012, 42 : 1387 - 1395
  • [22] A Novel Ensemble Classifier Selection Method for Software Defect Prediction
    Dong, Xin
    Wang, Jie
    Liang, Yan
    IEEE ACCESS, 2025, 13 : 25578 - 25597
  • [23] An Ensemble Classifier Approach on Different Feature Selection Methods for Intrusion Detection
    Vinutha, H. P.
    Poornima, B.
    INFORMATION SYSTEMS DESIGN AND INTELLIGENT APPLICATIONS, INDIA 2017, 2018, 672 : 442 - 451
  • [24] Feature Selection and Ensemble Meta Classifier for Multiclass Imbalance Data Learning
    Sainin, Mohd Shamrie
    Alfred, Rayner
    Alias, Suraya
    Lammasha, Mohamed A. M.
    PROCEEDINGS OF KNOWLEDGE MANAGEMENT INTERNATIONAL CONFERENCE (KMICE) 2018, 2018, : 134 - 139
  • [25] Software Cost Estimation using Stacked Ensemble Classifier and Feature Selection
    Al-Karak, Mustafa Hammad
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2023, 14 (06) : 183 - 189
  • [26] Efficient Twitter Sentiment Analysis System with Feature Selection and Classifier Ensemble
    Fouad, Mohammed M.
    Gharib, Tarek F.
    Mashat, Abdulfattah S.
    INTERNATIONAL CONFERENCE ON ADVANCED MACHINE LEARNING TECHNOLOGIES AND APPLICATIONS (AMLTA2018), 2018, 723 : 516 - 527
  • [27] An Ensemble Classifier Based on Feature Selection Using Ant Colony Optimization
    Cao, Jianjun
    Lv, Guojun
    Shang, Yuling
    Weng, Nianfeng
    Chang, Chen
    Liu, Yi
    2018 IEEE HIGH PERFORMANCE EXTREME COMPUTING CONFERENCE (HPEC), 2018,
  • [28] Improving protein-protein interactions prediction accuracy using XGBoost feature selection and stacked ensemble classifier
    Chen, Cheng
    Zhang, Qingmei
    Yu, Bin
    Yu, Zhaomin
    Lawrence, Patrick J.
    Ma, Qin
    Zhang, Yan
    COMPUTERS IN BIOLOGY AND MEDICINE, 2020, 123
  • [29] A Dropout Prediction Framework Combined with Ensemble Feature Selection
    Ai, Dan
    Zhang, Tiancheng
    Yu, Ge
    Shao, Xinying
    ICIET 2020: 2020 8TH INTERNATIONAL CONFERENCE ON INFORMATION AND EDUCATION TECHNOLOGY, 2020, : 179 - 185
  • [30] Ensemble of LSTMs and Feature Selection for Human Action Prediction
    Petkovic, Tomislav
    Petrovic, Luka
    Markovic, Ivan
    Petrovic, Ivan
    INTELLIGENT AUTONOMOUS SYSTEMS 16, IAS-16, 2022, 412 : 429 - 441