Prediction of Lysine Ubiquitylation with Ensemble Classifier and Feature Selection

被引:47
|
作者
Zhao, Xiaowei [1 ,2 ]
Li, Xiangtao [1 ,2 ]
Ma, Zhiqiang [1 ,2 ]
Yin, Minghao [2 ]
机构
[1] NE Normal Univ, Coll Life Sci, Changchun 130024, Peoples R China
[2] NE Normal Univ, Coll Comp Sci, Changchun 130117, Peoples R China
基金
中国国家自然科学基金;
关键词
ubiquitylation; ensemble classifier; support vector machine; lysine ubiquitylation sites; UBIQUITIN-LIKE PROTEINS; PROTEOMICS APPROACH; INTRINSIC DISORDER; IDENTIFICATION; RELEVANCE; LOCATION;
D O I
10.3390/ijms12128347
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Ubiquitylation is an important process of post-translational modification. Correct identification of protein lysine ubiquitylation sites is of fundamental importance to understand the molecular mechanism of lysine ubiquitylation in biological systems. This paper develops a novel computational method to effectively identify the lysine ubiquitylation sites based on the ensemble approach. In the proposed method, 468 ubiquitylation sites from 323 proteins retrieved from the Swiss-Prot database were encoded into feature vectors by using four kinds of protein sequences information. An effective feature selection method was then applied to extract informative feature subsets. After different feature subsets were obtained by setting different starting points in the search procedure, they were used to train multiple random forests classifiers and then aggregated into a consensus classifier by majority voting. Evaluated by jackknife tests and independent tests respectively, the accuracy of the proposed predictor reached 76.82% for the training dataset and 79.16% for the test dataset, indicating that this predictor is a useful tool to predict lysine ubiquitylation sites. Furthermore, site-specific feature analysis was performed and it was shown that ubiquitylation is intimately correlated with the features of its surrounding sites in addition to features derived from the lysine site itself. The feature selection method is available upon request.
引用
收藏
页码:8347 / 8361
页数:15
相关论文
共 50 条
  • [11] Sentiment classification using hybrid feature selection and ensemble classifier
    Jain, Achin
    Jain, Vanita
    JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2022, 42 (02) : 659 - 668
  • [12] MODE: multiobjective differential evolution for feature selection and classifier ensemble
    Utpal Kumar Sikdar
    Asif Ekbal
    Sriparna Saha
    Soft Computing, 2015, 19 : 3529 - 3549
  • [13] Multi-layer Heterogeneous Ensemble with Classifier and Feature Selection
    Tien Thanh Nguyen
    Nang Van Pham
    Manh Truong Dang
    Anh Vu Luong
    McCall, John
    Liew, Alan Wee Chung
    GECCO'20: PROCEEDINGS OF THE 2020 GENETIC AND EVOLUTIONARY COMPUTATION CONFERENCE, 2020, : 725 - 733
  • [14] Feature Selection SDA Method in Ensemble Nearest Neighbor Classifier
    Alimardani, Fateme
    Boostani, Reza
    Ansari, Ebrahim
    ADVANCES IN COMPUTER SCIENCE AND ENGINEERING, 2008, 6 : 884 - 887
  • [15] MODE: multiobjective differential evolution for feature selection and classifier ensemble
    Sikdar, Utpal Kumar
    Ekbal, Asif
    Saha, Sriparna
    SOFT COMPUTING, 2015, 19 (12) : 3529 - 3549
  • [16] Detection for JPEG steganography based on evolutionary feature selection and classifier ensemble selection
    Ma, Xiaofeng
    Zhang, Yi
    Song, Xiangfeng
    Fan, Chao
    KSII TRANSACTIONS ON INTERNET AND INFORMATION SYSTEMS, 2017, 11 (11): : 5592 - 5609
  • [17] Integration of classifier diversity measures for feature selection-based classifier ensemble reduction
    Gang Yao
    Hualin Zeng
    Fei Chao
    Chang Su
    Chih-Min Lin
    Changle Zhou
    Soft Computing, 2016, 20 : 2995 - 3005
  • [18] An Ensemble Feature Selection Method for Prediction of CKD
    Manonmani, M.
    Balakrishnan, Sarojini
    2020 INTERNATIONAL CONFERENCE ON COMPUTER COMMUNICATION AND INFORMATICS (ICCCI - 2020), 2020, : 667 - 672
  • [19] Integration of classifier diversity measures for feature selection-based classifier ensemble reduction
    Yao, Gang
    Zeng, Hualin
    Chao, Fei
    Su, Chang
    Lin, Chih-Min
    Zhou, Changle
    SOFT COMPUTING, 2016, 20 (08) : 2995 - 3005
  • [20] Prediction of lysine ubiquitination with mRMR feature selection and analysis
    Cai, Yudong
    Huang, Tao
    Hu, Lele
    Shi, Xiaohe
    Xie, Lu
    Li, Yixue
    AMINO ACIDS, 2012, 42 (04) : 1387 - 1395