Prediction of Lysine Ubiquitylation with Ensemble Classifier and Feature Selection

Cited by: 47
Authors
Zhao, Xiaowei [1 ,2 ]
Li, Xiangtao [1 ,2 ]
Ma, Zhiqiang [1 ,2 ]
Yin, Minghao [2 ]
Affiliations
[1] NE Normal Univ, Coll Life Sci, Changchun 130024, Peoples R China
[2] NE Normal Univ, Coll Comp Sci, Changchun 130117, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
ubiquitylation; ensemble classifier; support vector machine; lysine ubiquitylation sites; UBIQUITIN-LIKE PROTEINS; PROTEOMICS APPROACH; INTRINSIC DISORDER; IDENTIFICATION; RELEVANCE; LOCATION;
DOI
10.3390/ijms12128347
Chinese Library Classification
Q5 [Biochemistry]; Q7 [Molecular Biology]
Subject classification codes
071010; 081704
Abstract
Ubiquitylation is an important post-translational modification. Correct identification of protein lysine ubiquitylation sites is fundamental to understanding the molecular mechanism of lysine ubiquitylation in biological systems. This paper develops a novel computational method, based on an ensemble approach, to identify lysine ubiquitylation sites. In the proposed method, 468 ubiquitylation sites from 323 proteins retrieved from the Swiss-Prot database were encoded into feature vectors using four kinds of protein sequence information. A feature selection method was then applied to extract informative feature subsets. Different feature subsets were obtained by setting different starting points in the search procedure; each subset was used to train a random forest classifier, and the resulting classifiers were aggregated into a consensus classifier by majority voting. Evaluated by jackknife tests and independent tests, respectively, the proposed predictor reached an accuracy of 76.82% on the training dataset and 79.16% on the test dataset, indicating that it is a useful tool for predicting lysine ubiquitylation sites. Furthermore, site-specific feature analysis showed that ubiquitylation is closely correlated with the features of the surrounding sites, in addition to features derived from the lysine site itself. The feature selection method is available upon request.
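The consensus step described in the abstract (several classifiers, each trained on its own feature subset, combined by majority voting) can be illustrated with a minimal Python sketch. This is not the authors' implementation: the member classifiers below are toy threshold rules standing in for trained random forests, and the feature indices, the `threshold_rule` helper, and the tie-breaking choice are all assumptions made for illustration.

```python
from collections import Counter
from typing import Callable, List, Sequence, Tuple

# A classifier is any callable mapping a feature vector to a label
# (1 = predicted ubiquitylation site, 0 = not a site).
Classifier = Callable[[Sequence[float]], int]


def majority_vote(votes: Sequence[int]) -> int:
    """Return the most frequent label among the votes.

    Ties are resolved in favour of the smallest label (0 here);
    this tie rule is an assumption, not taken from the paper.
    """
    counts = Counter(votes)
    top = max(counts.values())
    return min(label for label, count in counts.items() if count == top)


def ensemble_predict(
    members: List[Tuple[Sequence[int], Classifier]],
    x: Sequence[float],
) -> int:
    """Each member sees only its own feature subset, then the votes are pooled."""
    votes = [clf([x[i] for i in subset]) for subset, clf in members]
    return majority_vote(votes)


def threshold_rule(cutoff: float) -> Classifier:
    """Hypothetical stand-in for a trained classifier: predict 1 if the mean feature exceeds cutoff."""
    return lambda feats: 1 if sum(feats) / len(feats) > cutoff else 0


# Three members, each tied to a different (made-up) feature subset.
members = [
    ((0, 1), threshold_rule(0.5)),
    ((1, 2), threshold_rule(0.5)),
    ((0, 3), threshold_rule(0.5)),
]
sample = [0.9, 0.8, 0.1, 0.2]
prediction = ensemble_predict(members, sample)
```

For the sample above, the members vote 1, 0, and 1, so the consensus label is 1. In practice each `threshold_rule` would be replaced by a classifier trained on the corresponding feature subset.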
Pages: 8347-8361
Page count: 15
Related papers
50 in total
  • [31] Student performance prediction with BPSO feature selection and CNN classifier
    Begum, Safira
    Padmannavar, Sunita S.
    INTERNATIONAL JOURNAL OF ADVANCED AND APPLIED SCIENCES, 2022, 9 (11): : 84 - 92
  • [32] Ensemble classifier based big data classification with hybrid optimal feature selection
    Pamila, J. C. Miraclin Joyce
    Selvi, R. Senthamil
    Santhi, P.
    Nithya, T. M.
    ADVANCES IN ENGINEERING SOFTWARE, 2022, 173
  • [33] Forest optimization algorithm-based feature selection using classifier ensemble
    Moorthy, Usha
    Gandhi, Usha Devi
    COMPUTATIONAL INTELLIGENCE, 2020, 36 (04) : 1445 - 1462
  • [34] A Class Centric Feature and Classifier Ensemble Selection Approach for Music Genre Classification
    Ariyaratne, Hasitha Bimsara
    Zhang, Dengsheng
    Lu, Guojun
    STRUCTURAL, SYNTACTIC, AND STATISTICAL PATTERN RECOGNITION, 2012, 7626 : 666 - 674
  • [35] An Experimental evaluation of Feature selection based Classifier Ensemble for Handwritten Numeral Recognition
    Singh, Pratibha
    Verma, Ajay
    Chaudhari, Narendra S.
    2014 INTERNATIONAL CONFERENCE ON ELECTRONICS AND COMMUNICATION SYSTEMS (ICECS), 2014,
  • [36] Building an efficient intrusion detection system based on feature selection and ensemble classifier
    Zhou, Yuyang
    Cheng, Guang
    Jiang, Shanqing
    Dai, Mian
    COMPUTER NETWORKS, 2020, 174
  • [37] Multiobjective optimization for classifier ensemble and feature selection: an application to named entity recognition
    Ekbal, Asif
    Saha, Sriparna
    INTERNATIONAL JOURNAL ON DOCUMENT ANALYSIS AND RECOGNITION, 2012, 15 (02) : 143 - 166
  • [38] Hybrid ensemble techniques used for classifier and feature selection in intrusion detection systems
    Kharwar, Ankit
    Thakor, Devendra
    INTERNATIONAL JOURNAL OF COMMUNICATION NETWORKS AND DISTRIBUTED SYSTEMS, 2022, 28 (04) : 389 - 413
  • [39] Classifier ensemble design using artificial bee colony based feature selection
    Palanisamy, Shunmugapriya
    Kanmani, S.
    International Journal of Computer Science Issues, 2012, 9 (3 3-2): : 522 - 529
  • [40] Feature selection model for healthcare analysis and classification using classifier ensemble technique
    Nagarajan, Senthil Murugan
    Muthukumaran, V.
    Murugesan, R.
    Joseph, Rose Bindu
    Munirathanam, Meram
    INTERNATIONAL JOURNAL OF SYSTEM ASSURANCE ENGINEERING AND MANAGEMENT, 2021,