An ensemble of reduced alphabets with protein encoding based on grouped weight for predicting DNA-binding proteins

被引:24
|
作者
Nanni, Loris [1 ]
Lumini, Alessandra [1 ]
机构
[1] Univ Bologna, DEIS, CNR, IEIIT, I-40136 Bologna, Italy
关键词
Multi-classifier; Amino-acid alphabets; Support vector machine; DNA-binding proteins; Ensemble classifier; AMINO-ACID-COMPOSITION; SUPPORT VECTOR MACHINE; SUBCELLULAR LOCATION PREDICTION; STRUCTURAL CLASS PREDICTION; COMPLEXITY MEASURE FACTOR; ENZYME SUBFAMILY CLASSES; IMPROVED HYBRID APPROACH; WEB-SERVER; CELLULAR-AUTOMATA; SUBNUCLEAR LOCALIZATION;
D O I
10.1007/s00726-008-0044-7
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
It is well known in the literature that an ensemble of classifiers obtains good performance with respect to that obtained by a stand-alone method. Hence, it is very important to develop ensemble methods well suited for bioinformatics data. In this work, we propose to combine the feature extraction method based on grouped weight with a set of amino-acid alphabets obtained by a Genetic Algorithm. The proposed method is applied for predicting DNA-binding proteins. As classifiers, the linear support vector machine and the radial basis function support vector machine are tested. As performance indicators, the accuracy and Matthews's correlation coefficient are reported. Matthews's correlation coefficient obtained by our ensemble method is a parts per thousand 0.97 when the jackknife cross-validation is used. This result outperforms the performance obtained in the literature using the same dataset where the features are extracted directly from the amino-acid sequence.
引用
收藏
页码:167 / 175
页数:9
相关论文
共 50 条
  • [1] An ensemble of reduced alphabets with protein encoding based on grouped weight for predicting DNA-binding proteins
    Loris Nanni
    Alessandra Lumini
    Amino Acids, 2009, 36 : 167 - 175
  • [2] Predicting Target DNA Sequences of DNA-Binding Proteins Based on Unbound Structures
    Chen, Chien-Yu
    Chien, Ting-Ying
    Lin, Chih-Kang
    Lin, Chih-Wei
    Weng, Yi-Zhong
    Chang, Darby Tien-Hao
    PLOS ONE, 2012, 7 (02):
  • [3] StackPDB: Predicting DNA-binding proteins based on XGB-RFE feature optimization and stacked ensemble classifier
    Zhang, Qingmei
    Liu, Peishun
    Wang, Xue
    Zhang, Yaqun
    Han, Yu
    Yu, Bin
    APPLIED SOFT COMPUTING, 2021, 99
  • [4] StackDPP: a stacking ensemble based DNA-binding protein prediction model
    Ahmed, Sheikh Hasib
    Bose, Dibyendu Brinto
    Khandoker, Rafi
    Rahman, M. Saifur
    BMC BIOINFORMATICS, 2024, 25 (01)
  • [5] StackDPP: a stacking ensemble based DNA-binding protein prediction model
    Sheikh Hasib Ahmed
    Dibyendu Brinto Bose
    Rafi Khandoker
    M Saifur Rahman
    BMC Bioinformatics, 25
  • [6] Predicting Functional Interactions Among DNA-Binding Proteins
    Khushi, Matloob
    Choudhury, Nazim
    Arthur, Jonathan W.
    Clarke, Christine L.
    Graham, J. Dinny
    NEURAL INFORMATION PROCESSING (ICONIP 2018), PT V, 2018, 11305 : 70 - 80
  • [7] Robust ensemble of handcrafted and learned approaches for DNA-binding proteins
    Nanni, Loris
    Brahnam, Sheryl
    APPLIED COMPUTING AND INFORMATICS, 2025, 21 (1/2) : 37 - 52
  • [8] DETECTION OF DNA-BINDING PROTEINS BY PROTEIN BLOTTING
    BOWEN, B
    STEINBERG, J
    LAEMMLI, UK
    WEINTRAUB, H
    NUCLEIC ACIDS RESEARCH, 1980, 8 (01) : 1 - 20
  • [9] Sequence-based prediction of DNA-binding sites on DNA-binding proteins
    Gou, Z.
    Hwang, S.
    Kuznetsov, B., I
    PROCEEDINGS OF THE FIFTH INTERNATIONAL CONFERENCE ON BIOINFORMATICS OF GENOME REGULATION AND STRUCTURE, VOL 1, 2006, : 268 - +
  • [10] Kernel-based machine learning protocol for predicting DNA-binding proteins
    Bhardwaj, N
    Langlois, RE
    Zhao, GJ
    Lu, H
    NUCLEIC ACIDS RESEARCH, 2005, 33 (20) : 6486 - 6493