Prediction of protein subcellular localization by support vector machines using multi-scale energy and pseudo amino acid composition

Cited by: 121
Authors
Shi, J.-Y. [1 ]
Zhang, S.-W. [1 ]
Pan, Q. [1 ]
Cheng, Y.-M. [1 ]
Xie, J. [1 ]
Affiliations
[1] Northwestern Polytech Univ, Coll Automat, Xian 710072, Peoples R China
Keywords
multi-scale energy; Wavelet transform; support vector machines; Chou's pseudo amino acid composition; protein subcellular localizations;
DOI
10.1007/s00726-006-0475-y
Chinese Library Classification (CLC) number
Q5 [Biochemistry]; Q7 [Molecular Biology];
Subject classification code
071010 ; 081704 ;
Abstract
As more and more genomes have been discovered in recent years, there is an urgent need to develop a reliable method to predict the subcellular localization for the explosion of newly found proteins. However, many well-known prediction methods based on amino acid composition have problems utilizing the sequence-order information. Here, based on the concept of Chou's pseudo amino acid composition (PseAA), a new feature extraction method, the multi-scale energy (MSE) approach, is introduced to incorporate the sequence-order information. First, a protein sequence was mapped to a digital signal using the amino acid index. Then, by wavelet transform, the mapped signal was broken down into several scales in which the energy factors were calculated and further formed into an MSE feature vector. Following this, combining this MSE feature vector with amino acid composition (AA), we constructed a series of MSEPseAA feature vectors to represent the protein subcellular localization sequences. Finally, according to a new kind of normalization approach, the MSEPseAA feature vectors were normalized to form the improved MSEPseAA vectors, named IEPseAA. Using the technique of IEPseAA, C-support vector machine (C-SVM) and three multi-class SVM strategies, quite promising results were obtained, indicating that MSE is quite effective in reflecting the sequence-order effects and might become a useful tool for predicting other attributes of proteins as well.
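The feature-extraction steps described in the abstract (sequence to signal, wavelet decomposition, per-scale energy, concatenation with amino acid composition) can be illustrated roughly as follows. This is only a sketch under stated assumptions, not the authors' implementation: the abstract does not name the amino acid index, the wavelet, the number of scales, the exact energy formula, or the normalization, so the Kyte-Doolittle hydropathy scale, a Haar wavelet, three levels, and mean squared coefficients are used here as stand-ins. All function names are hypothetical, and the normalization step that yields IEPseAA is omitted.

```python
import math

# ASSUMPTION: Kyte-Doolittle hydropathy is used as the amino acid index.
# The abstract does not say which index the authors used.
HYDROPATHY = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}
AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"


def to_signal(seq):
    """Map each standard residue to its index value; other symbols are skipped."""
    return [HYDROPATHY[r] for r in seq.upper() if r in HYDROPATHY]


def haar_step(signal):
    """One level of the orthonormal Haar transform: (approximation, detail)."""
    if len(signal) % 2:  # pad odd-length input by repeating the last sample
        signal = signal + [signal[-1]]
    s = math.sqrt(2.0)
    approx = [(signal[i] + signal[i + 1]) / s for i in range(0, len(signal), 2)]
    detail = [(signal[i] - signal[i + 1]) / s for i in range(0, len(signal), 2)]
    return approx, detail


def multi_scale_energy(signal, levels=3):
    """ASSUMPTION: energy = mean squared coefficient at each scale.

    Returns `levels` detail energies followed by one approximation energy.
    """
    if not signal:
        raise ValueError("signal is empty")
    energies = []
    approx = list(signal)
    for _ in range(levels):
        approx, detail = haar_step(approx)
        energies.append(sum(d * d for d in detail) / len(detail))
    energies.append(sum(a * a for a in approx) / len(approx))
    return energies


def amino_acid_composition(seq):
    """Fraction of each of the 20 standard amino acids in the sequence."""
    residues = [r for r in seq.upper() if r in HYDROPATHY]
    if not residues:
        raise ValueError("sequence has no standard residues")
    return [residues.count(a) / len(residues) for a in AMINO_ACIDS]


def mse_pseaa(seq, levels=3):
    """Concatenate composition (20 values) with the MSE vector (levels + 1 values)."""
    return amino_acid_composition(seq) + multi_scale_energy(to_signal(seq), levels)
```

For example, `mse_pseaa("MKTAYIAKQRQISFVKSHFSRQ")` returns a list of 24 numbers: 20 composition fractions followed by 4 energy terms. Such a vector could then be passed to any SVM implementation; the classifier settings and multi-class strategies used in the paper are not reproduced here.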
Pages: 69-74
Page count: 6
Related papers
50 in total
  • [31] Prediction of apoptosis protein subcellular location using improved hybrid approach and pseudo-amino acid composition
    Chen, Ying-Li
    Li, Qian-Zhong
    JOURNAL OF THEORETICAL BIOLOGY, 2007, 248 (02) : 377 - 381
  • [32] Using pseudo-amino acid composition and support vector machine to predict protein structural class
    Chen, Chao
    Tian, Yuan-Xin
    Zou, Xiao-Yong
    Cai, Pei-Xiang
    Mo, Jin-Yuan
    JOURNAL OF THEORETICAL BIOLOGY, 2006, 243 (03) : 444 - 448
  • [33] Prediction of Protein Subcellular Multi-localization by Using a Min-Max Modular Support Vector Machine
    Yang, Yang
    Lu, Bao-Liang
    ADVANCES IN COMPUTATIONAL INTELLIGENCE, 2009, 61 : 133 - 143
  • [34] PROTEIN SUBCELLULAR MULTI-LOCALIZATION PREDICTION USING A MIN-MAX MODULAR SUPPORT VECTOR MACHINE
    Yang, Yang
    Lu, Bao-Liang
    INTERNATIONAL JOURNAL OF NEURAL SYSTEMS, 2010, 20 (01) : 13 - 28
  • [35] Protein subcellular location prediction based on pseudo amino acid composition and immune genetic algorithm
    Zhang, Tongliang
    Ding, Yongsheng
    Shao, Shihuang
    COMPUTATIONAL INTELLIGENCE AND BIOINFORMATICS, PT 3, PROCEEDINGS, 2006, 4115 : 534 - 542
  • [36] Amino acid features for prediction of protein-protein interface residues with Support Vector Machines
    Nguyen, Minh N.
    Rajapakse, Jagath C.
    Duan, Kai-Bo
    EVOLUTIONARY COMPUTATION, MACHINE LEARNING AND DATA MINING IN BIOINFORMATICS, PROCEEDINGS, 2007, 4447 : 187 - +
  • [37] mGOASVM: Multi-label protein subcellular localization based on gene ontology and support vector machines
    Wan, Shibiao
    Mak, Man-Wai
    Kung, Sun-Yuan
    BMC BIOINFORMATICS, 2012, 13
  • [38] mGOASVM: Multi-label protein subcellular localization based on gene ontology and support vector machines
    Shibiao Wan
    Man-Wai Mak
    Sun-Yuan Kung
    BMC Bioinformatics, 13
  • [39] A novel representation for apoptosis protein subcellular localization prediction using support vector machine
    Zhang, Li
    Liao, Bo
    Li, Dachao
    Zhu, Wen
    JOURNAL OF THEORETICAL BIOLOGY, 2009, 259 (02) : 361 - 365
  • [40] Predicting subcellular localization of mycobacterial proteins by using Chou's pseudo amino acid composition
    Lin, Hao
    Ding, Hui
    Guo, Feng-Biao
    Zhang, An-Ying
    Huang, Jian
    PROTEIN AND PEPTIDE LETTERS, 2008, 15 (07) : 739 - 744