Predicting protein secondary structure using a mixed-modal SVM method in a compound pyramid model

被引:32
|
作者
Yang, Bingru [1 ]
Wu, Qu [1 ]
Ying, Zhou [1 ]
Sui, Haifeng [1 ]
机构
[1] Univ Sci & Technol Beijing, Sch Informat Engn, Beijing 100083, Peoples R China
基金
中国国家自然科学基金;
关键词
Protein secondary structure prediction; Physicochemical properties; Mixed-modal SVM; Compound pyramid model; FOLD-RECOGNITION; SERVER; ACCURACY; MATRICES;
D O I
10.1016/j.knosys.2010.10.002
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Accurate protein secondary structure prediction plays an important role in direct tertiary structure modeling, and can also significantly improve sequence analysis and sequence-structure threading for structure and function determination. Hence improving the accuracy of secondary structure prediction is essential for future developments throughout the field of protein research. In this article, we propose a mixed-modal support vector machine (SVM) method for predicting protein secondary structure. Using the evolutionary information contained in the physicochemical properties of each amino acid and a position-specific scoring matrix generated by a PSI-BLAST multiple sequence alignment as input for a mixed-modal SVM, secondary structure can be predicted at significantly increased accuracy. Using a Knowledge Discovery Theory based on the Inner Cognitive Mechanism (KDTICM) method, we have proposed a compound pyramid model, which is composed of three layers of intelligent interface that integrate a mixed-modal SVM (MMS) module, a modified Knowledge Discovery in Databases (KDD*) process, a mixed-modal back propagation neural network (MMBP) module and so on. Testing against data sets of non-redundant protein sequences returned values for the Q(3) accuracy measure that ranged from 84.0% to 85.6%,while values for the SOV99 segment overlap measure ranged from 79.8% to 80.6%. When compared using a blind test dataset from the CASP8 meeting against currently available secondary structure prediction methods, our new approach shows superior accuracy. Availability: http://www.kdd.ustb.edu.cn/protein_Web/. (C) 2010 Elsevier B.V. All rights reserved.
引用
收藏
页码:304 / 313
页数:10
相关论文
共 50 条
  • [1] Protein Secondary Structure Prediction Based on Improved SVM Method in Compound Pyramid Model
    Yang, Bingru
    Qu, Wu
    Zhai, Yun
    Sui, Haifeng
    2010 CHINESE CONTROL AND DECISION CONFERENCE, VOLS 1-5, 2010, : 4405 - 4410
  • [2] An Approach of Protein Secondary Structure Prediction Based on SVM Method in Compound Pyramid Model
    Yang, Bingru
    Qu, Wu
    Zhai, Yun
    Sui, Haifeng
    2010 2ND INTERNATIONAL CONFERENCE ON COMPUTER AND AUTOMATION ENGINEERING (ICCAE 2010), VOL 1, 2010, : 455 - 459
  • [3] An Approach of Protein Secondary Structure Prediction Based on Homology Analysis Method in Compound Pyramid Model
    Yang, Bingru
    Qu, Wu
    Zhai, Yun
    Sui, Haifeng
    2010 2ND INTERNATIONAL CONFERENCE ON COMPUTER AND AUTOMATION ENGINEERING (ICCAE 2010), VOL 1, 2010, : 450 - 454
  • [4] KAAPRO: An approach of protein secondary structure prediction based on KDD* in the compound pyramid prediction model
    Yang, Bingru
    Hou, Wei
    Zhou, Zhun
    Quan, Huabin
    EXPERT SYSTEMS WITH APPLICATIONS, 2009, 36 (05) : 9000 - 9006
  • [5] Improving protein secondary structure prediction using a multi-modal BP method
    Qu, Wu
    Sui, Haifeng
    Yang, Bingru
    Qian, Wenbin
    COMPUTERS IN BIOLOGY AND MEDICINE, 2011, 41 (10) : 946 - 959
  • [6] A method of predicting the secondary protein structure based on dictionaries
    Roterman-Konieczna, Irena
    Fabian, Piotr
    Stapor, Katarzyna
    BIO-ALGORITHMS AND MED-SYSTEMS, 2015, 11 (03) : 163 - 170
  • [7] A novel method for protein secondary structure prediction using dual-layer SVM and profiles
    Guo, J
    Chen, H
    Sun, ZR
    Lin, YL
    PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 2004, 54 (04) : 738 - 743
  • [8] Compound method of protein secondary structure prediction and its implementation
    Chen, Hang
    Gu, Fei
    Huang, Zhengge
    FIRST INTERNATIONAL MULTI-SYMPOSIUMS ON COMPUTER AND COMPUTATIONAL SCIENCES (IMSCCS 2006), PROCEEDINGS, VOL 1, 2006, : 104 - +
  • [9] A catalog method for predicting protein secondary structure by dynamic programming
    Stanfel, LE
    PROCEEDINGS OF THE INTERNATIONAL SYMPOSIUM ON THEORETICAL BIOPHYSICS AND BIOMATHEMATICS, 1997, : 59 - 64
  • [10] PHDcleav: a SVM based method for predicting human Dicer cleavage sites using sequence and secondary structure of miRNA precursors
    Firoz Ahmed
    Rakesh Kaundal
    Gajendra PS Raghava
    BMC Bioinformatics, 14