Predicting protein secondary structure using a mixed-modal SVM method in a compound pyramid model

被引:32
|
作者
Yang, Bingru [1 ]
Wu, Qu [1 ]
Ying, Zhou [1 ]
Sui, Haifeng [1 ]
机构
[1] Univ Sci & Technol Beijing, Sch Informat Engn, Beijing 100083, Peoples R China
基金
中国国家自然科学基金;
关键词
Protein secondary structure prediction; Physicochemical properties; Mixed-modal SVM; Compound pyramid model; FOLD-RECOGNITION; SERVER; ACCURACY; MATRICES;
D O I
10.1016/j.knosys.2010.10.002
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Accurate protein secondary structure prediction plays an important role in direct tertiary structure modeling, and can also significantly improve sequence analysis and sequence-structure threading for structure and function determination. Hence improving the accuracy of secondary structure prediction is essential for future developments throughout the field of protein research. In this article, we propose a mixed-modal support vector machine (SVM) method for predicting protein secondary structure. Using the evolutionary information contained in the physicochemical properties of each amino acid and a position-specific scoring matrix generated by a PSI-BLAST multiple sequence alignment as input for a mixed-modal SVM, secondary structure can be predicted at significantly increased accuracy. Using a Knowledge Discovery Theory based on the Inner Cognitive Mechanism (KDTICM) method, we have proposed a compound pyramid model, which is composed of three layers of intelligent interface that integrate a mixed-modal SVM (MMS) module, a modified Knowledge Discovery in Databases (KDD*) process, a mixed-modal back propagation neural network (MMBP) module and so on. Testing against data sets of non-redundant protein sequences returned values for the Q(3) accuracy measure that ranged from 84.0% to 85.6%,while values for the SOV99 segment overlap measure ranged from 79.8% to 80.6%. When compared using a blind test dataset from the CASP8 meeting against currently available secondary structure prediction methods, our new approach shows superior accuracy. Availability: http://www.kdd.ustb.edu.cn/protein_Web/. (C) 2010 Elsevier B.V. All rights reserved.
引用
收藏
页码:304 / 313
页数:10
相关论文
共 50 条
  • [31] TertProt: A Protein Fold Recognition Method Using Protein Secondary Structure Program
    Kaladhar, D. S. V. G. K.
    PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON INFORMATION SYSTEMS DESIGN AND INTELLIGENT APPLICATIONS 2012 (INDIA 2012), 2012, 132 : 161 - 168
  • [32] Comprehensive model for predicting phreatic line of tailings dam using Grey theory and SVM method
    Wang, Feiyue
    DISASTER ADVANCES, 2013, 6 : 126 - 134
  • [33] A method for predicting number of tsunami fires using generalized linear mixed model
    Nishino, Tomoaki
    Hokugo, Akihiko
    Journal of Environmental Engineering (Japan), 2015, 80 (718): : 1105 - 1114
  • [34] PREDICTING SECONDARY STRUCTURE OF A PROTEIN USING MIRA ALGORITHM BASED ON TETRAPEPTIDE STRUCTURAL WORDS
    Ramyachitra, D.
    Rani, R. Ranjani
    Kamalakkannan, V.
    PROCEEDINGS OF THE 2ND INTERNATIONAL CONFERENCE ON INVENTIVE SYSTEMS AND CONTROL (ICISC 2018), 2018, : 844 - 848
  • [35] On predicting protein secondary structure from their aminoacid sequences using Inductive Logic Programming
    Magalhaes, Alexandre
    Fonseca, Nuno A.
    2005 PORTUGUESE CONFERENCE ON ARTIFICIAL INTELLIGENCE, PROCEEDINGS, 2005, : 136 - 139
  • [36] PROTEIN SECONDARY STRUCTURE PREDICTION USING A STATISTICAL-MECHANICAL METHOD
    KOBAYASHI, Y
    SAITO, N
    PROTEIN ENGINEERING, 1994, 7 (09): : 1164 - 1164
  • [37] Protein Secondary Structure Prediction Using an Evolutionary Computation Method and Clustering
    Zamani, Masood
    Kremer, Stefan C.
    2015 IEEE CONFERENCE ON COMPUTATIONAL INTELLIGENCE IN BIOINFORMATICS AND COMPUTATIONAL BIOLOGY (CIBCB), 2015, : 118 - 123
  • [38] Protein Secondary Structure Prediction Using Cascaded Feature Learning Model
    Geethu, S.
    Vimina, E. R.
    APPLIED SOFT COMPUTING, 2023, 140
  • [39] CONCORD: a consensus method for protein secondary structure prediction via mixed integer linear optimization
    Wei, Y.
    Thompson, J.
    Floudas, C. A.
    PROCEEDINGS OF THE ROYAL SOCIETY A-MATHEMATICAL PHYSICAL AND ENGINEERING SCIENCES, 2012, 468 (2139): : 831 - 850
  • [40] Predicting protein secondary structure using structure-based models of evolutionarily-derived site heterogeneity
    Thompson, MJ
    Goldstein, RA
    BIOPHYSICAL JOURNAL, 1997, 72 (02) : MP286 - MP286