Predicting protein secondary structure using a mixed-modal SVM method in a compound pyramid model

被引:32
|
作者
Yang, Bingru [1 ]
Wu, Qu [1 ]
Ying, Zhou [1 ]
Sui, Haifeng [1 ]
机构
[1] Univ Sci & Technol Beijing, Sch Informat Engn, Beijing 100083, Peoples R China
基金
中国国家自然科学基金;
关键词
Protein secondary structure prediction; Physicochemical properties; Mixed-modal SVM; Compound pyramid model; FOLD-RECOGNITION; SERVER; ACCURACY; MATRICES;
D O I
10.1016/j.knosys.2010.10.002
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Accurate protein secondary structure prediction plays an important role in direct tertiary structure modeling, and can also significantly improve sequence analysis and sequence-structure threading for structure and function determination. Hence improving the accuracy of secondary structure prediction is essential for future developments throughout the field of protein research. In this article, we propose a mixed-modal support vector machine (SVM) method for predicting protein secondary structure. Using the evolutionary information contained in the physicochemical properties of each amino acid and a position-specific scoring matrix generated by a PSI-BLAST multiple sequence alignment as input for a mixed-modal SVM, secondary structure can be predicted at significantly increased accuracy. Using a Knowledge Discovery Theory based on the Inner Cognitive Mechanism (KDTICM) method, we have proposed a compound pyramid model, which is composed of three layers of intelligent interface that integrate a mixed-modal SVM (MMS) module, a modified Knowledge Discovery in Databases (KDD*) process, a mixed-modal back propagation neural network (MMBP) module and so on. Testing against data sets of non-redundant protein sequences returned values for the Q(3) accuracy measure that ranged from 84.0% to 85.6%,while values for the SOV99 segment overlap measure ranged from 79.8% to 80.6%. When compared using a blind test dataset from the CASP8 meeting against currently available secondary structure prediction methods, our new approach shows superior accuracy. Availability: http://www.kdd.ustb.edu.cn/protein_Web/. (C) 2010 Elsevier B.V. All rights reserved.
引用
收藏
页码:304 / 313
页数:10
相关论文
共 50 条
  • [41] Predicting Protein Therapeutic Candidates for Bovine Babesiosis Using Secondary Structure Properties and Machine Learning
    Goodswen, Stephen J.
    Kennedy, Paul J.
    Ellis, John T.
    FRONTIERS IN GENETICS, 2021, 12
  • [42] PREDICTING MEMBRANE-PROTEIN SECONDARY STRUCTURE - PREFERENCE FUNCTIONS METHOD FOR FINDING OPTIMAL CONFORMATIONAL PARAMETERS
    JURETIC, D
    LUCIC, B
    TRINAJSTIC, N
    CROATICA CHEMICA ACTA, 1993, 66 (01) : 201 - 208
  • [43] Prediction of protein secondary structure using Bayesian method and support vector machines
    Nguyen, MN
    Rajapakse, JC
    ICONIP'02: PROCEEDINGS OF THE 9TH INTERNATIONAL CONFERENCE ON NEURAL INFORMATION PROCESSING: COMPUTATIONAL INTELLIGENCE FOR THE E-AGE, 2002, : 616 - 620
  • [44] Vibrational spectral simulation for peptides of mixed secondary structure: Method comparisons with the TrpZip model hairpin
    Bour, P
    Keiderling, TA
    JOURNAL OF PHYSICAL CHEMISTRY B, 2005, 109 (49): : 23687 - 23697
  • [45] Secondary structure characterization of mixed food protein complexes using microfluidic modulation spectroscopy (MMS)
    Kim, Woojeong
    Wang, Yong
    Ma, Maria
    Ye, Qianyu
    Collins, Valerie Ivancic
    Selomulya, Cordelia
    FOOD BIOSCIENCE, 2023, 53
  • [46] Using Maximum Entropy Model to Predict Protein Secondary Structure with Single Sequence
    Ding, Yong-Sheng
    Zhang, Tong-Liang
    Gu, Quan
    Zhao, Pei-Ying
    Chou, Kuo-Chen
    PROTEIN AND PEPTIDE LETTERS, 2009, 16 (05): : 552 - 560
  • [47] Predicting disulfide connectivity from protein sequence using multiple sequence feature vectors and secondary structure
    Song, Jiangning
    Yuan, Zheng
    Tan, Hao
    Huber, Thomas
    Burrage, Kevin
    BIOINFORMATICS, 2007, 23 (23) : 3147 - 3154
  • [48] ALIGNMENT OF PROTEIN SEQUENCES USING SECONDARY STRUCTURE - A MODIFIED DYNAMIC-PROGRAMMING METHOD
    FISCHELGHODSIAN, F
    MATHIOWITZ, G
    SMITH, TF
    PROTEIN ENGINEERING, 1990, 3 (07): : 577 - 581
  • [49] Improving protein secondary structure prediction using a simple k-mer model
    Madera, Martin
    Calmus, Ryan
    Thiltgen, Grant
    Karplus, Kevin
    Gough, Julian
    BIOINFORMATICS, 2010, 26 (05) : 596 - 602
  • [50] Protein model discrimination attempts using mutational sensitivity, predicted secondary structure, and model quality information
    Khare, Shruti
    Bhasin, Munmun
    Sahoo, Anusmita
    Varadarajan, Raghavan
    PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 2019, 87 (04) : 326 - 336