Physicochemical feature-based classification of amino acid mutations

被引:31
|
作者
Shen, Bairong [1 ]
Bai, Jinwei [1 ]
Vihinen, Mauno [1 ,2 ]
机构
[1] Univ Tampere, Inst Med Technol, FI-33014 Tampere, Finland
[2] Tampere Univ Hosp, Res Unit, FI-33520 Tampere, Finland
来源
PROTEIN ENGINEERING DESIGN & SELECTION | 2008年 / 21卷 / 01期
关键词
amino acid; mutation; physicochemical properties; protein stability; support vector machine;
D O I
10.1093/protein/gzm084
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
A huge quantity of gene and protein sequences have become available during the post-genomic era, and information about genetic variations, including amino acid substitutions and SNPs, is accumulating rapidly. To understand the effects of these changes, it is often essential to apply bioinformatics tools. Where there is a lack of homologous sequences or a three-dimensional structure, it becomes essential to predict the effects of mutations based solely on protein sequence information. Several computational methods utilizing machine learning techniques have been developed. These predictions generally use the 20-alphabet amino acid code to train the model. With limited available data, the 20-alphabet amino acid features may introduce so many parameters that the model becomes over-fitted. To decrease the number of parameters, we propose a physicochemical feature-based method to forecast the effects of amino acid substitutions on protein stability. Protein structure alterations caused by mutations can be classified as stabilizing or destabilizing. Based on experimental folding-unfolding free energy (Delta Delta G) values, we trained a support vector machine with a cleaned data set. The physicochemical properties of the mutated residues, the number of neighboring residues in the primary sequence and the temperature and pH were used as input attributes. Different kernel functions, attributes and window sizes were optimized. An average accuracy of 80% was obtained in cross-validation experiments.
引用
收藏
页码:37 / 44
页数:8
相关论文
共 50 条
  • [1] Feature-Based Classification of Amino Acid Substitutions outside Conserved Functional Protein Domains
    Gemovic, Branislava
    Perovic, Vladimir
    Glisic, Sanja
    Veljkovic, Nevena
    SCIENTIFIC WORLD JOURNAL, 2013,
  • [2] A PROPOSAL FOR FEATURE CLASSIFICATION IN FEATURE-BASED DESIGN
    OVTCHAROVA, J
    PAHL, G
    RIX, J
    COMPUTERS & GRAPHICS, 1992, 16 (02) : 187 - 195
  • [3] Feature-Based Dissimilarity Space Classification
    Duin, Robert P. W.
    Loog, Marco
    Pekalska, Elzbieta
    Tax, David M. J.
    RECOGNIZING PATTERNS IN SIGNALS, SPEECH, IMAGES, AND VIDEOS, 2010, 6388 : 46 - +
  • [4] STATISTICAL FEATURE-BASED CRAQUELURE CLASSIFICATION
    Crisologo, Irene
    Monterola, Christopher
    Soriano, Maricor
    INTERNATIONAL JOURNAL OF MODERN PHYSICS C, 2011, 22 (11): : 1191 - 1209
  • [5] Feature-Based Lung Nodule Classification
    Farag, Amal
    Ali, Asem
    Graham, James
    Elhabian, Shireen
    Farag, Aly
    Falk, Robert
    ADVANCES IN VISUAL COMPUTING, PT III, 2010, 6455 : 79 - +
  • [6] Feature-Based Terrain Classification For LittleDog
    Filitchkin, Paul
    Byl, Katie
    2012 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2012, : 1387 - 1392
  • [7] EndNote: Feature-based classification of networks
    Barnett, Ian
    Malik, Nishant
    Kuijjer, Marieke L.
    Mucha, Peter J.
    Onnela, Jukka-Pekka
    NETWORK SCIENCE, 2019, 7 (03) : 438 - 444
  • [8] A feature-based trust sequence classification algorithm
    Yahyaoui, Hamdi
    Al-Mutairi, Aisha
    INFORMATION SCIENCES, 2016, 328 : 455 - 484
  • [9] A FEATURE-BASED ULTRASONIC SYSTEM FOR IGSCC CLASSIFICATION
    ROSE, J
    AVIOLI, M
    LAPIDES, M
    MATERIALS EVALUATION, 1982, 40 (03) : A13 - A13
  • [10] Multimodal Feature-Based Surface Material Classification
    Strese, Matti
    Schuwerk, Clemens
    Iepure, Albert
    Steinbach, Eckehard
    IEEE TRANSACTIONS ON HAPTICS, 2017, 10 (02) : 226 - 239