Proteins and Domains Vary in Their Tolerance of Non-Synonymous Single Nucleotide Polymorphisms (nsSNPs)

被引:29
|
作者
Yates, Christopher M. [1 ]
Sternberg, Michael J. E. [1 ]
机构
[1] Univ London Imperial Coll Sci Technol & Med, Dept Life Sci, Ctr Integrat Syst Biol & Bioinformat, London SW7 2AZ, England
基金
英国医学研究理事会;
关键词
disease; mutation; SNP; protein structure; bioinfornnatics; INTRINSICALLY DISORDERED PROTEINS; NETWORK PROPERTIES; HUMAN-DISEASES; EVOLUTION; DATABASE; SERVER; GENES; SNPS;
D O I
10.1016/j.jmb.2013.01.026
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
The widespread application of whole-genome sequencing is identifying numerous non-synonymous single nucleotide polymorphisms (nsSNPs), many of which are associated with disease. We analyzed nsSNPs from Humsavar and the 1000 Genomes Project to investigate why some proteins and domains are more tolerant of mutations than others. We identified 311 proteins and 112 Pfam families, corresponding to 2910 domains, as disease susceptible and 32 proteins and 67 Pfam families (10,783 domains) as disease resistant based on the relative numbers of disease-associated and neutral polymorphisms. Proteins with no significant difference from expected numbers of disease and polymorphism nsSNPs are classified as other. This classification takes into account the phenotypes of all known mutations in the protein or domain rather than simply classifying based on the presence or absence of disease nsSNPs. Of the two hypotheses suggested, our results support the model that disease-resistant domains and proteins are more able to tolerate mutations rather than having more lethal mutations that are not observed. Disease-resistant proteins and domains show significantly higher mutation rates and lower sequence conservation than disease-susceptible proteins and domains. Disease-susceptible proteins are more likely to be encoded by essential genes, are more central in protein-protein interaction networks and are less likely to contain loss-of-function mutations in healthy individuals. We use this classification for nsSNP phenotype prediction, predicting nsSNPs in disease-susceptible domains to be disease and those in disease-resistant domains to be polymorphism. In this way, we achieve higher accuracy than SIFT, a state-of-the-art algorithm. (C) 2013 Elsevier Ltd. All rights reserved.
引用
收藏
页码:1274 / 1286
页数:13
相关论文
共 50 条
  • [31] Prediction by Graph Theoretic Measures of Structural Effects in Proteins Arising from Non-Synonymous Single Nucleotide Polymorphisms
    Cheng, Tammy M. K.
    Lu, Yu-En
    Vendruscolo, Michele
    Lio, Pietro
    Blundell, Tom L.
    PLOS COMPUTATIONAL BIOLOGY, 2008, 4 (07)
  • [32] Large-scale analysis of non-synonymous coding region single nucleotide polymorphisms
    Clifford, RJ
    Edmonson, MN
    Nguyen, C
    Buetow, KH
    BIOINFORMATICS, 2004, 20 (07) : 1006 - 1014
  • [33] An in silico evaluation of non-synonymous single nucleotide polymorphisms of mastitis resistance genes in cattle
    Jacob, Kala K.
    Radhika, G.
    Aravindakshan, T. V.
    ANIMAL BIOTECHNOLOGY, 2020, 31 (01) : 25 - 31
  • [34] Functional impacts of non-synonymous single nucleotide polymorphisms: Selective constraint and structural environments
    Bao, L
    Cui, Y
    FEBS LETTERS, 2006, 580 (05) : 1231 - 1234
  • [35] Predicting Deleterious Non-Synonymous Single Nucleotide Polymorphisms (nsSNPs) of HRAS Gene and In Silico Evaluation of Their Structural and Functional Consequences towards Diagnosis and Prognosis of Cancer
    Chai, Chuan-Yu
    Maran, Sathiya
    Thew, Hin-Yee
    Tan, Yong-Chiang
    Abd Rahman, Nik Mohd Afizan Nik
    Cheng, Wan-Hee
    Lai, Kok-Song
    Loh, Jiun-Yan
    Yap, Wai-Sum
    BIOLOGY-BASEL, 2022, 11 (11):
  • [36] Machine learning techniques for pathogenicity prediction of non-synonymous single nucleotide polymorphisms in human body
    El Houby, Enas M. F.
    JOURNAL OF AMBIENT INTELLIGENCE AND HUMANIZED COMPUTING, 2022, 14 (7) : 8099 - 8113
  • [37] TopoSNP: a topographic database of non-synonymous single nucleotide polymorphisms with and without known disease association
    Stitziel, NO
    Binkowski, TA
    Tseng, YY
    Kasif, S
    Liang, J
    NUCLEIC ACIDS RESEARCH, 2004, 32 : D520 - D522
  • [38] Computational prediction of the effects of non-synonymous single nucleotide polymorphisms in human DNA repair genes
    Nakken, S.
    Alseth, I.
    Rognes, T.
    NEUROSCIENCE, 2007, 145 (04) : 1273 - 1279
  • [39] Predicting the phenotypic effects of non-synonymous single nucleotide polymorphisms based on support vector machines
    Tian, Jian
    Wu, Ningfeng
    Guo, Xuexia
    Guo, Jun
    Zhang, Juhua
    Fan, Yunliu
    BMC BIOINFORMATICS, 2007, 8
  • [40] Predicting the phenotypic effects of non-synonymous single nucleotide polymorphisms based on support vector machines
    Jian Tian
    Ningfeng Wu
    Xuexia Guo
    Jun Guo
    Juhua Zhang
    Yunliu Fan
    BMC Bioinformatics, 8