The widespread application of whole-genome sequencing is identifying numerous non-synonymous single nucleotide polymorphisms (nsSNPs), many of which are associated with disease. We analyzed nsSNPs from Humsavar and the 1000 Genomes Project to investigate why some proteins and domains are more tolerant of mutations than others. Based on the relative numbers of disease-associated and neutral polymorphisms, we identified 311 proteins and 112 Pfam families (2,910 domains) as disease susceptible, and 32 proteins and 67 Pfam families (10,783 domains) as disease resistant. Proteins whose numbers of disease and polymorphism nsSNPs do not differ significantly from the expected numbers are classified as other. This classification takes into account the phenotypes of all known mutations in the protein or domain, rather than simply classifying on the presence or absence of disease nsSNPs. Two hypotheses could explain disease resistance: that these domains and proteins tolerate mutations better, or that they carry more lethal mutations, which are never observed. Our results support the first. Disease-resistant proteins and domains show significantly higher mutation rates and lower sequence conservation than disease-susceptible proteins and domains. Disease-susceptible proteins are more likely to be encoded by essential genes, are more central in protein-protein interaction networks, and are less likely to contain loss-of-function mutations in healthy individuals. We use this classification for nsSNP phenotype prediction, predicting nsSNPs in disease-susceptible domains to be disease-associated and those in disease-resistant domains to be neutral polymorphisms. In this way, we achieve higher accuracy than SIFT, a state-of-the-art algorithm. (C) 2013 Elsevier Ltd. All rights reserved.
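The classification and prediction rule described above can be illustrated with a minimal sketch. The abstract does not name the statistical test, the significance threshold, or any multiple-testing correction, so everything here is an assumption made for illustration: a one-sided exact binomial test against a background disease fraction, a threshold `alpha` of 0.05, and the hypothetical function names `binom_tail`, `classify`, and `predict`.

```python
from math import comb


def binom_tail(k, n, p, upper):
    """Exact binomial tail: P(X >= k) if upper, else P(X <= k), X ~ Bin(n, p)."""
    ks = range(k, n + 1) if upper else range(0, k + 1)
    return sum(comb(n, i) * p**i * (1 - p) ** (n - i) for i in ks)


def classify(n_disease, n_neutral, background_disease_frac, alpha=0.05):
    """Label a protein or domain from its disease and neutral nsSNP counts.

    Assumed rule (not taken from the paper): compare the observed number of
    disease nsSNPs with what the background disease fraction would predict.
    """
    n = n_disease + n_neutral
    if n == 0:
        return "other"  # no annotated nsSNPs, nothing to test
    # Significantly more disease nsSNPs than expected
    if binom_tail(n_disease, n, background_disease_frac, upper=True) < alpha:
        return "susceptible"
    # Significantly fewer disease nsSNPs than expected
    if binom_tail(n_disease, n, background_disease_frac, upper=False) < alpha:
        return "resistant"
    return "other"


def predict(domain_class):
    """Predict an nsSNP phenotype from the class of the domain it falls in.

    Returns None for "other", where the rule makes no call.
    """
    return {"susceptible": "disease", "resistant": "polymorphism"}.get(domain_class)


if __name__ == "__main__":
    # Invented counts, for illustration only
    for counts in [(18, 2), (0, 20), (8, 12)]:
        label = classify(*counts, background_disease_frac=0.4)
        print(counts, label, predict(label))
```

A real analysis over hundreds of proteins and Pfam families would also need to correct for multiple testing, which this sketch omits.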
Chai, Chuan-Yu
Affiliation: Univ Tunku Abdul Rahman, Fac Sci, Dept Biol Sci, Jalan Univ, Bandar Barat 31900, Kampar, Malaysia
Maran, Sathiya
Affiliation: Monash Univ Malaysia, Sch Pharm, Jalan Lagoon Selatan, Bandar Sunway 47500, Malaysia
Thew, Hin-Yee
Affiliation: Monash Univ Malaysia, Sch Pharm, Jalan Lagoon Selatan, Bandar Sunway 47500, Malaysia
Tan, Yong-Chiang
Affiliation: Int Med Univ, Sch Postgrad Studies, Jalan Jalil Perkasa 19, Kuala Lumpur 57000, Malaysia
Abd Rahman, Nik Mohd Afizan Nik
Affiliation: Univ Putra Malaysia, Fac Biotechnol & Biomol Sci, Dept Cell & Mol Biol, Serdang 43400, Selangor, Malaysia
Cheng, Wan-Hee
Affiliation: INTI Int Univ, Fac Hlth & Life Sci, Nilai 71800, Negeri Sembilan, Malaysia
Lai, Kok-Song
Affiliation: Higher Coll Technol, Abu Dhabi Womens Coll, Hlth Sci Div, Abu Dhabi 41012, U Arab Emirates
Loh, Jiun-Yan
Affiliation: UCSI Univ, Ctr Res Adv Aquaculture CORAA, 1 Jalan Menara Gading UCSI Height, Kuala Lumpur 56000, Malaysia
Yap, Wai-Sum
Affiliation: He & Ni Acad, Off Tower B, Kuala Lumpur 59200, Malaysia
Tian, Jian
Affiliation: Chinese Acad Agr Sci, Biotechnol Res Inst, Beijing 100081, Peoples R China
Wu, Ningfeng
Affiliation: Chinese Acad Agr Sci, Biotechnol Res Inst, Beijing 100081, Peoples R China
Guo, Xuexia
Affiliation: Acad Planning & Designing, Minist Agr, Agr Byprod Proc Res Inst, Beijing 100026, Peoples R China
Guo, Jun
Affiliation: Chinese Acad Agr Sci, Biotechnol Res Inst, Beijing 100081, Peoples R China
Zhang, Juhua
Affiliation: Beijing Inst Technol, Dept Biomed Engn, Beijing 100081, Peoples R China
Fan, Yunliu
Affiliation: Chinese Acad Agr Sci, Biotechnol Res Inst, Beijing 100081, Peoples R China