Imputing missing genotypic data of single-nucleotide polymorphisms using neural networks

被引:0
|
作者
Yan V Sun
Sharon L R Kardia
机构
[1] School of Public Health,Department of Epidemiology
[2] University of Michigan,undefined
来源
关键词
SNP; neural networks; missing data imputation; genotype prediction;
D O I
暂无
中图分类号
学科分类号
摘要
With advances in high-throughput single-nucleotide polymorphism (SNP) genotyping, the amount of genotype data available for genetic studies is steadily increasing, and with it comes new abilities to study multigene interactions as well as to develop higher dimensional genetic models that more closely represent the polygenic nature of common disease risk. The combined impact of even small amounts of missing data on a multi-SNP analysis may be considerable. In this study, we present a neural network method for imputing missing SNP genotype data. We compared its imputation accuracy with fastPHASE and an expectation–maximization algorithm implemented in HelixTree. In a simulation data set of 1000 SNPs and 1000 subjects, 1, 5 and 10% of genotypes were randomly masked. Four levels of linkage disequilibrium (LD), LD R2<0.2, R2<0.5, R2<0.8 and no LD threshold, were examined to evaluate the impact of LD on imputation accuracy. All three methods are capable of imputing most missing genotypes accurately (accuracy >86%). The neural network method accurately predicted 92.0–95.9% of the missing genotypes. In a real data set comparison with 419 subjects and 126 SNPs from chromosome 2, the neural network method achieves the highest imputation accuracies >83.1% with missing rate from 1 to 5%. Using 90 HapMap subjects with 1962 SNPs, fastPHASE had the highest accuracy (∼97%) while the other two methods had >95% accuracy. These results indicate that the neural network model is an accurate and convenient tool, requiring minimal parameter tuning for SNP data recovery, and provides a valuable alternative to usual complete-case analysis.
引用
收藏
页码:487 / 495
页数:8
相关论文
共 50 条
  • [41] Prediction of Single-Nucleotide Polymorphisms Causative of Rare Diseases
    Ferraro, Maria Brigida
    Guarracino, Mario Rosario
    COMPUTATIONAL INTELLIGENCE METHODS FOR BIOINFORMATICS AND BIOSTATISTICS: 10TH INTERNATIONAL MEETING, 2014, 8452 : 213 - 224
  • [42] Single-nucleotide polymorphisms in the p53 pathway
    Harris, S. L.
    Gil, G.
    Hu, W.
    Robins, H.
    Bond, E.
    Hirshfield, K.
    Feng, Z.
    Yu, X.
    Teresky, A. K.
    Bond, G.
    Levine, A. J.
    MOLECULAR APPROACHES TO CONTROLLING CANCER, 2005, 70 : 111 - 119
  • [43] StructMAn: annotation of single-nucleotide polymorphisms in the structural context
    Gress, Alexander
    Ramensky, Vasily
    Buech, Joachim
    Keller, Andreas
    Kalinina, Olga V.
    NUCLEIC ACIDS RESEARCH, 2016, 44 (W1) : W463 - W468
  • [44] Energy-Based Temporal Neural Networks for Imputing Missing Values
    Brakel, Philemon
    Schrauwen, Benjamin
    NEURAL INFORMATION PROCESSING, ICONIP 2012, PT II, 2012, 7664 : 575 - 582
  • [45] Single-nucleotide polymorphisms and genome diversity in Plasmodium vivax
    Feng, XR
    Carlton, JM
    Joy, DA
    Mu, JB
    Furuya, T
    Suh, BB
    Wang, YF
    Barnwell, JW
    Su, XZ
    PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2003, 100 (14) : 8502 - 8507
  • [46] Single-nucleotide polymorphisms in the public domain: how useful are they?
    Gabor Marth
    Raymond Yeh
    Matthew Minton
    Rachel Donaldson
    Qun Li
    Shenghui Duan
    Ruth Davenport
    Raymond D. Miller
    Pui-Yan Kwok
    Nature Genetics, 2001, 27 : 371 - 372
  • [47] Effectiveness of Single-Nucleotide Polymorphisms to Investigate Cattle Rustling
    Fernandez, Maria E.
    Rogberg-Munoz, Andres
    Liron, Juan P.
    Goszczynski, Daniel E.
    Ripoli, Maria V.
    Carino, Monica H.
    Peral-Garcia, Pilar
    Giovambattista, Guillermo
    JOURNAL OF FORENSIC SCIENCES, 2014, 59 (06) : 1607 - 1613
  • [48] Single-nucleotide polymorphisms of the PRKCG gene and osteosarcoma susceptibility
    Zhang, Ying
    Hu, Xu
    Wang, Hong-Kai
    Shen, Wei-Wei
    Liao, Tong-Quan
    Chen, Pei
    Chu, Tong-Wei
    TUMOR BIOLOGY, 2014, 35 (12) : 12671 - 12677
  • [49] Novel single-nucleotide polymorphisms associated with pemphigus vulgaris
    Handa, Sanjeev
    Mahajan, Rahul
    De, Dipankar
    Kumar, Sheetanshu
    JOURNAL OF THE AMERICAN ACADEMY OF DERMATOLOGY, 2020, 83 (06) : AB128 - AB128
  • [50] Using gold nanoparticles to detect single-nucleotide polymorphisms: toward liquid biopsy
    Iglesias, Maria Sanroman
    Grzelczak, Marek
    BEILSTEIN JOURNAL OF NANOTECHNOLOGY, 2020, 11 : 263 - 284