Imputing missing genotypic data of single-nucleotide polymorphisms using neural networks

Authors
Yan V Sun
Sharon L R Kardia
Affiliations
[1] Department of Epidemiology, School of Public Health
[2] University of Michigan
Keywords
SNP; neural networks; missing data imputation; genotype prediction
Abstract
With advances in high-throughput single-nucleotide polymorphism (SNP) genotyping, the amount of genotype data available for genetic studies is steadily increasing. This brings new opportunities to study multigene interactions and to develop higher-dimensional genetic models that more closely represent the polygenic nature of common disease risk. However, the combined impact of even small amounts of missing data on a multi-SNP analysis may be considerable. In this study, we present a neural network method for imputing missing SNP genotype data. We compared its imputation accuracy with that of fastPHASE and of an expectation–maximization algorithm implemented in HelixTree. In a simulated data set of 1000 SNPs and 1000 subjects, 1, 5 and 10% of genotypes were randomly masked. Four levels of linkage disequilibrium (LD), namely R² < 0.2, R² < 0.5, R² < 0.8 and no LD threshold, were examined to evaluate the impact of LD on imputation accuracy. All three methods imputed most missing genotypes accurately (accuracy >86%). The neural network method accurately predicted 92.0–95.9% of the missing genotypes. In a real data set comparison with 419 subjects and 126 SNPs from chromosome 2, the neural network method achieved the highest imputation accuracies (>83.1%) at missing rates from 1 to 5%. Using 90 HapMap subjects with 1962 SNPs, fastPHASE had the highest accuracy (~97%), while the other two methods had >95% accuracy. These results indicate that the neural network model is an accurate and convenient tool that requires minimal parameter tuning for SNP data recovery and provides a valuable alternative to the usual complete-case analysis.
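The evaluation protocol described in the abstract (randomly mask a fixed fraction of known genotypes, impute them, and score the fraction recovered correctly) can be illustrated with a minimal sketch. This is not the authors' neural network, fastPHASE, or the HelixTree algorithm: the imputer here is a simple most-frequent-genotype baseline chosen only to make the example self-contained. The 0/1/2 genotype coding, the toy data, and the function names `mask_genotypes`, `impute_mode` and `accuracy` are all assumptions made for illustration.

```python
import random
from collections import Counter


def mask_genotypes(genotypes, rate, rng):
    """Return a copy with a random fraction `rate` of entries set to None,
    together with the list of (subject, snp) positions that were masked."""
    n_subjects, n_snps = len(genotypes), len(genotypes[0])
    cells = [(i, j) for i in range(n_subjects) for j in range(n_snps)]
    masked = rng.sample(cells, round(rate * len(cells)))
    holed = [row[:] for row in genotypes]  # leave the input untouched
    for i, j in masked:
        holed[i][j] = None
    return holed, masked


def impute_mode(holed):
    """Baseline imputer (an assumption, not the paper's method): fill each
    missing entry with the most common observed genotype at that SNP."""
    filled = [row[:] for row in holed]
    for j in range(len(holed[0])):
        observed = [row[j] for row in holed if row[j] is not None]
        mode = Counter(observed).most_common(1)[0][0] if observed else 0
        for row in filled:
            if row[j] is None:
                row[j] = mode
    return filled


def accuracy(truth, imputed, masked):
    """Fraction of masked positions whose imputed genotype equals the truth."""
    return sum(truth[i][j] == imputed[i][j] for i, j in masked) / len(masked)


# Toy data: 200 subjects x 50 SNPs, genotypes coded 0/1/2 (hypothetical).
rng = random.Random(0)
truth = [[rng.choice([0, 0, 0, 1, 2]) for _ in range(50)] for _ in range(200)]

# Same missing rates as in the abstract: 1, 5 and 10%.
for rate in (0.01, 0.05, 0.10):
    holed, masked = mask_genotypes(truth, rate, rng)
    acc = accuracy(truth, impute_mode(holed), masked)
    print(f"missing rate {rate:.0%}: accuracy {acc:.3f}")
```

Because accuracy is scored only on the masked positions, any imputer with the same input and output shape could be substituted for `impute_mode` and compared under identical masks.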
Pages: 487-495
Page count: 8
Source: European Journal of Human Genetics, 2008, 16(4): 487-495