Imputing missing genotypic data of single-nucleotide polymorphisms using neural networks

被引:0
|
作者
Yan V Sun
Sharon L R Kardia
机构
[1] School of Public Health,Department of Epidemiology
[2] University of Michigan,undefined
来源
关键词
SNP; neural networks; missing data imputation; genotype prediction;
D O I
暂无
中图分类号
学科分类号
摘要
With advances in high-throughput single-nucleotide polymorphism (SNP) genotyping, the amount of genotype data available for genetic studies is steadily increasing, and with it comes new abilities to study multigene interactions as well as to develop higher dimensional genetic models that more closely represent the polygenic nature of common disease risk. The combined impact of even small amounts of missing data on a multi-SNP analysis may be considerable. In this study, we present a neural network method for imputing missing SNP genotype data. We compared its imputation accuracy with fastPHASE and an expectation–maximization algorithm implemented in HelixTree. In a simulation data set of 1000 SNPs and 1000 subjects, 1, 5 and 10% of genotypes were randomly masked. Four levels of linkage disequilibrium (LD), LD R2<0.2, R2<0.5, R2<0.8 and no LD threshold, were examined to evaluate the impact of LD on imputation accuracy. All three methods are capable of imputing most missing genotypes accurately (accuracy >86%). The neural network method accurately predicted 92.0–95.9% of the missing genotypes. In a real data set comparison with 419 subjects and 126 SNPs from chromosome 2, the neural network method achieves the highest imputation accuracies >83.1% with missing rate from 1 to 5%. Using 90 HapMap subjects with 1962 SNPs, fastPHASE had the highest accuracy (∼97%) while the other two methods had >95% accuracy. These results indicate that the neural network model is an accurate and convenient tool, requiring minimal parameter tuning for SNP data recovery, and provides a valuable alternative to usual complete-case analysis.
引用
收藏
页码:487 / 495
页数:8
相关论文
共 50 条
  • [31] Mapping of complex traits by single-nucleotide polymorphisms
    Zhao, LP
    Aragaki, C
    Hsu, L
    Quiaoit, F
    AMERICAN JOURNAL OF HUMAN GENETICS, 1998, 63 (01) : 225 - 240
  • [32] Understanding the Function of a Locus Using the Knowledge Available at Single-Nucleotide Polymorphisms
    Nikpay, Majid
    Ravati, Sepehr
    Dent, Robert
    McPherson, Ruth
    CARDIOGENETICS, 2021, 11 (04) : 255 - 262
  • [33] Imputation of Single-Nucleotide Polymorphisms in Inbred Mice Using Local Phylogeny
    Wang, Jeremy R.
    de Villena, Fernando Pardo-Manuel
    Lawson, Heather A.
    Cheverud, James M.
    Churchill, Gary A.
    McMillan, Leonard
    GENETICS, 2012, 190 (02) : 449 - U247
  • [34] Prediction of Early Childhood Caries Based on Single Nucleotide Polymorphisms Using Neural Networks
    Zaorska, Katarzyna
    Szczapa, Tomasz
    Borysewicz-Lewicka, Maria
    Nowicki, Michal
    Gerreth, Karolina
    GENES, 2021, 12 (04)
  • [35] Identification of relevant single-nucleotide polymorphisms in Pneumocystis jirovecii: relationship with clinical data
    Esteves, F.
    Gaspar, J.
    Marques, T.
    Leite, R.
    Antunes, F.
    Mansinho, K.
    Matos, O.
    CLINICAL MICROBIOLOGY AND INFECTION, 2010, 16 (07) : 878 - 884
  • [36] Imputing missing sleep data from wearables with neural networks in real-world settings
    Lee, Minki P.
    Hoang, Kien
    Park, Sungkyu
    Song, Yun Min
    Joo, Eun Yeon
    Chang, Won
    Kim, Jee Hyun
    Kim, Jae Kyoung
    SLEEP, 2024, 47 (01)
  • [37] DNA mixture interpretation using linear regression and neural networks on massively parallel sequencing data of single nucleotide polymorphisms
    Yang, Ta-Wei
    Li, Yi-Hao
    Chou, Cheng-Fu
    Lai, Fei-Pei
    Chien, Yin-Hsiu
    Yin, Hsiang-, I
    Lee, Tsui-Ting
    Hwa, Hsiao-Lin
    AUSTRALIAN JOURNAL OF FORENSIC SCIENCES, 2022, 54 (02) : 150 - 162
  • [38] Application of structural equation models to construct genetic networks using differentially expressed genes and single-nucleotide polymorphisms
    Seungmook Lee
    Mina Jhun
    Eun-Kyung Lee
    Taesung Park
    BMC Proceedings, 1 (Suppl 1)
  • [39] SELPLG and SELP single-nucleotide polymorphisms in multiple sclerosis
    Fenoglio, C
    Galimberti, D
    Ban, M
    Maranian, M
    Scalabrini, D
    Venturelli, E
    Piccio, L
    De Riz, M
    Yeo, TW
    Goris, A
    Gray, J
    Bresolin, N
    Scarpini, E
    Compston, A
    Sawcer, S
    NEUROSCIENCE LETTERS, 2006, 394 (02) : 92 - 96
  • [40] Detection of functional single-nucleotide polymorphisms that affect apoptosis
    Harris, SL
    Gil, G
    Robins, H
    Hu, WW
    Hirshfield, K
    Bond, E
    Bond, G
    Levine, AJ
    PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2005, 102 (45) : 16297 - 16302