A Likelihood-Based Approach for Missing Genotype Data

被引：1

作者：

D'Angelo, Gina M. ^{[1
]}

Kamboh, M. Ilyas ^{[3
,4
]}

Feingold, Eleanor ^{[2
]}

机构：

[1] Washington Univ, Div Biostat, Sch Med, St Louis, MO 63110 USA

[2] Univ Pittsburgh, Grad Sch Publ Hlth, Dept Biostat, Pittsburgh, PA 15261 USA

[3] Univ Pittsburgh, Grad Sch Publ Hlth, Dept Human Genet, Pittsburgh, PA 15261 USA

[4] Univ Pittsburgh, Alzheimers Dis Res Ctr, Sch Med, Pittsburgh, PA 15261 USA

来源：

HUMAN HEREDITY | 2010年 / 69卷 / 03期

关键词：

Missing data; SNPs; Association studies; Logistic regression; Likelihood-based methods; PARAMETRIC REGRESSION-MODELS; GENOME-WIDE ASSOCIATION; LATENT VARIABLE MODELS; MAXIMUM-LIKELIHOOD; MULTIPLE IMPUTATION; COVARIATE DATA; POLYTOMOUS DATA; POLYMORPHISMS; INFERENCE; EQUATION;

D O I：

10.1159/000273732

中图分类号：

Q3 [遗传学];

学科分类号：

071007 ; 090102 ;

摘要：

Missing genotype data in a candidate gene association study can make it difficult to model the effects of multiple genetic variants simultaneously. In particular, when regression models are used to model phenotype as a function of SNP genotypes in several different genes, the most common approach is a complete case analysis, in which only individuals with no missing genotypes are included. But this can lead to substantial reduction in sample size and thus potential bias and loss in efficiency. A number of other methods for handling missing data are applicable, but have rarely been used in this context. The purpose of this paper is to describe how several standard methods for handling missing data can be applied or adapted to this problem, and to compare their performance using a simulation study. We demonstrate these techniques using an Alzheimer's disease association study. We show that the expectation-maximization algorithm and multiple imputation with a bootstrapped expectation-maximization sampling algorithm have the best properties of all the estimators studied. Copyright (C) 2010 S. Karger AG, Basel

引用

页码：171 / 183

页数：13

共 50 条

[1] Likelihood-based association analysis for nuclear families and unrelated subjects with missing genotype data
Dudbridge, Frank
HUMAN HEREDITY, 2008, 66 (02) : 87 - 98
[2] A likelihood-based approach for multivariate one-sided tests with missing data
Zhou, Guohai
Wu, Lang
Brant, Rollin
Ansermino, J. Mark
JOURNAL OF APPLIED STATISTICS, 2017, 44 (11) : 2000 - 2016
[3] Likelihood-based missing data analysis in crossover trials
Pareek, Savita
Das, Kalyan
Mukhopadhyay, Siuli
BRAZILIAN JOURNAL OF PROBABILITY AND STATISTICS, 2023, 37 (02) : 329 - 350
[4] Likelihood-based Inference with Missing Data Under Missing-at-Random
Yang, Shu
Kim, Jae Kwang
SCANDINAVIAN JOURNAL OF STATISTICS, 2016, 43 (02) : 436 - 454
[5] Likelihood-based inference for spatiotemporal data with censored and missing responses
Valeriano, Katherine A. L.
Lachos, Victor H.
Prates, Marcos O.
Matos, Larissa A.
ENVIRONMETRICS, 2021, 32 (03)
[6] Robust likelihood-based analysis of multivariate data with missing values
Little, R
An, HG
STATISTICA SINICA, 2004, 14 (03) : 949 - 968
[7] Empirical likelihood-based inference in linear models with missing data
Wang, QH
Rao, JNK
SCANDINAVIAN JOURNAL OF STATISTICS, 2002, 29 (03) : 563 - 576
[8] Empirical Likelihood-based Inferences in Varying Coefficient Models with Missing Data
Xiao-hui LIU
ActaMathematicaeApplicataeSinica, 2015, 31 (03) : 823 - 840
[9] Misleading results of likelihood-based phylogenetic analyses in the presence of missing data
Simmons, Mark P.
CLADISTICS, 2012, 28 (02) : 208 - 222
[10] Empirical likelihood-based inferences in varying coefficient models with missing data
Xiao-hui Liu
Acta Mathematicae Applicatae Sinica, English Series, 2015, 31 : 823 - 840

← 1 2 3 4 5 →