Bias from the use of generalized estimating equations to analyze incomplete longitudinal binary data

被引:5
|
作者
Copas, Andrew J. [1 ,2 ]
Seaman, Shaun R. [3 ]
机构
[1] MRC, Clin Trials Unit, London NW1 2DA, England
[2] UCL, London NW1 2DA, England
[3] MRC, Biostat Unit, Inst Publ Hlth, Cambridge CB2 0SR, England
关键词
binary data; generalized estimating equations; missing data; missing at random; repeated measures; MISSING DATA; REPEATED OUTCOMES; DROP-OUTS; SEMIPARAMETRIC REGRESSION; LINEAR-MODELS; ASSOCIATION; NONRESPONSE; RESPONSES; TRIAL; RATES;
D O I
10.1080/02664760902939604
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
Patient dropout is a common problem in studies that collect repeated binary measurements. Generalized estimating equations (GEE) are often used to analyze such data. The dropout mechanism may be plausibly missing at random (MAR), i.e. unrelated to future measurements given covariates and past measurements. In this case, various authors have recommended weighted GEE with weights based on an assumed dropout model, or an imputation approach, or a doubly robust approach based on weighting and imputation. These approaches provide asymptotically unbiased inference, provided the dropout or imputation model (as appropriate) is correctly specified. Other authors have suggested that, provided the working correlation structure is correctly specified, GEE using an improved estimator of the correlation parameters ('modified GEE') show minimal bias. These modified GEE have not been thoroughly examined. In this paper, we study the asymptotic bias under MAR dropout of these modified GEE, the standard GEE, and also GEE using the true correlation. We demonstrate that all three methods are biased in general. The modified GEE may be preferred to the standard GEE and are subject to only minimal bias in many MAR scenarios but in others are substantially biased. Hence, we recommend the modified GEE be used with caution.
引用
收藏
页码:911 / 922
页数:12
相关论文
共 50 条