Correcting hazard ratio estimates for outcome misclassification using multiple imputation with internal validation data

Cited by: 2
Authors
Ni, Jiayi [1, 2]
Leong, Aaron [1]
Dasgupta, Kaberi [1, 3]
Rahme, Elham [1, 3]
Affiliations
[1] McGill Univ, Res Inst, Ctr Hlth, Montreal, PQ, Canada
[2] McGill Univ, Dept Epidemiol Biostat & Occupat Hlth, Montreal, PQ, Canada
[3] McGill Univ, Dept Med, Div Clin Epidemiol, Montreal, PQ, Canada
Funding
Canadian Institutes of Health Research
Keywords
hazard ratio; misclassification; internal validation; multiple imputation; diabetes; statin; LOGISTIC-REGRESSION; MAXIMUM-LIKELIHOOD; PUBLIC-HEALTH; DIABETES RISK; PREVALENCE; SCORE; BIAS; CLASSIFICATION; INDIVIDUALS; DIAGNOSIS;
DOI
10.1002/pds.4223
Chinese Library Classification number
R1 [Preventive medicine, hygiene]
Discipline classification code
1004; 120402
Abstract
Objective: Outcome misclassification may occur in observational studies using administrative databases. We evaluated a two-step multiple imputation approach based on complementary internal validation data obtained from two subsamples of study participants to reduce bias in hazard ratio (HR) estimates in Cox regressions.

Methods: We illustrated this approach using data from a surveyed sample of 6247 individuals in a study of statin-diabetes association in Quebec. We corrected diabetes status and onset assessed from health administrative data against self-reported diabetes and/or elevated fasting blood glucose (FBG) assessed in subsamples. The association between statin use and new onset diabetes was evaluated using administrative data and the corrected data. By simulation, we assessed the performance of this method varying the true HR, sensitivity, specificity, and the size of validation subsamples.

Results: The adjusted HR of new onset diabetes among statin users versus non-users was 1.61 (95% confidence interval: 1.09-2.38) using administrative data only, 1.49 (0.95-2.34) when diabetes status and onset were corrected based on self-report and undiagnosed diabetes (FBG ≥ 7 mmol/L), and 1.36 (0.92-2.01) when corrected for self-report and undiagnosed diabetes/impaired FBG (≥ 6 mmol/L). In simulations, the multiple imputation approach yielded less biased HR estimates and appropriate coverage for both non-differential and differential misclassification. Large variations in the corrected HR estimates were observed using validation subsamples with low participation proportion. The bias correction was sometimes outweighed by the uncertainty introduced by the unknown time of event occurrence.

Conclusion: Multiple imputation is useful to correct for outcome misclassification in time-to-event analyses if complementary validation data are available from subsamples.

Copyright (C) 2017 John Wiley & Sons, Ltd.
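The abstract does not say how the HR estimates from the individual imputed datasets were combined. As a minimal sketch, assuming the standard Rubin's rules for multiple imputation, the pooling step might look like the following. The function name `pool_log_hazard_ratios` and the input numbers are hypothetical and are not taken from the study. The interval uses a normal approximation rather than the t reference distribution that Rubin's rules prescribe for a small number of imputations.

```python
import math
from statistics import mean, variance


def pool_log_hazard_ratios(log_hrs, std_errors):
    """Pool per-imputation log(HR) estimates with Rubin's rules (illustrative helper)."""
    m = len(log_hrs)
    if m < 2 or m != len(std_errors):
        raise ValueError("need at least two imputations with matching standard errors")

    pooled = mean(log_hrs)                         # pooled point estimate on the log scale
    within = mean([se ** 2 for se in std_errors])  # average within-imputation variance
    between = variance(log_hrs)                    # between-imputation variance (m - 1 denominator)
    total = within + (1 + 1 / m) * between         # Rubin's total variance
    se = math.sqrt(total)

    z = 1.959963984540054  # 97.5th percentile of the standard normal (approximation)
    return {
        "hr": math.exp(pooled),
        "ci_low": math.exp(pooled - z * se),
        "ci_high": math.exp(pooled + z * se),
        "se_log_hr": se,
    }


# Hypothetical log(HR) estimates and standard errors from three imputed datasets.
example = pool_log_hazard_ratios([0.30, 0.40, 0.50], [0.20, 0.20, 0.20])
print(example)
```

The between-imputation term is what widens the interval when imputed event status or event time varies a lot across datasets, which is consistent with the abstract's remark that uncertainty about the time of event occurrence can outweigh the bias correction.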
Pages: 925-934
Number of pages: 10