Accounting for bias due to outcome data missing not at random: comparison and illustration of two approaches to probabilistic bias analysis: a simulation study

被引:0
|
作者
Kawabata, Emily [1 ,2 ]
Major-Smith, Daniel [1 ,2 ]
Clayton, Gemma L. [1 ,2 ]
Shapland, Chin Yang [1 ,2 ]
Morris, Tim P. [3 ]
Carter, Alice R. [1 ,2 ]
Fernandez-Sanles, Alba [4 ]
Borges, Maria Carolina [1 ,2 ]
Tilling, Kate [1 ,2 ]
Griffith, Gareth J. [1 ,2 ]
Millard, Louise A. C. [1 ,2 ]
Smith, George Davey [1 ,2 ]
Lawlor, Deborah A. [1 ,2 ]
Hughes, Rachael A. [1 ,2 ]
机构
[1] Univ Bristol, MRC Integrat Epidemiol Unit, Bristol, England
[2] Univ Bristol, Bristol Med Sch, Populat Hlth Sci, Bristol, England
[3] UCL, MRC Clin Trials Unit, London, England
[4] UCL, MRC Unit Lifelong Hlth & Ageing, London, England
基金
英国惠康基金; 英国医学研究理事会;
关键词
Bayesian bias analysis; Inverse probability weighting; Missing not at random; Monte Carlo bias analysis; Multiple imputation; Probabilistic bias analysis; Sensitivity analysis; UK Biobank; FULLY CONDITIONAL SPECIFICATION; PATTERN-MIXTURE ANALYSIS; MULTIPLE IMPUTATION; SELECTION BIAS; FRAMEWORK; MODELS;
D O I
10.1186/s12874-024-02382-4
中图分类号
R19 [保健组织与事业(卫生事业管理)];
学科分类号
摘要
BackgroundBias from data missing not at random (MNAR) is a persistent concern in health-related research. A bias analysis quantitatively assesses how conclusions change under different assumptions about missingness using bias parameters that govern the magnitude and direction of the bias. Probabilistic bias analysis specifies a prior distribution for these parameters, explicitly incorporating available information and uncertainty about their true values. A Bayesian bias analysis combines the prior distribution with the data's likelihood function whilst a Monte Carlo bias analysis samples the bias parameters directly from the prior distribution. No study has compared a Monte Carlo bias analysis to a Bayesian bias analysis in the context of MNAR missingness.MethodsWe illustrate an accessible probabilistic bias analysis using the Monte Carlo bias analysis approach and a well-known imputation method. We designed a simulation study based on a motivating example from the UK Biobank study, where a large proportion of the outcome was missing and missingness was suspected to be MNAR. We compared the performance of our Monte Carlo bias analysis to a principled Bayesian bias analysis, complete case analysis (CCA) and multiple imputation (MI) assuming missing at random.ResultsAs expected, given the simulation study design, CCA and MI estimates were substantially biased, with 95% confidence interval coverages of 7-48%. Including auxiliary variables (i.e., variables not included in the substantive analysis that are predictive of missingness and the missing data) in MI's imputation model amplified the bias due to assuming missing at random. With reasonably accurate and precise information about the bias parameter, the Monte Carlo bias analysis performed as well as the Bayesian bias analysis. However, when very limited information was provided about the bias parameter, only the Bayesian bias analysis was able to eliminate most of the bias due to MNAR whilst the Monte Carlo bias analysis performed no better than the CCA and MI.ConclusionThe Monte Carlo bias analysis we describe is easy to implement in standard software and, in the setting we explored, is a viable alternative to a Bayesian bias analysis. We caution careful consideration of choice of auxiliary variables when applying imputation where data may be MNAR.
引用
收藏
页数:14
相关论文
共 43 条
  • [1] Attrition Bias Related to Missing Outcome Data: A Longitudinal Simulation Study
    Lewin, Antoine
    Brondeel, Ruben
    Benmarhnia, Tarik
    Thomas, Frederique
    Chaix, Basile
    EPIDEMIOLOGY, 2018, 29 (01) : 87 - 95
  • [2] ACCOUNTING FOR BIAS IN SURVIVAL ANALYSIS DUE TO MISSING COVARIATE VALUES
    GLYNN, RJ
    FIELD, TS
    AMERICAN JOURNAL OF EPIDEMIOLOGY, 1995, 141 (11) : S59 - S59
  • [3] Evaluation of bias and precision in methods of analysis for pragmatic trials with missing outcome data: a simulation study
    Royes Joseph
    Julius Sim
    Reuben Ogollah
    Martyn Lewis
    Trials, 14 (Suppl 1)
  • [4] Estimating the Bias in Meta Analysis Estimates for Continuous Data With Non-Random Missing Study Variance
    Idris, Nik Ruzni Nik
    MATEMATIKA, 2011, 27 (02) : 121 - 128
  • [5] plasmode simulation to address confounding bias due to missing data in a large electronic health records dataset
    Puzhko, Svetlana
    Bartlett, Gillian
    Schuster, Tibor
    PHARMACOEPIDEMIOLOGY AND DRUG SAFETY, 2020, 29 : 388 - 388
  • [6] TWO APPROACHES TO EVALUATE MISSING CLINICAL OUTCOME ASSESSMENT RESPONSES: A SIMULATION STUDY
    Qin, S.
    Ma, J.
    Nelson, L.
    VALUE IN HEALTH, 2019, 22 : S319 - S319
  • [7] Bias in regression coefficient estimates when assumptions for handling missing data are violated: a simulation study
    van Kuijk, Sander M. J.
    Viechtbauer, Wolfgang
    Peeters, Louis L.
    Smits, Luc
    EPIDEMIOLOGY BIOSTATISTICS AND PUBLIC HEALTH, 2016, 13 (01)
  • [8] Bias of complete-case analysis of williams square crossover designs when data are missing not at random
    Sofia Bazakou
    Robin Henderson
    Linda Sharples
    John Matthews
    Trials, 16
  • [9] Bias of complete-case analysis of williams square crossover designs when data are missing not at random
    Bazakou, Sofia
    Henderson, Robin
    Sharples, Linda
    Matthews, John
    TRIALS, 2015, 16
  • [10] The M-Value: A Simple Sensitivity Analysis for Bias Due to Missing Data in Treatment Effect Estimates
    Mathur, Maya B.
    AMERICAN JOURNAL OF EPIDEMIOLOGY, 2023, 192 (04) : 612 - 620