Data-Driven Covariate Selection for Confounding Adjustment by Focusing on the Stability of the Effect Estimator

被引:4
|
作者
Loh, Wen Wei [1 ,2 ,4 ]
Ren, Dongning [3 ]
机构
[1] Emory Univ, Dept Quantitat Theory & Methods, Atlanta, GA USA
[2] Univ Ghent, Dept Data Anal, Ghent, Belgium
[3] Tilburg Univ, Dept Social Psychol, Tilburg, Netherlands
[4] Emory Univ, Dept Quantitat Theory & Methods, 36 Eagle Row, Atlanta, GA 30322 USA
关键词
causal inference; double selection; observational studies; propensity scores; strong ignorability; DOUBLY ROBUST ESTIMATION; PROPENSITY SCORE; CAUSAL INFERENCE; VARIABLE SELECTION; SENSITIVITY-ANALYSIS; MODEL-SELECTION; BIAS REDUCTION; REGRESSION; LASSO; MISSPECIFICATION;
D O I
10.1037/met0000564
中图分类号
B84 [心理学];
学科分类号
04 ; 0402 ;
摘要
Valid inference of cause-and-effect relations in observational studies necessitates adjusting for common causes of the focal predictor (i.e., treatment) and the outcome. When such common causes, henceforth termed confounders, remain unadjusted for, they generate spurious correlations that lead to biased causal effect estimates. But routine adjustment for all available covariates, when only a subset are truly confounders, is known to yield potentially inefficient and unstable estimators. In this article, we introduce a data-driven confounder selection strategy that focuses on stable estimation of the treatment effect. The approach exploits the causal knowledge that after adjusting for confounders to eliminate all confounding biases, adding any remaining non-confounding covariates associated with only treatment or outcome, but not both, should not systematically change the effect estimator. The strategy proceeds in two steps. First, we prioritize covariates for adjustment by probing how strongly each covariate is associated with treatment and outcome. Next, we gauge the stability of the effect estimator by evaluating its trajectory adjusting for different covariate subsets. The smallest subset that yields a stable effect estimate is then selected. Thus, the strategy offers direct insight into the (in)sensitivity of the effect estimator to the chosen covariates for adjustment. The ability to correctly select confounders and yield valid causal inferences following data-driven covariate selection is evaluated empirically using extensive simulation studies. Furthermore, we compare the introduced method empirically with routine variable selection methods. Finally, we demonstrate the procedure using two publicly available real-world datasets. A step-by-step practical guide with user-friendly R functions is included.
引用
收藏
页码:947 / 966
页数:20
相关论文
共 50 条
  • [41] Data-driven local bandwidth selection for additive models with missing data
    Raya-Miranda, R.
    Martinez-Miranda, M. D.
    APPLIED MATHEMATICS AND COMPUTATION, 2011, 217 (24) : 10328 - 10342
  • [42] Data-Driven Colormap Adjustment for Exploring Spatial Variations in Scalar Fields
    Zeng, Qiong
    Zhao, Yongwei
    Wang, Yinqiao
    Zhang, Jian
    Cao, Yi
    Tu, Changhe
    Viola, Ivan
    Wang, Yunhai
    IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, 2022, 28 (12) : 4902 - 4917
  • [43] Stability analysis of data-driven local model networks
    Hametner, Christoph
    Mayr, Christian H.
    Kozek, Martin
    Jakubek, Stefan
    MATHEMATICAL AND COMPUTER MODELLING OF DYNAMICAL SYSTEMS, 2014, 20 (03) : 224 - 247
  • [44] Optimal direct data-driven control with stability guarantees
    Selvi, Daniela
    Piga, Dario
    Battistelli, Giorgio
    Bemporad, Alberto
    EUROPEAN JOURNAL OF CONTROL, 2021, 59 : 175 - 187
  • [45] A data-driven interpretation of the stability of organic molecular crystals
    Cersonsky, Rose K.
    Pakhnova, Maria
    Engel, Edgar A.
    Ceriotti, Michele
    CHEMICAL SCIENCE, 2023, 14 (05) : 1272 - 1285
  • [46] Stability in data-driven MPC: an inherent robustness perspective
    Berberich, Julian
    Koehler, Johannes
    Mueller, Matthias A.
    Allgower, Frank
    2022 IEEE 61ST CONFERENCE ON DECISION AND CONTROL (CDC), 2022, : 1105 - 1110
  • [47] Data-driven stability analysis of switched affine systems
    Della Rossa, Matteo
    Wang, Zheming
    Egidio, Lucas N.
    Jungers, Raphael M.
    2021 60TH IEEE CONFERENCE ON DECISION AND CONTROL (CDC), 2021, : 3204 - 3209
  • [48] Automatic data-driven spectral analysis based on a multi-estimator approach
    Martin, Nadine
    Mailhes, Corinne
    SIGNAL PROCESSING, 2018, 146 : 112 - 125
  • [49] Data-Driven Deep Learning to Design Pilot and Channel Estimator for Massive MIMO
    Ma, Xisuo
    Gao, Zhen
    IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2020, 69 (05) : 5677 - 5682
  • [50] Data-driven stability margin for linear multivariable systems
    Ren, Jinrui
    Quan, Quan
    Xu, Bin
    Wang, Shuai
    Cai, Kai-Yuan
    INTERNATIONAL JOURNAL OF ROBUST AND NONLINEAR CONTROL, 2024, 34 (13) : 8844 - 8862