Data-Driven Covariate Selection for Confounding Adjustment by Focusing on the Stability of the Effect Estimator

被引:4
|
作者
Loh, Wen Wei [1 ,2 ,4 ]
Ren, Dongning [3 ]
机构
[1] Emory Univ, Dept Quantitat Theory & Methods, Atlanta, GA USA
[2] Univ Ghent, Dept Data Anal, Ghent, Belgium
[3] Tilburg Univ, Dept Social Psychol, Tilburg, Netherlands
[4] Emory Univ, Dept Quantitat Theory & Methods, 36 Eagle Row, Atlanta, GA 30322 USA
关键词
causal inference; double selection; observational studies; propensity scores; strong ignorability; DOUBLY ROBUST ESTIMATION; PROPENSITY SCORE; CAUSAL INFERENCE; VARIABLE SELECTION; SENSITIVITY-ANALYSIS; MODEL-SELECTION; BIAS REDUCTION; REGRESSION; LASSO; MISSPECIFICATION;
D O I
10.1037/met0000564
中图分类号
B84 [心理学];
学科分类号
04 ; 0402 ;
摘要
Valid inference of cause-and-effect relations in observational studies necessitates adjusting for common causes of the focal predictor (i.e., treatment) and the outcome. When such common causes, henceforth termed confounders, remain unadjusted for, they generate spurious correlations that lead to biased causal effect estimates. But routine adjustment for all available covariates, when only a subset are truly confounders, is known to yield potentially inefficient and unstable estimators. In this article, we introduce a data-driven confounder selection strategy that focuses on stable estimation of the treatment effect. The approach exploits the causal knowledge that after adjusting for confounders to eliminate all confounding biases, adding any remaining non-confounding covariates associated with only treatment or outcome, but not both, should not systematically change the effect estimator. The strategy proceeds in two steps. First, we prioritize covariates for adjustment by probing how strongly each covariate is associated with treatment and outcome. Next, we gauge the stability of the effect estimator by evaluating its trajectory adjusting for different covariate subsets. The smallest subset that yields a stable effect estimate is then selected. Thus, the strategy offers direct insight into the (in)sensitivity of the effect estimator to the chosen covariates for adjustment. The ability to correctly select confounders and yield valid causal inferences following data-driven covariate selection is evaluated empirically using extensive simulation studies. Furthermore, we compare the introduced method empirically with routine variable selection methods. Finally, we demonstrate the procedure using two publicly available real-world datasets. A step-by-step practical guide with user-friendly R functions is included.
引用
收藏
页码:947 / 966
页数:20
相关论文
共 50 条
  • [1] A Simple Data-Driven Estimator for the Semiparametric Sample Selection Model
    Escanciano, Juan Carlos
    Zhu, Lin
    ECONOMETRIC REVIEWS, 2015, 34 (6-10) : 733 - 761
  • [2] A data-driven bandwidth selection method for the smoothed maximum score estimator
    Chen, Xirong
    Gao, Wenzheng
    Li, Zheng
    ECONOMICS LETTERS, 2018, 170 : 24 - 26
  • [3] A data-driven kernel estimator of the density function
    Karczewski, Maciej
    Michalski, Andrzej
    JOURNAL OF STATISTICAL COMPUTATION AND SIMULATION, 2022, 92 (17) : 3529 - 3541
  • [4] Data-driven Site Selection
    Schuh G.
    Gützlaff A.
    Adlon T.
    Schupp S.
    Endrikat M.
    Schlosser T.X.
    ZWF Zeitschrift fuer Wirtschaftlichen Fabrikbetrieb, 2022, 117 (05): : 258 - 263
  • [5] A Data-Driven Approach to SAR Data-Focusing
    Guaragnella, Cataldo
    D'Orazio, Tiziana
    SENSORS, 2019, 19 (07):
  • [6] Variable Selection for Confounding Adjustment in High-dimensional Covariate Spaces When Analyzing Healthcare Databases
    Schneeweiss, Sebastian
    Eddings, Wesley
    Glynn, Robert J.
    Patorno, Elisabetta
    Rassen, Jeremy
    Franklin, Jessica M.
    EPIDEMIOLOGY, 2017, 28 (02) : 237 - 248
  • [7] A Data-Driven Wavelet Estimator For Deconvolution Density Estimations
    Cao, Kaikai
    Zeng, Xiaochen
    RESULTS IN MATHEMATICS, 2023, 78 (04)
  • [8] A Data-Driven Wavelet Estimator For Deconvolution Density Estimations
    Kaikai Cao
    Xiaochen Zeng
    Results in Mathematics, 2023, 78
  • [9] Causal Effect Identification by Adjustment under Confounding and Selection Biases
    Correa, Juan D.
    Bareinboim, Elias
    THIRTY-FIRST AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2017, : 3740 - 3746
  • [10] Data-driven Exemplar Model Selection
    Misra, Ishan
    Shrivastava, Abhinav
    Hebert, Martial
    2014 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2014, : 339 - 346