Penalized regression procedures for variable selection in the potential outcomes framework

被引:28
|
作者
Ghosh, Debashis [1 ]
Zhu, Yeying [2 ]
Coffman, Donna L. [3 ]
机构
[1] Colorado Sch Publ Hlth, Dept Biostat & Informat, Aurora, CO 80045 USA
[2] Univ Waterloo, Dept Stat & Actuarial Sci, Waterloo, ON N2L 3G1, Canada
[3] Penn State Univ, Methodol Ctr, University Pk, PA 16802 USA
基金
美国国家卫生研究院;
关键词
average causal effect; counterfactual; imputed data; L-1; penalty; treatment heterogeneity; MARGINAL STRUCTURAL MODELS; PROPENSITY SCORE; CAUSAL INFERENCE; LINEAR-MODELS; IMPUTATION;
D O I
10.1002/sim.6433
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
A recent topic of much interest in causal inference is model selection. In this article, we describe a framework in which to consider penalized regression approaches to variable selection for causal effects. The framework leads to a simple impute, then select' class of procedures that is agnostic to the type of imputation algorithm as well as penalized regression used. It also clarifies how model selection involves a multivariate regression model for causal inference problems and that these methods can be applied for identifying subgroups in which treatment effects are homogeneous. Analogies and links with the literature on machine learning methods, missing data, and imputation are drawn. A difference least absolute shrinkage and selection operator algorithm is defined, along with its multiple imputation analogs. The procedures are illustrated using a well-known right-heart catheterization dataset. Copyright (c) 2015 John Wiley & Sons, Ltd.
引用
收藏
页码:1645 / 1658
页数:14
相关论文
共 50 条
  • [41] Variable selection using penalized empirical likelihood
    Ren YunWen
    Zhang XinSheng
    SCIENCE CHINA-MATHEMATICS, 2011, 54 (09) : 1829 - 1845
  • [42] Penalized empirical likelihood based variable selection for partially linear quantile regression models with missing responses
    Tang, Xinrong
    Zhao, Peixin
    HACETTEPE JOURNAL OF MATHEMATICS AND STATISTICS, 2018, 47 (03): : 721 - 739
  • [43] A Penalized h-Likelihood Variable Selection Algorithm for Generalized Linear Regression Models with Random Effects
    Xie, Yanxi
    Li, Yuewen
    Xia, Zhijie
    Yan, Ruixia
    Luan, Dongqing
    COMPLEXITY, 2020, 2020
  • [44] Variable Selection via SCAD-Penalized Quantile Regression for High-Dimensional Count Data
    Khan, Dost Muhammad
    Yaqoob, Anum
    Iqbal, Nadeem
    Wahid, Abdul
    Khalil, Umair
    Khan, Mukhtaj
    Abd Rahman, Mohd Amiruddin
    Mustafa, Mohd Shafie
    Khan, Zardad
    IEEE ACCESS, 2019, 7 : 153205 - 153216
  • [45] Unified distributed robust regression and variable selection framework for massive data
    Wang, Kangning
    EXPERT SYSTEMS WITH APPLICATIONS, 2021, 186
  • [46] Penalized likelihood and Bayesian function selection in regression models
    Fabian Scheipl
    Thomas Kneib
    Ludwig Fahrmeir
    AStA Advances in Statistical Analysis, 2013, 97 : 349 - 385
  • [47] Assessing Tuning Parameter Selection Variability in Penalized Regression
    Hu, Wenhao
    Laber, Eric B.
    Barker, Clay
    Stefanski, Leonard A.
    TECHNOMETRICS, 2019, 61 (02) : 154 - 164
  • [48] Penalized likelihood and Bayesian function selection in regression models
    Scheipl, Fabian
    Kneib, Thomas
    Fahrmeir, Ludwig
    ASTA-ADVANCES IN STATISTICAL ANALYSIS, 2013, 97 (04) : 349 - 385
  • [49] PENALIZED MAXIMUM LIKELIHOOD ESTIMATION AND VARIABLE SELECTION IN GEOSTATISTICS
    Chu, Tingjin
    Zhu, Jun
    Wang, Haonan
    ANNALS OF STATISTICS, 2011, 39 (05): : 2607 - 2625
  • [50] Penalized methods for bi-level variable selection
    Breheny, Patrick
    Huang, Jian
    STATISTICS AND ITS INTERFACE, 2009, 2 (03) : 369 - 380