The Role of Prediction Modeling in Propensity Score Estimation: An Evaluation of Logistic Regression, bCART, and the Covariate-Balancing Propensity Score

被引:80
|
作者
Wyss, Richard [1 ]
Ellis, Alan R. [2 ]
Brookhart, M. Alan [1 ]
Girman, Cynthia J. [1 ,3 ]
Funk, Michele Jonsson [1 ]
LoCasale, Robert [4 ]
Stuermer, Til [1 ]
机构
[1] Univ N Carolina, Gillings Sch Global Publ Hlth, Dept Epidemiol, Chapel Hill, NC 27599 USA
[2] Univ N Carolina, Cecil G Sheps Ctr Hlth Serv Res, Chapel Hill, NC 27599 USA
[3] Merck Sharp & Dohme Corp, Merck Res Labs, Ctr Observat & Real World Evidence, Data Analyt & Observat Methods, N Wales, PA USA
[4] Merck Sharp & Dohme Corp, Merck Res Labs, Dept Epidemiol, N Wales, PA USA
基金
美国医疗保健研究与质量局;
关键词
cardiovascular disease; covariate balance; diabetes; epidemiologic methods; propensity score; regression; simulation;
D O I
10.1093/aje/kwu181
中图分类号
R1 [预防医学、卫生学];
学科分类号
1004 ; 120402 ;
摘要
The covariate-balancing propensity score (CBPS) extends logistic regression to simultaneously optimize covariate balance and treatment prediction. Although the CBPS has been shown to perform well in certain settings, its performance has not been evaluated in settings specific to pharmacoepidemiology and large database research. In this study, we use both simulations and empirical data to compare the performance of the CBPS with logistic regression and boosted classification and regression trees. We simulated various degrees of model misspecification to evaluate the robustness of each propensity score (PS) estimation method. We then applied these methods to compare the effect of initiating glucagonlike peptide-1 agonists versus sulfonylureas on cardiovascular events and all-cause mortality in the US Medicare population in 2007-2009. In simulations, the CBPS was generally more robust in terms of balancing covariates and reducing bias compared with misspecified logistic PS models and boosted classification and regression trees. All PS estimation methods performed similarly in the empirical example. For settings common to pharmacoepidemiology, logistic regression with balance checks to assess model specification is a valid method for PS estimation, but it can require refitting multiple models until covariate balance is achieved. The CBPS is a promising method to improve the robustness of PS models.
引用
收藏
页码:645 / 655
页数:11
相关论文
共 50 条
  • [31] Using Super Learner Prediction Modeling to Improve High-dimensional Propensity Score Estimation
    Wyss, Richard
    Schneeweiss, Sebastian
    van der Laan, Mark
    Lendle, Samuel D.
    Ju, Cheng
    Franklin, Jessica M.
    EPIDEMIOLOGY, 2018, 29 (01) : 96 - 106
  • [32] Propensity score estimation with boosted regression for evaluating causal effects in observational studies
    McCaffrey, DF
    Ridgeway, G
    Morral, AR
    PSYCHOLOGICAL METHODS, 2004, 9 (04) : 403 - 425
  • [33] The Effect of Latent Binary Variables on the Uncertainty of the Prediction of a Dichotomous Outcome Using Logistic Regression Based Propensity Score Matching
    Szeker, Szabolcs
    Vathy-Fogarassy, Agnes
    HEALTH INFORMATICS MEETS EHEALTH: BIOMEDICAL MEETS EHEALTH - FROM SENSORS TO DECISIONS, 2018, 248 : 1 - 8
  • [34] On the role of the propensity score in efficient semiparametric estimation of average treatment effects
    Hahn, JY
    ECONOMETRICA, 1998, 66 (02) : 315 - 331
  • [35] EFFICACY OF PHOTOTHERAPY FOR JAUNDICED NEWBORNS: COMPARISON OF LOGISTIC REGRESSION, PROPENSITY SCORE AND INSTRUMENTAL VARIABLE ANALYSES
    Newman, T.
    McCulloch, C.
    Vittinghoff, E.
    AMERICAN JOURNAL OF EPIDEMIOLOGY, 2010, 171 : S107 - S107
  • [36] Estimators and confidence intervals for the marginal odds ratio using logistic regression and propensity score stratification
    Stampf, Susanne
    Graf, Erika
    Schmoor, Claudia
    Schumacher, Martin
    STATISTICS IN MEDICINE, 2010, 29 (7-8) : 760 - 769
  • [37] Logistic regression frequently outperformed propensity score methods especially for large datasets: a simulation study
    Wilkinson, Jack D.
    Mamas, Mamas A.
    Kontopantelis, Evangelos
    JOURNAL OF CLINICAL EPIDEMIOLOGY, 2022, 152 : 176 - 184
  • [38] Comparison of logistic regression versus propensity score when the number of events is low and there are multiple confounders
    Cepeda, MS
    Boston, R
    Farrar, JT
    Strom, BL
    AMERICAN JOURNAL OF EPIDEMIOLOGY, 2003, 158 (03) : 280 - 287
  • [39] Optimization-Based Stable Balancing Weights Versus Propensity Score Weighting for Samples With High Covariate Imbalance
    Wallace, Stuart R.
    Singh, Sachinkumar B.
    Blakney, Rebekah
    Rene, Lexi
    Johnston, Stephen S.
    PHARMACOEPIDEMIOLOGY AND DRUG SAFETY, 2024, 33 (07)
  • [40] Optimization-based stable balancing weights versus propensity score weighting for samples with high covariate imbalance
    Johnston, Stephen S.
    Singh, Sachin
    Blakney, Rebekah
    Rene, Lexi
    Wallace, Stuart
    PHARMACOEPIDEMIOLOGY AND DRUG SAFETY, 2023, 32 : 567 - 567