Introduction to computational causal inference using reproducible Stata, R, and Python']Python code: A tutorial

被引:33
|
作者
Smith, Matthew J. [1 ]
Mansournia, Mohammad A. [2 ]
Maringe, Camille [1 ]
Zivich, Paul N. [3 ,4 ]
Cole, Stephen R. [3 ]
Leyrat, Clemence [1 ]
Belot, Aurelien [1 ]
Rachet, Bernard [1 ]
Luque-Fernandez, Miguel A. [1 ,5 ,6 ]
机构
[1] London Sch Hyg & Trop Med, Dept Noncommunicable Dis Epidemiol, Inequal Canc Outcomes Network, Keppel St, London WC1E 7HT, England
[2] Univ Tehran Med Sci, Dept Epidemiol & Biostat, Tehran, Iran
[3] Univ N Carolina, Dept Epidemiol, Chapel Hill, NC 27515 USA
[4] Univ N Carolina, Carolina Populat Ctr, Chapel Hill, NC 27515 USA
[5] Univ Granada, Andalusian Sch Publ Hlth, Inst Invest Biosanitaria Granada Ibs GRANADA, Noncommunicable Dis & Canc Epidemiol Grp, Granada, Spain
[6] Biomed Network Res Ctr Epidemiol & Publ Hlth CIBE, Madrid, Spain
关键词
causal inference; double-robust methods; g-formula; G-methods; inverse probability weighting; machine learning; propensity score; regression adjustment; targeted maximum likelihood estimation; MARGINAL STRUCTURAL MODELS; PROPENSITY SCORE; MORTALITY;
D O I
10.1002/sim.9234
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
The main purpose of many medical studies is to estimate the effects of a treatment or exposure on an outcome. However, it is not always possible to randomize the study participants to a particular treatment, therefore observational study designs may be used. There are major challenges with observational studies; one of which is confounding. Controlling for confounding is commonly performed by direct adjustment of measured confounders; although, sometimes this approach is suboptimal due to modeling assumptions and misspecification. Recent advances in the field of causal inference have dealt with confounding by building on classical standardization methods. However, these recent advances have progressed quickly with a relative paucity of computational-oriented applied tutorials contributing to some confusion in the use of these methods among applied researchers. In this tutorial, we show the computational implementation of different causal inference estimators from a historical perspective where new estimators were developed to overcome the limitations of the previous estimators (ie, nonparametric and parametric g-formula, inverse probability weighting, double-robust, and data-adaptive estimators). We illustrate the implementation of different methods using an empirical example from the Connors study based on intensive care medicine, and most importantly, we provide reproducible and commented code in Stata, R, and Python for researchers to adapt in their own observational study. The code can be accessed at .
引用
收藏
页码:407 / 432
页数:26
相关论文
共 50 条
  • [41] libNeuroML and PyLEMS: using Python']Python to combine procedural and declarative modeling approaches in computational neuroscience
    Vella, Michael
    Cannon, Robert C.
    Crook, Sharon
    Davison, Andrew P.
    Ganapathy, Gautham
    Robinson, Hugh P. C.
    Silver, R. Angus
    Gleeson, Padraig
    FRONTIERS IN NEUROINFORMATICS, 2014, 8
  • [42] Improving computational efficiency for implementing a sound propagation simulation environment using Python']Python and GPU
    Bucurica, Mihai
    Dogaru, Ioana
    Dogaru, Radu
    2016 8TH INTERNATIONAL CONFERENCE ON ELECTRONICS, COMPUTERS AND ARTIFICIAL INTELLIGENCE (ECAI), 2016,
  • [43] Identify the Influential User in Online Social Networks using R, Hadoop and Python']Python
    Kumar, K. Sailaja
    Geetha, D. Evangelin
    Nagesh, N.
    Manoj, T. V. Sai
    2016 INTERNATIONAL CONFERENCE ON CIRCUITS, CONTROLS, COMMUNICATIONS AND COMPUTING (I4C), 2016,
  • [44] COMPUTATIONAL PERFORMANCE OF IB-LBM BASED VIV SIMULATION USING PYTHON']PYTHON WITH JAX
    Zhu, Haiming
    Du, Zunfeng
    Yang, Yuan
    Hani, Muxuan
    PROCEEDINGS OF ASME 2024 43RD INTERNATIONAL CONFERENCE ON OCEAN, OFFSHORE AND ARCTIC ENGINEERING, OMAE2024, VOL 6, 2024,
  • [45] A convenient analytic method for gel quantification using ImageJ paired with Python']Python or R
    Tomlinson, Cassidy
    Rajasekaran, Ashwini
    Brochu-Gaudreau, Karine
    Dubois, Claire
    Farmilo, A. James
    Gris, Pavel
    Khatiz, Ariane
    Matthews, Amanda
    Piltonen, Marjo
    Amrani, Abdelaziz
    Gris, Denis
    PLOS ONE, 2024, 19 (11):
  • [46] Natural Language to Python Source Code using Transformers
    Shah, Meet
    Shenoy, Rajat
    Shankarmani, Radha
    2021 International Conference on Intelligent Technologies, CONIT 2021, 2021,
  • [47] Conditional density estimation tools in python']python and R with applications to photometric redshifts and likelihood-free cosmological inference
    Dalmasso, N.
    Pospisil, T.
    Lee, A. B.
    Izbicki, R.
    Freeman, P. E.
    Malz, A. I.
    ASTRONOMY AND COMPUTING, 2020, 30
  • [48] Short -Term Electric Load Forecasting Using SVR Implementing LibSVM Package and Python']Python Code
    Baghel, Manoj
    Ghosh, Abir
    Singh, Navneet Kumar
    Singh, Asheesh Kumar
    2016 IEEE UTTAR PRADESH SECTION INTERNATIONAL CONFERENCE ON ELECTRICAL, COMPUTER AND ELECTRONICS ENGINEERING (UPCON), 2016, : 485 - 489
  • [49] Introduction to Reproducible Geospatial Analysis and Figures in R: A Tutorial Article
    Maesen, Philippe
    Salingros, Edouard
    DATA, 2024, 9 (04)
  • [50] Poster: Improving Spectrum Based Fault Localization For Python']Python Programs Using Weighted Code Elements
    Sarhan, Qusay Idrees
    Beszedes, Arpad
    2023 IEEE CONFERENCE ON SOFTWARE TESTING, VERIFICATION AND VALIDATION, ICST, 2023, : 478 - 481