Combining Multiple Observational Data Sources to Estimate Causal Effects

被引:39
|
作者
Yang, Shu [1 ]
Ding, Peng [2 ]
机构
[1] North Carolina State Univ, Dept Stat, 2311 Stinson Dr Campus Box 8203, Raleigh, NC 27695 USA
[2] Univ Calif Berkeley, Dept Stat, Berkeley, CA 94720 USA
关键词
Calibration; Causal inference; Inverse probability weighting; Missing confounder; Two-phase sampling; PROPENSITY SCORE CALIBRATION; DOUBLY ROBUST ESTIMATION; LARGE-SAMPLE PROPERTIES; AUXILIARY INFORMATION; MISSING CONFOUNDERS; MATCHING ESTIMATORS; VALIDATION DATA; REGRESSION; INFERENCE; 2-PHASE;
D O I
10.1080/01621459.2019.1609973
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
The era of big data has witnessed an increasing availability of multiple data sources for statistical analyses. We consider estimation of causal effects combining big main data with unmeasured confounders and smaller validation data withon these confounders. Under the unconfoundedness assumption with completely observed confounders, the smaller validation data allow for constructing consistent estimators for causal effects, but the big main data can only give error-prone estimators in general. However, by leveraging the information in the big main data in a principled way, we can improve the estimation efficiencies yet preserve the consistencies of the initial estimators based solely on the validation data. Our framework applies to asymptotically normal estimators, including the commonly used regression imputation, weighting, and matching estimators, and does not require a correct specification of the model relating the unmeasured confounders to the observed variables. We also propose appropriate bootstrap procedures, which makes our method straightforward to implement using software routines for existing estimators.for this article are available online.
引用
收藏
页码:1540 / 1554
页数:15
相关论文
共 50 条
  • [21] Approaches for combining primary care electronic health record data from multiple sources: a systematic review of observational studies
    Dedman, Daniel
    Cabecinha, Melissa
    Williams, Rachael
    Evans, Stephen J. W.
    Bhaskaran, Krishnan
    Douglas, Ian J.
    BMJ OPEN, 2020, 10 (10):
  • [22] Bayesian Federated Estimation of Causal Effects from Observational Data
    Thanh Vinh Vo
    Lee, Young
    Trong Nghia Hoang
    Leong, Tze-Yun
    UNCERTAINTY IN ARTIFICIAL INTELLIGENCE, VOL 180, 2022, 180 : 2024 - 2034
  • [23] A Bayesian Approach for Estimating Causal Effects from Observational Data
    Pensar, Johan
    Talvitie, Topi
    Hyttinen, Antti
    Koivisto, Mikko
    THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 5395 - 5402
  • [24] Is it time to use observational data to estimate treatment effectiveness in multiple sclerosis?
    Trojano, Maria
    NEUROLOGY, 2007, 69 (15) : 1478 - 1479
  • [25] Combining observational and experimental data to estimate environmental and species drivers of fungal metacommunity dynamics
    Nenzen, Hedvig Kristina
    Moor, Helen
    O'Hara, Robert B.
    Jonsson, Mari
    Norden, Jenni
    Ottosson, Elisabet
    Snall, Tord
    ECOLOGY, 2025, 106 (02)
  • [26] Combining Multiple Sources of Data for Situational Awareness of Geomagnetic Disturbances
    Basu, Chumki
    Padmanaban, Manikandan
    Guillon, Sebastien
    de Montigny, Martin
    Kamwa, Innocent
    2015 IEEE POWER & ENERGY SOCIETY GENERAL MEETING, 2015,
  • [27] Causal inference and observational data
    Ivan Olier
    Yiqiang Zhan
    Xiaoyu Liang
    Victor Volovici
    BMC Medical Research Methodology, 23
  • [28] Causal inference and observational data
    Olier, Ivan
    Zhan, Yiqiang
    Liang, Xiaoyu
    Volovici, Victor
    BMC MEDICAL RESEARCH METHODOLOGY, 2023, 23 (01)
  • [29] Causal inference with observational data
    Nichols, Austin
    STATA JOURNAL, 2007, 7 (04): : 507 - 541
  • [30] Combining snow water equivalent data from multiple sources to estimate spatio-temporal trends and compare measurement systems
    Cowles, MK
    Zimmerman, DL
    Christ, A
    McGinnis, DL
    JOURNAL OF AGRICULTURAL BIOLOGICAL AND ENVIRONMENTAL STATISTICS, 2002, 7 (04) : 536 - 557