Causal Discovery in the Presence of Missing Data

被引:0
|
作者
Tu, Ruibo [1 ]
Zhang, Cheng [2 ]
Ackermann, Paul [3 ]
Mohan, Karthika [4 ]
Kjellstrom, Hedvig [1 ]
Zhang, Kun [5 ]
机构
[1] KTH Royal Inst Technol, Stockholm, Sweden
[2] Microsoft Res, Cambridge, England
[3] Karolinska Inst, Solna, Sweden
[4] Univ Calif Berkeley, Berkeley, CA 94720 USA
[5] Carnegie Mellon Univ, Pittsburgh, PA 15213 USA
基金
美国国家科学基金会; 美国国家卫生研究院;
关键词
INFERENCE;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Missing data are ubiquitous in many domains such as healthcare. When these data entries are not missing completely at random, the (conditional) independence relations in the observed data may be different from those in the complete data generated by the underlying causal process. Consequently, simply applying existing causal discovery methods to the observed data may lead to wrong conclusions. In this paper, we aim at developing a causal discovery method to recover the underlying causal structure from observed data that are missing under different mechanisms, including missing completely at random (MCAR), missing at random (MAR), and missing not at random (MNAR). With missingness mechanisms represented by missingness graphs (m-graphs), we analyze conditions under which additional correction is needed to derive conditional independence/dependence relations in the complete data. Based on our analysis, we propose Missing Value PC (MVPC), which extends the PC algorithm to incorporate additional corrections. Our proposed MVPC is shown in theory to give asymptotically correct results even on data that are MAR or MNAR. Experimental results on both synthetic data and real healthcare applications illustrate that the proposed algorithm is able to find correct causal relations even in the general case of MNAR.
引用
收藏
页数:9
相关论文
共 50 条
  • [1] MissDAG: Causal Discovery in the Presence of Missing Data with Continuous Additive Noise Models
    Gao, Erdun
    Ng, Ignavier
    Gong, Mingming
    Shen, Li
    Huang, Wei
    Liu, Tongliang
    Zhang, Kun
    Bondell, Howard
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35, NEURIPS 2022, 2022,
  • [2] Causal Discovery with Missing Data in a Multicentric Clinical Study
    Zanga, Alessio
    Bernasconi, Alice
    Lucas, Peter J. F.
    Pijnenborg, Hanny
    Reijnen, Casper
    Scutari, Marco
    Stella, Fabio
    ARTIFICIAL INTELLIGENCE IN MEDICINE, AIME 2023, 2023, 13897 : 40 - 44
  • [3] Reconstruction of causal graphs for multivariate processes in the presence of missing data
    Agarwal, Piyush
    Tangirala, Arun K.
    2017 4TH INTERNATIONAL CONFERENCE ON CONTROL, DECISION AND INFORMATION TECHNOLOGIES (CODIT), 2017, : 389 - 394
  • [4] Identification of Causal Structure in the Presence of Missing Data with Additive Noise Model
    Qiao, Jie
    Chen, Zhengming
    Yu, Jianhua
    Cai, Ruichu
    Hao, Zhifeng
    THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 18, 2024, : 20516 - 20523
  • [5] Causal Discovery from Medical Data: Dealing with Missing Values and a Mixture of Discrete and Continuous Data
    Sokolova, Elena
    Groot, Perry
    Claassen, Tom
    von Rhein, Daniel
    Buitelaar, Jan
    Heskes, Tom
    ARTIFICIAL INTELLIGENCE IN MEDICINE (AIME 2015), 2015, 9105 : 177 - 181
  • [6] Recoverability of causal effects under presence of missing data: a longitudinal case study
    Holovchak, Anastasiia
    McIlleron, Helen
    Denti, Paolo
    Schomaker, Michael
    BIOSTATISTICS, 2024, 26 (01)
  • [7] Causal Discovery for Rolling Bearing Fault Under Missing Data: From the Perspective of Causal Effect and Information Flow
    Ding, Xu
    Wu, Hao
    Wang, Junlong
    Xu, Juan
    Xin, Miao
    IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2025, 74
  • [8] Causal Inference: A Missing Data Perspective
    Ding, Peng
    Li, Fan
    STATISTICAL SCIENCE, 2018, 33 (02) : 214 - 237
  • [9] Missing Data as a Causal and Probabilistic Problem
    Shpitser, Ilya
    Mohan, Karthika
    Pearl, Judea
    UNCERTAINTY IN ARTIFICIAL INTELLIGENCE, 2015, : 802 - 811
  • [10] Causal Feature Selection with Missing Data
    Yu, Kui
    Yang, Yajing
    Ding, Wei
    ACM TRANSACTIONS ON KNOWLEDGE DISCOVERY FROM DATA, 2022, 16 (04)