Interventional Causal Structure Discovery Over Graphical Models With Convergence and Optimality Guarantees

被引:0
|
作者
Qiu, Chengbo [1 ]
Yang, Kai [1 ]
机构
[1] Tongji Univ, Dept Comp Sci & Technol, Shanghai 201800, Peoples R China
关键词
Optimization; Polynomials; Convergence; Privacy; Distributed databases; Data privacy; Data models; Symbols; Noise; Graphical models; Bilevel optimization; causal structure learning; directed acyclic graph; distributed setting; graphical model; interventional data; polynomial optimization; OPTIMIZATION; POLYNOMIALS;
D O I
10.1109/TNSE.2024.3487301
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
Learning causal structure from sampled data is a fundamental problem with applications in various fields, including healthcare, machine learning and artificial intelligence. Traditional methods predominantly rely on observational data, but there exist limits regarding the identifiability of causal structures with only observational data. Interventional data, on the other hand, helps establish a cause-and-effect relationship by breaking the influence of confounding variables. It remains to date under-explored to develop a mathematical framework that seamlessly integrates both observational and interventional data in causal structure learning. Furthermore, existing studies often focus on centralized approaches, necessitating the transfer of entire datasets to a single server, which lead to considerable communication overhead and heightened risks to privacy. To tackle these challenges, we develop a <bold>b</bold>i<bold>l</bold>evel p<bold>o</bold>lynomial <bold>o</bold>pti<bold>m</bold>ization (Bloom) framework. Bloom not only provides a powerful mathematical modeling framework, underpinned by theoretical support, for causal structure discovery from both interventional and observational data, but also aspires to an efficient causal discovery algorithm with convergence and optimality guarantees. We further extend Bloom to a distributed setting to reduce the communication overhead and mitigate data privacy risks. It is seen through experiments on both synthetic and real-world datasets that Bloom markedly surpasses other leading learning algorithms.
引用
收藏
页码:156 / 172
页数:17
相关论文
共 7 条
  • [1] Review of Causal Discovery Methods Based on Graphical Models
    Glymour, Clark
    Zhang, Kun
    Spirtes, Peter
    FRONTIERS IN GENETICS, 2019, 10
  • [2] Causal Discovery for Climate Research Using Graphical Models
    Ebert-Uphoff, Imme
    Deng, Yi
    JOURNAL OF CLIMATE, 2012, 25 (17) : 5648 - 5665
  • [3] Directed Graphical Models and Causal Discovery for Zero-Inflated Data
    Yu, Shiqing
    Drton, Mathias
    Shojaie, Ali
    CONFERENCE ON CAUSAL LEARNING AND REASONING, VOL 213, 2023, 213 : 27 - 67
  • [4] Causal structure learning in directed, possibly cyclic, graphical models
    Semnani, Pardis
    Robeva, Elina
    JOURNAL OF CAUSAL INFERENCE, 2025, 13 (01)
  • [5] Causal Discovery From Unknown Interventional Datasets Over Overlapping Variable Sets
    Cao, Fuyuan
    Wang, Yunxia
    Yu, Kui
    Liang, Jiye
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2024, 36 (12) : 7725 - 7742
  • [6] Joint structure learning and causal effect estimation for categorical graphical models
    Castelletti, Federico
    Consonni, Guido
    Della Vedova, Marco L.
    BIOMETRICS, 2024, 80 (03)
  • [7] Minimal I-MAP MCMC for Scalable Structure Discovery in Causal DAG Models
    Agrawal, Raj
    Broderick, Tamara
    Uhler, Caroline
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 80, 2018, 80