Masked Gradient-Based Causal Structure Learning

被引:0
|
作者
Ng, Ignavier [1 ]
Zhu, Shengyu [2 ]
Fang, Zhuangyan [3 ]
Li, Haoyang [4 ]
Chen, Zhitang [2 ]
Wang, Jun [5 ]
机构
[1] Univ Toronto, Toronto, ON, Canada
[2] Huawei Noahs Ark Lab, Montreal, PQ, Canada
[3] Peking Univ, Beijing, Peoples R China
[4] Ecole Polytech, Lausanne, Switzerland
[5] UCL, London, England
关键词
Causal structure learning; gradient-based optimization; binary adjacency matrix; Gumbel-Softmax; DISCOVERY; SEARCH;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper studies the problem of learning causal structures from observational data. We reformulate the Structural Equation Model (SEM) with additive noises in a form parameterized by binary graph adjacency matrix and show that, if the original SEM is identifiable, then the binary adjacency matrix can be identified up to super-graphs of the true causal graph under mild conditions. We then utilize the reformulated SEM to develop a causal structure learning method that can be efficiently trained using gradient-based optimization, by leveraging a smooth characterization on acyclicity and the Gumbel-Softmax approach to approximate the binary adjacency matrix. It is found that the obtained entries are typically near zero or one and can be easily thresholded to identify the edges. We conduct experiments on synthetic and real datasets to validate the e.ectiveness of the proposed method, and show that it readily includes di.erent smooth model functions and achieves a much improved performance on most datasets considered.
引用
收藏
页码:424 / 432
页数:9
相关论文
共 50 条
  • [31] Understanding Benign Overfitting in Gradient-based Meta Learning
    Chen, Lisha
    Lu, Songtao
    Chen, Tianyi
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,
  • [32] A Gradient-based reinforcement learning model of market equilibration
    He, Zhongzhi
    JOURNAL OF ECONOMIC DYNAMICS & CONTROL, 2023, 152
  • [33] Adaptive Gradient-Based Meta-Learning Methods
    Khodak, Mikhail
    Balcan, Maria-Florina
    Talwalkar, Ameet
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019), 2019, 32
  • [34] Estimation and approximation bounds for gradient-based reinforcement learning
    Bartlett, PL
    Baxter, J
    JOURNAL OF COMPUTER AND SYSTEM SCIENCES, 2002, 64 (01) : 133 - 150
  • [35] Convergence Analysis of Gradient-Based Learning in Continuous Games
    Chasnov, Benjamin
    Ratliff, Lillian
    Mazumdar, Eric
    Burden, Samuel
    35TH UNCERTAINTY IN ARTIFICIAL INTELLIGENCE CONFERENCE (UAI 2019), 2020, 115 : 935 - 944
  • [36] Gradient-based Hyperparameter Optimization through Reversible Learning
    Maclaurin, Dougal
    Duvenaud, David
    Adams, Ryan P.
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 37, 2015, 37 : 2113 - 2122
  • [37] Intractability of Learning the Discrete Logarithm with Gradient-Based Methods
    Takhanov, Rustem
    Tezekbayev, Maxat
    Pak, Artur
    Bolatov, Arman
    Kadyrsizova, Zhibek
    Assylbekov, Zhenisbek
    ASIAN CONFERENCE ON MACHINE LEARNING, VOL 222, 2023, 222
  • [38] Inverse Reinforcement Learning from a Gradient-based Learner
    Ramponi, Giorgia
    Drappo, Gianluca
    Restelli, Marcello
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 33, NEURIPS 2020, 2020, 33
  • [39] Optimal gradient-based learning using importance weights
    Hochreiter, S
    Obermayer, K
    PROCEEDINGS OF THE INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), VOLS 1-5, 2005, : 114 - 119
  • [40] Understanding transfer learning and gradient-based meta-learning techniques
    Huisman, Mike
    Plaat, Aske
    van Rijn, Jan N.
    MACHINE LEARNING, 2024, 113 (07) : 4113 - 4132