Penalized estimation of directed acyclic graphs from discrete data

被引:0
|
作者
Jiaying Gu
Fei Fu
Qing Zhou
机构
[1] University of California,Department of Statistics
来源
Statistics and Computing | 2019年 / 29卷
关键词
Coordinate descent; Discrete Bayesian network; Multi-logit regression; Structure learning; Group norm penalty;
D O I
暂无
中图分类号
学科分类号
摘要
Bayesian networks, with structure given by a directed acyclic graph (DAG), are a popular class of graphical models. However, learning Bayesian networks from discrete or categorical data is particularly challenging, due to the large parameter space and the difficulty in searching for a sparse structure. In this article, we develop a maximum penalized likelihood method to tackle this problem. Instead of the commonly used multinomial distribution, we model the conditional distribution of a node given its parents by multi-logit regression, in which an edge is parameterized by a set of coefficient vectors with dummy variables encoding the levels of a node. To obtain a sparse DAG, a group norm penalty is employed, and a blockwise coordinate descent algorithm is developed to maximize the penalized likelihood subject to the acyclicity constraint of a DAG. When interventional data are available, our method constructs a causal network, in which a directed edge represents a causal relation. We apply our method to various simulated and real data sets. The results show that our method is very competitive, compared to many existing methods, in DAG estimation from both interventional and high-dimensional observational data.
引用
收藏
页码:161 / 176
页数:15
相关论文
共 50 条
  • [41] Recursive processing of directed acyclic graphs
    Bianchini, M
    Gori, M
    Scarselli, F
    NEURAL NETS WIRN VIETRI-01, 2002, : 96 - 101
  • [42] Ternary directed acyclic word graphs
    Miyamoto, S
    Inenaga, S
    Takeda, M
    Shinohara, A
    IMPLEMENTATION AND APPLICATION OF AUTOMATA, PROCEEDINGS, 2003, 2759 : 120 - 130
  • [43] HEURISTIC CHAINING IN DIRECTED ACYCLIC GRAPHS
    VENUGOPAL, R
    SRIKANT, YN
    COMPUTER LANGUAGES, 1993, 19 (03): : 169 - 184
  • [44] On compact directed acyclic word graphs
    Crochemore, M
    Verin, R
    STRUCTURES IN LOGIC AND COMPUTER SCIENCE: A SELECTION OF ESSAYS IN HONOR OF A. EHRENFEUCHT, 1997, 1261 : 192 - 211
  • [45] Broadcasting on Random Directed Acyclic Graphs
    Makur, Anuran
    Mossel, Elchanan
    Polyanskiy, Yury
    IEEE TRANSACTIONS ON INFORMATION THEORY, 2020, 66 (02) : 780 - 812
  • [46] Information Flow on Directed Acyclic Graphs
    Donders, Michael
    More, Sara Miner
    Naumov, Pavel
    LOGIC, LANGUAGE, INFORMATION AND COMPUTATION, WOLLIC 2011, 2011, 6642 : 95 - 109
  • [47] Community detection in directed acyclic graphs
    Leo Speidel
    Taro Takaguchi
    Naoki Masuda
    The European Physical Journal B, 2015, 88
  • [48] Covering Pairs in Directed Acyclic Graphs
    Beerenwinkel, Niko
    Beretta, Stefano
    Bonizzoni, Paola
    Dondi, Riccardo
    Pirola, Yuri
    COMPUTER JOURNAL, 2015, 58 (07): : 1673 - 1686
  • [49] On counting homomorphisms to directed acyclic graphs
    Dyer, Martin
    Goldberg, Leslie Ann
    Paterson, Mike
    AUTOMATA, LANGUAGES AND PROGRAMMING, PT 1, 2006, 4051 : 38 - 49
  • [50] Transformers over Directed Acyclic Graphs
    Luo, Yuankai
    Thost, Veronika
    Shi, Lei
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,