Directed Graphical Models and Causal Discovery for Zero-Inflated Data

被引:0
|
作者
Yu, Shiqing [1 ]
Drton, Mathias [2 ,3 ]
Shojaie, Ali [4 ]
机构
[1] Univ Washington, Dept Stat, Seattle, WA 98195 USA
[2] Tech Univ Munich, Dept Math, Munich, Germany
[3] Tech Univ Munich, Munich Data Sci Inst, Munich, Germany
[4] Univ Washington, Dept Biostat, Seattle, WA 98195 USA
基金
美国国家科学基金会; 美国国家卫生研究院; 欧洲研究理事会;
关键词
Bayesian network; causal discovery; directed acyclic graph; identifiability;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
With advances in technology, gene expression measurements from single cells can be used to gain refined insights into regulatory relationships among genes. Directed graphical models are wellsuited to explore such (cause-effect) relationships. However, statistical analyses of single cell data are complicated by the fact that the data often show zero-inflated expression patterns. To address this challenge, we propose directed graphical models that are based on Hurdle conditional distributions parametrized in terms of polynomials in parent variables and their 0/1 indicators of being zero or nonzero. While directed graphs for Gaussian models are only identifiable up to an equivalence class in general, we show that, under a natural and weak assumption, the exact directed acyclic graph of our zero-inflated models can be identified. We propose methods for graph recovery, apply our model to real single-cell gene expression data on T helper cells, and show simulated experiments that validate the identifiability and graph estimation methods in practice.
引用
收藏
页码:27 / 67
页数:41
相关论文
共 50 条
  • [11] A class of models for large zero-inflated spatial data
    Lee, Ben Seiyon
    Haran, Murali
    JOURNAL OF AGRICULTURAL BIOLOGICAL AND ENVIRONMENTAL STATISTICS, 2024,
  • [12] Grouped zero-inflated count data models of coital frequency
    Moffatt, PG
    Peters, SA
    JOURNAL OF POPULATION ECONOMICS, 2000, 13 (02) : 205 - 220
  • [13] The analysis of zero-inflated count data: Beyond zero-inflated Poisson regression.
    Loeys, Tom
    Moerkerke, Beatrijs
    De Smet, Olivia
    Buysse, Ann
    BRITISH JOURNAL OF MATHEMATICAL & STATISTICAL PSYCHOLOGY, 2012, 65 (01): : 163 - 180
  • [14] Identifiability of zero-inflated Poisson models
    Li, Chin-Shang
    BRAZILIAN JOURNAL OF PROBABILITY AND STATISTICS, 2012, 26 (03) : 306 - 312
  • [15] Grouped zero-inflated count data models of coital frequency
    Peter G. Moffatt
    Simon A. Peters
    Journal of Population Economics, 2000, 13 : 205 - 220
  • [16] Hierarchical Mixture Models for Zero-inflated Correlated Count Data
    Chen, Xue-dong
    Shi, Hong-xing
    Wang, Xue-ren
    ACTA MATHEMATICAE APPLICATAE SINICA-ENGLISH SERIES, 2016, 32 (02): : 373 - 384
  • [17] Hierarchical Mixture Models for Zero-inflated Correlated Count Data
    Xue-dong CHEN
    Hong-xing SHI
    Xue-ren WANG
    Acta Mathematicae Applicatae Sinica, 2016, 32 (02) : 373 - 384
  • [18] Hierarchical mixture models for zero-inflated correlated count data
    Xue-dong Chen
    Hong-xing Shi
    Xue-ren Wang
    Acta Mathematicae Applicatae Sinica, English Series, 2016, 32 : 373 - 384
  • [19] SPLINE REGRESSION FOR ZERO-INFLATED MODELS
    Opitz, T.
    Tramini, P.
    Molinari, N.
    JP JOURNAL OF BIOSTATISTICS, 2013, 9 (01) : 53 - 66
  • [20] Generalized Additive Models for Zero-Inflated Data with Partial Constraints
    Liu, Hai
    Chan, Kung-Sik
    SCANDINAVIAN JOURNAL OF STATISTICS, 2011, 38 (04) : 650 - 665