Directed Graphical Models and Causal Discovery for Zero-Inflated Data

被引:0
|
作者
Yu, Shiqing [1 ]
Drton, Mathias [2 ,3 ]
Shojaie, Ali [4 ]
机构
[1] Univ Washington, Dept Stat, Seattle, WA 98195 USA
[2] Tech Univ Munich, Dept Math, Munich, Germany
[3] Tech Univ Munich, Munich Data Sci Inst, Munich, Germany
[4] Univ Washington, Dept Biostat, Seattle, WA 98195 USA
基金
美国国家科学基金会; 美国国家卫生研究院; 欧洲研究理事会;
关键词
Bayesian network; causal discovery; directed acyclic graph; identifiability;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
With advances in technology, gene expression measurements from single cells can be used to gain refined insights into regulatory relationships among genes. Directed graphical models are wellsuited to explore such (cause-effect) relationships. However, statistical analyses of single cell data are complicated by the fact that the data often show zero-inflated expression patterns. To address this challenge, we propose directed graphical models that are based on Hurdle conditional distributions parametrized in terms of polynomials in parent variables and their 0/1 indicators of being zero or nonzero. While directed graphs for Gaussian models are only identifiable up to an equivalence class in general, we show that, under a natural and weak assumption, the exact directed acyclic graph of our zero-inflated models can be identified. We propose methods for graph recovery, apply our model to real single-cell gene expression data on T helper cells, and show simulated experiments that validate the identifiability and graph estimation methods in practice.
引用
收藏
页码:27 / 67
页数:41
相关论文
共 50 条
  • [1] Model-Based Causal Discovery for Zero-Inflated Count Data
    Choi, Junsouk
    Ni, Yang
    JOURNAL OF MACHINE LEARNING RESEARCH, 2023, 24
  • [2] Zero-inflated modeling part II: Zero-inflated models for complex data structures
    Young, Derek S.
    Roemmele, Eric S.
    Shi, Xuan
    WILEY INTERDISCIPLINARY REVIEWS-COMPUTATIONAL STATISTICS, 2022, 14 (02)
  • [3] GRAPHICAL MODELS FOR ZERO-INFLATED SINGLE CELL GENE EXPRESSION
    McDavid, Andrew
    Gottardo, Raphael
    Simon, Noah
    Drton, Mathias
    ANNALS OF APPLIED STATISTICS, 2019, 13 (02): : 848 - 873
  • [4] Zero-inflated models and estimation in zero-inflated Poisson distribution
    Wagh, Yogita S.
    Kamalja, Kirtee K.
    COMMUNICATIONS IN STATISTICS-SIMULATION AND COMPUTATION, 2018, 47 (08) : 2248 - 2265
  • [5] Sufficient dimension reduction for a novel class of zero-inflated graphical models
    Koplin, Eric
    Forzani, Liliasna
    Tomassi, Diego
    Pfeiffer, Ruth M.
    COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2024, 196
  • [6] Zero-inflated models with application to spatial count data
    Agarwal, DK
    Gelfand, AE
    Citron-Pousty, S
    ENVIRONMENTAL AND ECOLOGICAL STATISTICS, 2002, 9 (04) : 341 - 355
  • [7] Marginal zero-inflated regression models for count data
    Martin, Jacob
    Hall, Daniel B.
    JOURNAL OF APPLIED STATISTICS, 2017, 44 (10) : 1807 - 1826
  • [8] Marginal Mean Models for Zero-Inflated Count Data
    Todem, David
    Kim, KyungMann
    Hsu, Wei-Wen
    BIOMETRICS, 2016, 72 (03) : 986 - 994
  • [9] Zero-inflated Bell regression models for count data
    Lemonte, Artur J.
    Moreno-Arenas, German
    Castellares, Fredy
    JOURNAL OF APPLIED STATISTICS, 2020, 47 (02) : 265 - 286
  • [10] Zero-inflated models with application to spatial count data
    Deepak K. Agarwal
    Alan E. Gelfand
    Steven Citron-Pousty
    Environmental and Ecological Statistics, 2002, 9 : 341 - 355