Directed Graphical Models and Causal Discovery for Zero-Inflated Data

被引:0
|
作者
Yu, Shiqing [1 ]
Drton, Mathias [2 ,3 ]
Shojaie, Ali [4 ]
机构
[1] Univ Washington, Dept Stat, Seattle, WA 98195 USA
[2] Tech Univ Munich, Dept Math, Munich, Germany
[3] Tech Univ Munich, Munich Data Sci Inst, Munich, Germany
[4] Univ Washington, Dept Biostat, Seattle, WA 98195 USA
基金
美国国家科学基金会; 美国国家卫生研究院; 欧洲研究理事会;
关键词
Bayesian network; causal discovery; directed acyclic graph; identifiability;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
With advances in technology, gene expression measurements from single cells can be used to gain refined insights into regulatory relationships among genes. Directed graphical models are wellsuited to explore such (cause-effect) relationships. However, statistical analyses of single cell data are complicated by the fact that the data often show zero-inflated expression patterns. To address this challenge, we propose directed graphical models that are based on Hurdle conditional distributions parametrized in terms of polynomials in parent variables and their 0/1 indicators of being zero or nonzero. While directed graphs for Gaussian models are only identifiable up to an equivalence class in general, we show that, under a natural and weak assumption, the exact directed acyclic graph of our zero-inflated models can be identified. We propose methods for graph recovery, apply our model to real single-cell gene expression data on T helper cells, and show simulated experiments that validate the identifiability and graph estimation methods in practice.
引用
收藏
页码:27 / 67
页数:41
相关论文
共 50 条
  • [21] Assessment and Selection of Competing Models for Zero-Inflated Microbiome Data
    Xu, Lizhen
    Paterson, Andrew D.
    Turpin, Williams
    Xu, Wei
    PLOS ONE, 2015, 10 (07):
  • [22] Exponential dispersion models for overdispersed zero-inflated count data
    Bar-Lev, Shaul K.
    Ridder, Ad
    COMMUNICATIONS IN STATISTICS-SIMULATION AND COMPUTATION, 2023, 52 (07) : 3286 - 3304
  • [23] A Zero-Inflated Model for Insurance Data
    Choi, Jong-Hoo
    Ko, In-Mi
    Cheon, Sooyoung
    KOREAN JOURNAL OF APPLIED STATISTICS, 2011, 24 (03) : 485 - 494
  • [24] On modeling zero-inflated insurance data
    Perez Sanchez, J. M.
    Gomez-Deniz, E.
    JOURNAL OF RISK MODEL VALIDATION, 2016, 10 (04): : 23 - 37
  • [25] Zero-inflated proportion data models applied to a biological control assay
    Vieira, AMC
    Hinde, JP
    Demétrio, CGB
    JOURNAL OF APPLIED STATISTICS, 2000, 27 (03) : 373 - 389
  • [26] Zero-Inflated Poisson Regression Models with Right Censored Count Data
    Saffari, Seyed Ehsan
    Adnan, Robiah
    MATEMATIKA, 2011, 27 (01) : 21 - 29
  • [27] Marginalized Zero-Inflated Bell Regression Models for Overdispersed Count Data
    Amani, Kouakou Mathias
    Kouakou, Konan Jean Geoffroy
    Hili, Ouagnina
    JOURNAL OF STATISTICAL THEORY AND PRACTICE, 2025, 19 (02)
  • [28] Zero-inflated multiscale models for aggregated small area health data
    Aregay, Mehreteab
    Lawson, Andrew B.
    Faes, Christel
    Kirby, Russell S.
    Carroll, Rachel
    Watjou, Kevin
    ENVIRONMETRICS, 2018, 29 (01)
  • [29] Score tests for zero-inflated Poisson models
    Jansakul, N
    Hinde, JP
    COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2002, 40 (01) : 75 - 96
  • [30] Bayesian Analysis for the Zero-inflated Regression Models
    Jane, Hakjin
    Kang, Yunhee
    Lee, S.
    Kim, Seong W.
    KOREAN JOURNAL OF APPLIED STATISTICS, 2008, 21 (04) : 603 - 613