Directed Graphical Models and Causal Discovery for Zero-Inflated Data

被引:0
|
作者
Yu, Shiqing [1 ]
Drton, Mathias [2 ,3 ]
Shojaie, Ali [4 ]
机构
[1] Univ Washington, Dept Stat, Seattle, WA 98195 USA
[2] Tech Univ Munich, Dept Math, Munich, Germany
[3] Tech Univ Munich, Munich Data Sci Inst, Munich, Germany
[4] Univ Washington, Dept Biostat, Seattle, WA 98195 USA
基金
美国国家科学基金会; 美国国家卫生研究院; 欧洲研究理事会;
关键词
Bayesian network; causal discovery; directed acyclic graph; identifiability;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
With advances in technology, gene expression measurements from single cells can be used to gain refined insights into regulatory relationships among genes. Directed graphical models are wellsuited to explore such (cause-effect) relationships. However, statistical analyses of single cell data are complicated by the fact that the data often show zero-inflated expression patterns. To address this challenge, we propose directed graphical models that are based on Hurdle conditional distributions parametrized in terms of polynomials in parent variables and their 0/1 indicators of being zero or nonzero. While directed graphs for Gaussian models are only identifiable up to an equivalence class in general, we show that, under a natural and weak assumption, the exact directed acyclic graph of our zero-inflated models can be identified. We propose methods for graph recovery, apply our model to real single-cell gene expression data on T helper cells, and show simulated experiments that validate the identifiability and graph estimation methods in practice.
引用
收藏
页码:27 / 67
页数:41
相关论文
共 50 条
  • [31] Zero-Inflated gaussian mixed models for analyzing longitudinal microbiome data
    Zhang, Xinyan
    Guo, Boyi
    Yi, Nengjun
    PLOS ONE, 2020, 15 (11):
  • [32] Random effect models for repeated measures of zero-inflated count data
    Min, YY
    Agresti, A
    STATISTICAL MODELLING, 2005, 5 (01) : 1 - 19
  • [33] Models for zero-inflated count data using the Neyman type A distribution
    Dobbie, Melissa J.
    Welsh, Alan H.
    STATISTICAL MODELLING, 2001, 1 (01) : 65 - 80
  • [34] On Zero-Inflated Hierarchical Poisson Models with Application to Maternal Mortality Data
    Tawiah, Kassim
    Iddi, Samuel
    Lotsi, Anani
    INTERNATIONAL JOURNAL OF MATHEMATICS AND MATHEMATICAL SCIENCES, 2020, 2020
  • [35] Multivariate zero-inflated Poisson models and their applications
    Li, CS
    Lu, JC
    Park, JH
    TECHNOMETRICS, 1999, 41 (01) : 29 - 38
  • [36] Zero-Inflated Spatial Models: Application and Interpretation
    Ainsworth, L. M.
    Dean, C. B.
    Joy, R.
    ADVANCES AND CHALLENGES IN PARAMETRIC AND SEMI-PARAMETRIC ANALYSIS FOR CORRELATED DATA, 2016, 218 : 75 - 96
  • [37] Zero-Inflated Patent Data Analysis Using Compound Poisson Models
    Park, Sangsung
    Jun, Sunghae
    APPLIED SCIENCES-BASEL, 2023, 13 (07):
  • [38] CONSISTENT ESTIMATION OF ZERO-INFLATED COUNT MODELS
    Staub, Kevin E.
    Winkelmann, Rainer
    HEALTH ECONOMICS, 2013, 22 (06) : 673 - 686
  • [39] Control Charts for Zero-Inflated Binomial Models
    Yawsaeng, Bunpen
    Mayureesawan, Tidadeaw
    THAILAND STATISTICIAN, 2012, 10 (01): : 107 - 120
  • [40] Analysis of joint modeling of longitudinal zero-inflated power series and zero-inflated time to event data
    Zeinali Najafabadi, Mojtaba
    Bahrami Samani, Ehsan
    Ganjali, Mojtaba
    JOURNAL OF BIOPHARMACEUTICAL STATISTICS, 2020, 30 (05) : 854 - 872