Directed Graphical Models and Causal Discovery for Zero-Inflated Data

被引:0
|
作者
Yu, Shiqing [1 ]
Drton, Mathias [2 ,3 ]
Shojaie, Ali [4 ]
机构
[1] Univ Washington, Dept Stat, Seattle, WA 98195 USA
[2] Tech Univ Munich, Dept Math, Munich, Germany
[3] Tech Univ Munich, Munich Data Sci Inst, Munich, Germany
[4] Univ Washington, Dept Biostat, Seattle, WA 98195 USA
基金
美国国家科学基金会; 美国国家卫生研究院; 欧洲研究理事会;
关键词
Bayesian network; causal discovery; directed acyclic graph; identifiability;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
With advances in technology, gene expression measurements from single cells can be used to gain refined insights into regulatory relationships among genes. Directed graphical models are wellsuited to explore such (cause-effect) relationships. However, statistical analyses of single cell data are complicated by the fact that the data often show zero-inflated expression patterns. To address this challenge, we propose directed graphical models that are based on Hurdle conditional distributions parametrized in terms of polynomials in parent variables and their 0/1 indicators of being zero or nonzero. While directed graphs for Gaussian models are only identifiable up to an equivalence class in general, we show that, under a natural and weak assumption, the exact directed acyclic graph of our zero-inflated models can be identified. We propose methods for graph recovery, apply our model to real single-cell gene expression data on T helper cells, and show simulated experiments that validate the identifiability and graph estimation methods in practice.
引用
收藏
页码:27 / 67
页数:41
相关论文
共 50 条
  • [41] Bayesian analysis of zero-inflated regression models
    Ghosh, SK
    Mukhopadhyay, P
    Lu, JC
    JOURNAL OF STATISTICAL PLANNING AND INFERENCE, 2006, 136 (04) : 1360 - 1375
  • [42] A novel causal mediation analysis approach for zero-inflated mediators
    Jiang, Meilin
    Lee, Seonjoo
    O'Malley, A. James
    Stern, Yaakov
    Li, Zhigang
    STATISTICS IN MEDICINE, 2023, 42 (13) : 2061 - 2081
  • [43] Zero-inflated modeling part I: Traditional zero-inflated count regression models, their applications, and computational tools
    Young, Derek S.
    Roemmele, Eric S.
    Yeh, Peng
    WILEY INTERDISCIPLINARY REVIEWS-COMPUTATIONAL STATISTICS, 2022, 14 (01):
  • [44] A Zero-Inflated Regression Model for Grouped Data
    Brown, Sarah
    Duncan, Alan
    Harris, Mark N.
    Roberts, Jennifer
    Taylor, Karl
    OXFORD BULLETIN OF ECONOMICS AND STATISTICS, 2015, 77 (06) : 822 - 831
  • [45] Zero-inflated modeling part I: Traditional zero-inflated count regression models, their applications, and computational tools
    Young, Derek S.
    Roemmele, Eric S.
    Yeh, Peng
    Wiley Interdisciplinary Reviews: Computational Statistics, 2022, 14 (01)
  • [46] Modelling count data with excessive zeros: The need for class prediction in zero-inflated models and the issue of data generation in choosing between zero-inflated and generic mixture models for dental caries data
    Gilthorpe, Mark S.
    Frydenberg, Morten
    Cheng, Yaping
    Baelum, Vibeke
    STATISTICS IN MEDICINE, 2009, 28 (28) : 3539 - 3553
  • [47] Small Area Estimation for Zero-Inflated Data
    Chandra, Hukum
    Sud, U. C.
    COMMUNICATIONS IN STATISTICS-SIMULATION AND COMPUTATION, 2012, 41 (05) : 632 - 643
  • [48] Semiparametric analysis of zero-inflated count data
    Lam, K. F.
    Xue, Hongqi
    Cheung, Yin Bun
    BIOMETRICS, 2006, 62 (04) : 996 - 1003
  • [49] A Bayesian approach to zero-inflated data in extremes
    Quadros Gramosa, Alexandre Henrique
    do Nascimento, Fernando Ferraz
    Castro Morales, Fidel Ernesto
    COMMUNICATIONS IN STATISTICS-THEORY AND METHODS, 2020, 49 (17) : 4150 - 4161
  • [50] Zero-inflated Poisson model with group data
    Yang, Jun
    Zhang, Xin
    ADVANCED MATERIALS DESIGN AND MECHANICS, 2012, 569 : 627 - 631