BAYESIAN MIXED EFFECTS MODELS FOR ZERO-INFLATED COMPOSITIONS IN MICROBIOME DATA ANALYSIS

被引:8
|
作者
Ren, Boyu [1 ]
Bacallado, Sergio [2 ]
Favaro, Stefano [3 ,4 ]
Vatanen, Tommi [5 ]
Huttenhower, Curtis [1 ,5 ]
Trippa, Lorenzo [1 ]
机构
[1] Harvard Univ, Dept Biostat, Cambridge, MA 02138 USA
[2] Univ Cambridge, Dept Pure Math & Math Stat, Cambridge, England
[3] Univ Torino, Departimento Sci Econ Sociali & Matemat Stat, Turin, Italy
[4] Coll Carlo Alberto, Turin, Italy
[5] Univ Auckland, Liggins Inst, Auckland, New Zealand
来源
ANNALS OF APPLIED STATISTICS | 2020年 / 14卷 / 01期
基金
欧洲研究理事会;
关键词
Truncated dependent Dirichlet processes; latent factor model; type; 1; diabetes; MULTINOMIAL REGRESSION; GUT MICROBIOME;
D O I
10.1214/19-AOAS1295
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
Detecting associations between microbial compositions and sample characteristics is one of the most important tasks in microbiome studies. Most of the existing methods apply univariate models to single microbial species separately, with adjustments for multiple hypothesis testing. We propose a Bayesian analysis for a generalized mixed effects linear model tailored to this application. The marginal prior on each microbial composition is a Dirichlet process, and dependence across compositions is induced through a linear combination of individual covariates, such as disease biomarkers or the subject's age, and latent factors. The latent factors capture residual variability and their dimensionality is learned from the data in a fully Bayesian procedure. The proposed model is tested in data analyses and simulation studies with zero-inflated compositions. In these settings and within each sample, a large proportion of counts per microbial species are equal to zero. In our Bayesian model a priori the probability of compositions with absent microbial species is strictly positive. We propose an efficient algorithm to sample from the posterior and visualizations of model parameters which reveal associations between covariates and microbial compositions. We evaluate the proposed method in simulation studies, and then analyze a microbiome dataset for infants with type 1 diabetes which contains a large proportion of zeros in the sample-specific microbial compositions.
引用
收藏
页码:494 / 517
页数:24
相关论文
共 50 条
  • [21] Compositional zero-inflated network estimation for microbiome data
    Ha, Min Jin
    Kim, Junghi
    Galloway-Pena, Jessica
    Kim-Anh Do
    Peterson, Christine B.
    BMC BIOINFORMATICS, 2020, 21 (Suppl 21)
  • [22] Compositional zero-inflated network estimation for microbiome data
    Min Jin Ha
    Junghi Kim
    Jessica Galloway-Peña
    Kim-Anh Do
    Christine B. Peterson
    BMC Bioinformatics, 21
  • [23] Bayesian Nonparametric Model for Heterogeneous Treatment Effects With Zero-Inflated Data
    Kim, Chanmin
    Li, Yisheng
    Xu, Ting
    Liao, Zhongxing
    STATISTICS IN MEDICINE, 2024, 43 (30) : 5968 - 5982
  • [24] Zero-inflated models and estimation in zero-inflated Poisson distribution
    Wagh, Yogita S.
    Kamalja, Kirtee K.
    COMMUNICATIONS IN STATISTICS-SIMULATION AND COMPUTATION, 2018, 47 (08) : 2248 - 2265
  • [25] A framework of zero-inflated bayesian negative binomial regression models for spatiotemporal data
    He, Qing
    Huang, Hsin-Hsiung
    JOURNAL OF STATISTICAL PLANNING AND INFERENCE, 2024, 229
  • [26] Randomized quantile residuals for diagnosing zero-inflated generalized linear mixed models with applications to microbiome count data
    Bai, Wei
    Dong, Mei
    Li, Longhai
    Feng, Cindy
    Xu, Wei
    BMC BIOINFORMATICS, 2021, 22 (01)
  • [27] Randomized quantile residuals for diagnosing zero-inflated generalized linear mixed models with applications to microbiome count data
    Wei Bai
    Mei Dong
    Longhai Li
    Cindy Feng
    Wei Xu
    BMC Bioinformatics, 22
  • [28] The analysis of zero-inflated count data: Beyond zero-inflated Poisson regression.
    Loeys, Tom
    Moerkerke, Beatrijs
    De Smet, Olivia
    Buysse, Ann
    BRITISH JOURNAL OF MATHEMATICAL & STATISTICAL PSYCHOLOGY, 2012, 65 (01): : 163 - 180
  • [29] Assessing influence for pharmaceutical data in zero-inflated generalized Poisson mixed models
    Xie, Feng-Chang
    Wei, Bo-Cheng
    Lin, Jin-Guan
    STATISTICS IN MEDICINE, 2008, 27 (18) : 3656 - 3673
  • [30] Some extensions of zero-inflated models and Bayesian tests for them
    Mersad, M.
    Ganjali, M.
    Rivaz, F.
    JOURNAL OF STATISTICAL COMPUTATION AND SIMULATION, 2015, 85 (18) : 3792 - 3810