BAYESIAN MIXED EFFECTS MODELS FOR ZERO-INFLATED COMPOSITIONS IN MICROBIOME DATA ANALYSIS
被引:8
|
作者:
Ren, Boyu
论文数: 0引用数: 0
h-index: 0
机构:
Harvard Univ, Dept Biostat, Cambridge, MA 02138 USAHarvard Univ, Dept Biostat, Cambridge, MA 02138 USA
Ren, Boyu
[1
]
Bacallado, Sergio
论文数: 0引用数: 0
h-index: 0
机构:
Univ Cambridge, Dept Pure Math & Math Stat, Cambridge, EnglandHarvard Univ, Dept Biostat, Cambridge, MA 02138 USA
Bacallado, Sergio
[2
]
Favaro, Stefano
论文数: 0引用数: 0
h-index: 0
机构:
Univ Torino, Departimento Sci Econ Sociali & Matemat Stat, Turin, Italy
Coll Carlo Alberto, Turin, ItalyHarvard Univ, Dept Biostat, Cambridge, MA 02138 USA
Favaro, Stefano
[3
,4
]
Vatanen, Tommi
论文数: 0引用数: 0
h-index: 0
机构:
Univ Auckland, Liggins Inst, Auckland, New ZealandHarvard Univ, Dept Biostat, Cambridge, MA 02138 USA
Vatanen, Tommi
[5
]
Huttenhower, Curtis
论文数: 0引用数: 0
h-index: 0
机构:
Harvard Univ, Dept Biostat, Cambridge, MA 02138 USA
Univ Auckland, Liggins Inst, Auckland, New ZealandHarvard Univ, Dept Biostat, Cambridge, MA 02138 USA
Huttenhower, Curtis
[1
,5
]
Trippa, Lorenzo
论文数: 0引用数: 0
h-index: 0
机构:
Harvard Univ, Dept Biostat, Cambridge, MA 02138 USAHarvard Univ, Dept Biostat, Cambridge, MA 02138 USA
Trippa, Lorenzo
[1
]
机构:
[1] Harvard Univ, Dept Biostat, Cambridge, MA 02138 USA
[2] Univ Cambridge, Dept Pure Math & Math Stat, Cambridge, England
Detecting associations between microbial compositions and sample characteristics is one of the most important tasks in microbiome studies. Most of the existing methods apply univariate models to single microbial species separately, with adjustments for multiple hypothesis testing. We propose a Bayesian analysis for a generalized mixed effects linear model tailored to this application. The marginal prior on each microbial composition is a Dirichlet process, and dependence across compositions is induced through a linear combination of individual covariates, such as disease biomarkers or the subject's age, and latent factors. The latent factors capture residual variability and their dimensionality is learned from the data in a fully Bayesian procedure. The proposed model is tested in data analyses and simulation studies with zero-inflated compositions. In these settings and within each sample, a large proportion of counts per microbial species are equal to zero. In our Bayesian model a priori the probability of compositions with absent microbial species is strictly positive. We propose an efficient algorithm to sample from the posterior and visualizations of model parameters which reveal associations between covariates and microbial compositions. We evaluate the proposed method in simulation studies, and then analyze a microbiome dataset for infants with type 1 diabetes which contains a large proportion of zeros in the sample-specific microbial compositions.
机构:
Huzhou Normal Coll, Fac Sci, Huzhou 313000, Zhejiang, Peoples R China
Yunnan Univ, Dept Stat, Kunming 650091, Peoples R ChinaHuzhou Normal Coll, Fac Sci, Huzhou 313000, Zhejiang, Peoples R China
机构:
Forsyth Inst, 245 First St, Cambridge, MA 02142 USA
Harvard Sch Dent Med, Dept Oral Hlth Policy & Epidemiol, Boston, MA 02115 USAForsyth Inst, 245 First St, Cambridge, MA 02142 USA
Lee, Kyu Ha
Coull, Brent A.
论文数: 0引用数: 0
h-index: 0
机构:
Harvard TH Chan Sch Publ Hlth, Dept Biostat, 665 Huntington Ave, Boston, MA 02115 USAForsyth Inst, 245 First St, Cambridge, MA 02142 USA
Coull, Brent A.
Moscicki, Anna-Barbara
论文数: 0引用数: 0
h-index: 0
机构:
Univ Calif Los Angeles, David Geffen Sch Med, Dept Pediat, Los Angeles, CA 10833 USAForsyth Inst, 245 First St, Cambridge, MA 02142 USA
Moscicki, Anna-Barbara
Paster, Bruce J.
论文数: 0引用数: 0
h-index: 0
机构:
Forsyth Inst, 245 First St, Cambridge, MA 02142 USA
Harvard Sch Dent Med, Dept Oral Med Infect & Immun, Boston, MA 02115 USAForsyth Inst, 245 First St, Cambridge, MA 02142 USA
Paster, Bruce J.
Starr, Jacqueline R.
论文数: 0引用数: 0
h-index: 0
机构:
Forsyth Inst, 245 First St, Cambridge, MA 02142 USA
Harvard Sch Dent Med, Dept Oral Hlth Policy & Epidemiol, Boston, MA 02115 USAForsyth Inst, 245 First St, Cambridge, MA 02142 USA
机构:
Univ Toronto, Dalla Lana Sch Publ Hlth, Toronto, ON M5T 3M7, CanadaUniv Toronto, Dalla Lana Sch Publ Hlth, Toronto, ON M5T 3M7, Canada
Xu, Lizhen
Paterson, Andrew D.
论文数: 0引用数: 0
h-index: 0
机构:
Univ Toronto, Dalla Lana Sch Publ Hlth, Toronto, ON M5T 3M7, Canada
Hosp Sick Children Toronto, Program Genet & Genome Biol, Toronto, ON M5G 0A4, CanadaUniv Toronto, Dalla Lana Sch Publ Hlth, Toronto, ON M5T 3M7, Canada
Paterson, Andrew D.
Turpin, Williams
论文数: 0引用数: 0
h-index: 0
机构:
Mt Sinai Hosp, Zane Cohen Ctr Digest Dis, Div Gastroenterol, Toronto, ON M5T 3L9, CanadaUniv Toronto, Dalla Lana Sch Publ Hlth, Toronto, ON M5T 3M7, Canada
Turpin, Williams
Xu, Wei
论文数: 0引用数: 0
h-index: 0
机构:
Univ Toronto, Dalla Lana Sch Publ Hlth, Toronto, ON M5T 3M7, Canada
Princess Margaret Hosp, Dept Biostat, Toronto, ON M5G 2M9, CanadaUniv Toronto, Dalla Lana Sch Publ Hlth, Toronto, ON M5T 3M7, Canada