Zero-inflated Poisson factor model with application to microbiome read counts

Cited by: 19
Authors
Xu, Tianchen [1]
Demmer, Ryan T. [2]
Li, Gen [1]
Affiliations
[1] Columbia Univ, Mailman Sch Publ Hlth, Dept Biostat, New York, NY 10032 USA
[2] Univ Minnesota, Sch Publ Hlth, Div Epidemiol, Minneapolis, MN 55455 USA
Funding
National Institutes of Health (USA);
Keywords
16S sequencing; factor analysis; low rank; microbiome data; zero inflation; PERIODONTAL-DISEASE; EXPRESSION;
DOI
10.1111/biom.13272
Chinese Library Classification number
Q [Biological Sciences];
Discipline classification codes
07 ; 0710 ; 09 ;
Abstract
Dimension reduction of high-dimensional microbiome data facilitates subsequent analysis such as regression and clustering. Most existing reduction methods cannot fully accommodate the special features of the data such as count-valued and excessive zero reads. We propose a zero-inflated Poisson factor analysis model in this paper. The model assumes that microbiome read counts follow zero-inflated Poisson distributions with library size as offset and Poisson rates negatively related to the inflated zero occurrences. The latent parameters of the model form a low-rank matrix consisting of interpretable loadings and low-dimensional scores that can be used for further analyses. We develop an efficient and robust expectation-maximization algorithm for parameter estimation. We demonstrate the efficacy of the proposed method using comprehensive simulation studies. The application to the Oral Infections, Glucose Intolerance, and Insulin Resistance Study provides valuable insights into the relation between subgingival microbiome and periodontal disease.
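The data-generating mechanism described in the abstract can be illustrated with a small simulation. This is only a sketch under stated assumptions, not the authors' implementation: the function name simulate_zip_factor, the logistic link logit(p) = -tau * log(rate), the parameter tau, and the range of library sizes are all hypothetical choices made for illustration. The abstract states only that the Poisson rates are negatively related to the inflated zero occurrences, that library size enters as an offset, and that the latent parameters form a low-rank matrix of loadings and scores.

```python
import numpy as np


def simulate_zip_factor(n_samples=50, n_taxa=30, rank=3, tau=0.5, seed=0):
    """Simulate zero-inflated Poisson counts with a low-rank log-rate matrix.

    All names and default values here are illustrative assumptions.
    """
    rng = np.random.default_rng(seed)

    # Low-dimensional scores (one row per sample) and loadings (one row per taxon).
    scores = rng.normal(scale=0.5, size=(n_samples, rank))
    loadings = rng.normal(scale=0.5, size=(n_taxa, rank))

    # Low-rank matrix of log relative rates: rank is at most `rank`.
    log_rate = scores @ loadings.T

    # Library size per sample, used as a multiplicative offset (assumed range).
    library_size = rng.integers(50, 200, size=(n_samples, 1))

    # Assumed link: logit(p_zero) = -tau * log_rate, so a higher rate
    # means a lower probability of an inflated (structural) zero.
    p_zero = 1.0 / (1.0 + np.exp(tau * log_rate))

    # Draw Poisson counts, then overwrite the structural zeros.
    counts = rng.poisson(library_size * np.exp(log_rate))
    structural_zero = rng.random((n_samples, n_taxa)) < p_zero
    counts[structural_zero] = 0
    return counts, p_zero, log_rate


counts, p_zero, log_rate = simulate_zip_factor()
print("fraction of zeros:", (counts == 0).mean())
```

Fitting such a model would require an estimation step such as the expectation-maximization algorithm mentioned in the abstract, which this sketch does not attempt.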
Pages: 91-101
Number of pages: 11
Related papers
50 in total
  • [1] Time Series of Multivariate Zero-inflated Poisson Counts
    Zhang, Chen
    Chen, Nan
    Zhang, Linmiao
    2016 IEEE INTERNATIONAL CONFERENCE ON INDUSTRIAL ENGINEERING AND ENGINEERING MANAGEMENT (IEEM), 2016, : 1365 - 1369
  • [2] A GLM-based zero-inflated generalized Poisson factor model for analyzing microbiome data
    Chi, Jinling
    Ye, Jimin
    Zhou, Ying
    FRONTIERS IN MICROBIOLOGY, 2024, 15
  • [3] A Flexible Zero-Inflated Poisson-Gamma Model with Application to Microbiome Sequence Count Data
    Jiang, Roulan
    Zhan, Xiang
    Wang, Tianying
    JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2023, 118 (542) : 792 - 804
  • [4] Hidden Markov models for zero-inflated Poisson counts with an application to substance use
    DeSantis, Stacia M.
    Bandyopadhyay, Dipankar
    STATISTICS IN MEDICINE, 2011, 30 (14) : 1678 - 1694
  • [5] Zero-inflated models and estimation in zero-inflated Poisson distribution
    Wagh, Yogita S.
    Kamalja, Kirtee K.
    COMMUNICATIONS IN STATISTICS-SIMULATION AND COMPUTATION, 2018, 47 (08) : 2248 - 2265
  • [6] A zero-inflated overdispersed hierarchical Poisson model
    Kassahun, Wondwosen
    Neyens, Thomas
    Faes, Christel
    Molenberghs, Geert
    Verbeke, Geert
    STATISTICAL MODELLING, 2014, 14 (05) : 439 - 456
  • [7] Testing overdispersion in the zero-inflated Poisson model
    Yang, Zhao
    Hardin, James W.
    Addy, Cheryl L.
    JOURNAL OF STATISTICAL PLANNING AND INFERENCE, 2009, 139 (09) : 3340 - 3353
  • [8] Zero-inflated Poisson regression mixture model
    Lim, Hwa Kyung
    Li, Wai Keung
    Yu, Philip L. H.
    COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2014, 71 : 151 - 158
  • [9] Zero-inflated Poisson model with group data
    Yang, Jun
    Zhang, Xin
    ADVANCED MATERIALS DESIGN AND MECHANICS, 2012, 569 : 627 - 631
  • [10] The LZIP: A Bayesian Latent Factor Model for Correlated Zero-Inflated Counts
    Neelon, Brian
    Chung, Dongjun
    BIOMETRICS, 2017, 73 (01) : 185 - 196