Zero-inflated Poisson factor model with application to microbiome read counts

被引:19
|
作者
Xu, Tianchen [1 ]
Demmer, Ryan T. [2 ]
Li, Gen [1 ]
机构
[1] Columbia Univ, Mailman Sch Publ Hlth, Dept Biostat, New York, NY 10032 USA
[2] Univ Minnesota, Sch Publ Hlth, Div Epidemiol, Minneapolis, MN 55455 USA
基金
美国国家卫生研究院;
关键词
16S sequencing; factor analysis; low rank; microbiome data; zero inflation; PERIODONTAL-DISEASE; EXPRESSION;
D O I
10.1111/biom.13272
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Dimension reduction of high-dimensional microbiome data facilitates subsequent analysis such as regression and clustering. Most existing reduction methods cannot fully accommodate the special features of the data such as count-valued and excessive zero reads. We propose a zero-inflated Poisson factor analysis model in this paper. The model assumes that microbiome read counts follow zero-inflated Poisson distributions with library size as offset and Poisson rates negatively related to the inflated zero occurrences. The latent parameters of the model form a low-rank matrix consisting of interpretable loadings and low-dimensional scores that can be used for further analyses. We develop an efficient and robust expectation-maximization algorithm for parameter estimation. We demonstrate the efficacy of the proposed method using comprehensive simulation studies. The application to the Oral Infections, Glucose Intolerance, and Insulin Resistance Study provides valuable insights into the relation between subgingival microbiome and periodontal disease.
引用
收藏
页码:91 / 101
页数:11
相关论文
共 50 条
  • [21] Identifiability of zero-inflated Poisson models
    Li, Chin-Shang
    BRAZILIAN JOURNAL OF PROBABILITY AND STATISTICS, 2012, 26 (03) : 306 - 312
  • [22] The analysis of zero-inflated count data: Beyond zero-inflated Poisson regression.
    Loeys, Tom
    Moerkerke, Beatrijs
    De Smet, Olivia
    Buysse, Ann
    BRITISH JOURNAL OF MATHEMATICAL & STATISTICAL PSYCHOLOGY, 2012, 65 (01): : 163 - 180
  • [23] An application of zero-inflated Poisson regression for software fault prediction
    Khoshgoftaar, TM
    Gao, KH
    Szabo, RM
    12TH INTERNATIONAL SYMPOSIUM ON SOFTWARE RELIABILITY ENGINEERING, PROCEEDINGS, 2001, : 66 - 73
  • [24] A marginalized zero-inflated Poisson regression model with random effects
    Long, D. Leann
    Preisser, John S.
    Herring, Amy H.
    Golin, Carol E.
    JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES C-APPLIED STATISTICS, 2015, 64 (05) : 815 - 830
  • [25] Exploring a zero-inflated Poisson regression model of software quality
    Khoshgoftaar, TM
    Gao, KH
    Szabo, RM
    SEVENTH ISSAT INTERNATIONAL CONFERENCE ON RELIABILITY AND QUALITY IN DESIGN, 2002, : 20 - 24
  • [26] A novel approach for zero-inflated count regression model: Zero-inflated Poisson generalized-Lindley linear model with applications
    Altun, Emrah
    Alqifari, Hana
    Eliwa, Mohamed S.
    AIMS MATHEMATICS, 2023, 8 (10): : 23272 - 23290
  • [27] ADAPTIVE LOG-LINEAR ZERO-INFLATED GENERALIZED POISSON AUTOREGRESSIVE MODEL WITH APPLICATIONS TO CRIME COUNTS
    Xu, Xiaofei
    Chen, Ying
    Chen, Cathy W. S.
    Lin, Xiancheng
    ANNALS OF APPLIED STATISTICS, 2020, 14 (03): : 1493 - 1515
  • [28] MODIFIED RIDGE ESTIMATOR IN ZERO-INFLATED POISSON REGRESSION MODEL
    Younus, Farah Abdul Ghani
    Othman, Rafal Adeeb
    Algamal, Zakariya Yahya
    INTERNATIONAL JOURNAL OF AGRICULTURAL AND STATISTICAL SCIENCES, 2022, 18 : 1245 - 1250
  • [29] A weighted zero-inflated Poisson model for estimation of recurrence of adenomas
    Hsu, Chiu-Hsieh
    STATISTICAL METHODS IN MEDICAL RESEARCH, 2007, 16 (02) : 155 - 166
  • [30] Some ridge regression estimators for the zero-inflated Poisson model
    Kibria, B. M. Golam
    Mansson, Kristofer
    Shukur, Ghazi
    JOURNAL OF APPLIED STATISTICS, 2013, 40 (04) : 721 - 735