Testing latent classes in gut microbiome data using generalized Poisson regression models

被引:2
|
作者
Qiao, Xinhui [1 ]
He, Hua [2 ]
Sun, Liuquan [3 ]
Bai, Shuo [2 ]
Ye, Peng [1 ,4 ]
机构
[1] Univ Int Business & Econ, Sch Stat, Beijing, Peoples R China
[2] Tulane Univ, Sch Publ Hlth & Trop Med, Dept Epidemiol, New Orleans, LA USA
[3] Chinese Acad Sci, Inst Appl Math, Acad Math & Syst Sci, Beijing, Peoples R China
[4] Univ Int Business & Econ, Sch Stat, Beijing 100029, Peoples R China
基金
中国国家自然科学基金;
关键词
Bogalusa Heart Study; generalized Poisson model; latent class; microbiome data; operational taxonomic units; zero-inflated generalized Poisson model; ZERO-INFLATION; SCORE TEST; MIXTURE;
D O I
10.1002/sim.9944
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Human microbiome research has gained increasing importance due to its critical roles in comprehending human health and disease. Within the realm of microbiome research, the data generated often involves operational taxonomic unit counts, which can frequently present challenges such as over-dispersion and zero-inflation. To address dispersion-related concerns, the generalized Poisson model offers a flexible solution, effectively handling data characterized by over-dispersion, equi-dispersion, and under-dispersion. Furthermore, the realm of zero-inflated generalized Poisson models provides a strategic avenue to simultaneously tackle both over-dispersion and zero-inflation. The phenomenon of zero-inflation frequently stems from the heterogeneous nature of study populations. It emerges when specific microbial taxa fail to thrive in the microbial community of certain subjects, consequently resulting in a consistent count of zeros for these individuals. This subset of subjects represents a latent class, where their zeros originate from the genuine absence of the microbial taxa. In this paper, we introduce a novel testing methodology designed to uncover such latent classes within generalized Poisson regression models. We establish a closed-form test statistic and deduce its asymptotic distribution based on estimating equations. To assess its efficacy, we conduct an extensive array of simulation studies, and further apply the test to detect latent classes in human gut microbiome data from the Bogalusa Heart Study.
引用
收藏
页码:102 / 124
页数:23
相关论文
共 50 条
  • [41] A priori ratemaking using bivariate Poisson regression models
    Bermudez i Morata, Lluis
    INSURANCE MATHEMATICS & ECONOMICS, 2009, 44 (01): : 135 - 141
  • [42] THE ANALYSIS OF RATES USING POISSON REGRESSION-MODELS
    FROME, EL
    BIOMETRICS, 1983, 39 (03) : 665 - 674
  • [43] Methods to Account for Uncertainty in Latent Class Assignments When Using Latent Classes as Predictors in Regression Models, with Application to Acculturation Strategy Measures
    Elliott, Michael R.
    Zhao, Zhangchen
    Mukherjee, Bhramar
    Kanaya, Alka
    Needham, Belinda L.
    EPIDEMIOLOGY, 2020, 31 (02) : 194 - 204
  • [44] How many latent classes of delinquent/criminal careers? Results from mixed Poisson regression analyses
    D'Unger, AV
    Land, KC
    McCall, PL
    Nagin, DS
    AMERICAN JOURNAL OF SOCIOLOGY, 1998, 103 (06) : 1593 - 1630
  • [45] Boosting Poisson regression models with telematics car driving data
    Gao, Guangyuan
    Wang, He
    Wuethrich, Mario, V
    MACHINE LEARNING, 2022, 111 (01) : 243 - 272
  • [46] Boosting Poisson regression models with telematics car driving data
    Guangyuan Gao
    He Wang
    Mario V. Wüthrich
    Machine Learning, 2022, 111 : 243 - 272
  • [47] Bayesian compositional generalized linear mixed models for disease prediction using microbiome data
    Zhang, Li
    Zhang, Xinyan
    Leach, Justin M.
    Rahman, A. K. M. F.
    Howell, Carrie R.
    Yi, Nengjun
    BMC BIOINFORMATICS, 2025, 26 (01):
  • [48] ESTIMATING STOCHASTIC POISSON INTENSITIES USING DEEP LATENT MODELS
    Wang, Ruixin
    Jaiswal, Prateek
    Honnappa, Harsha
    2020 WINTER SIMULATION CONFERENCE (WSC), 2020, : 596 - 607
  • [49] Testing for Breaks in Regression Models with Dependent Data
    Hidalgo, J.
    Dalla, V.
    NONPARAMETRIC STATISTICS, 2016, 175 : 19 - 45
  • [50] Specification testing for regression models with dependent data
    Hidalgo, J.
    JOURNAL OF ECONOMETRICS, 2008, 143 (01) : 143 - 165