Analyzing clustered count data with a cluster-specific random effect zero-inflated Conway-Maxwell-Poisson distribution

被引:12
|
作者
Choo-Wosoba, Hyoyoung [1 ]
Datta, Somnath [2 ]
机构
[1] Univ Louisville, Dept Bioinformat & Biostat, Louisville, KY 40202 USA
[2] Univ Florida, Dept Biostat, Gainesville, FL USA
基金
美国国家卫生研究院;
关键词
Gaussian-Hermite (G-H) quadrature; mixed effects model; next-generation sequencing (NGS) data; Poisson distribution; under- and over-dispersions; REGRESSION; MODEL;
D O I
10.1080/02664763.2017.1312299
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
Count data analysis techniques have been developed in biological and medical research areas. In particular, zero-inflated versions of parametric count distributions have been used to model excessive zeros that are often present in these assays. The most common count distributions for analyzing such data are Poisson and negative binomial. However, a Poisson distribution can only handle equidispersed data and a negative binomial distribution can only cope with overdispersion. However, a Conway-Maxwell-Poisson (CMP) distribution [4] can handle a wide range of dispersion. We show, with an illustrative data set on next-generation sequencing of maize hybrids, that both underdispersion and overdispersion can be present in genomic data. Furthermore, the maize data set consists of clustered observations and, therefore, we develop inference procedures for a zero-inflated CMP regression that incorporates a cluster-specific random effect term. Unlike the Gaussian models, the underlying likelihood is computationally challenging. We use a numerical approximation via a Gaussian quadrature to circumvent this issue. A test for checking zero-inflation has also been developed in our setting. Finite sample properties of our estimators and test have been investigated by extensive simulations. Finally, the statistical methodology has been applied to analyze the maize data mentioned before.
引用
收藏
页码:799 / 814
页数:16
相关论文
共 50 条
  • [41] Modeling zero-inflated count data using a covariate-dependent random effect model
    Wong, Kin-Yau
    Lam, K. F.
    STATISTICS IN MEDICINE, 2013, 32 (08) : 1283 - 1293
  • [42] Multilevel zero-inflated Generalized Poisson regression modeling for dispersed correlated count data
    Almasi, Afshin
    Eshraghian, Mohammad Reza
    Moghimbeigi, Abbas
    Rahimi, Abbas
    Mohammad, Kazem
    Fallahigilan, Sadegh
    STATISTICAL METHODOLOGY, 2016, 30 : 1 - 14
  • [43] A Bayesian approach of joint models for clustered zero-inflated count data with skewness and measurement errors
    Fu, Ying-zi
    Chu, Pei-xiao
    Lu, Li-ying
    JOURNAL OF APPLIED STATISTICS, 2015, 42 (04) : 745 - 761
  • [44] Models for Analyzing Zero-Inflated and Overdispersed Count Data: An Application to Cigarette and Marijuana Use
    Pittman, Brian
    Buta, Eugenia
    Krishnan-Sarin, Suchitra
    O'Malley, Stephanie S.
    Liss, Thomas
    Gueorguieva, Ralitza
    NICOTINE & TOBACCO RESEARCH, 2020, 22 (08) : 1390 - 1398
  • [45] Objective Bayes analysis of zero-inflated Poisson distribution with application to healthcare data
    Xu, Hai-Yan
    Xie, Min
    Goh, Thong Ngee
    IIE TRANSACTIONS, 2014, 46 (08) : 843 - 852
  • [46] A Flexible Zero-Inflated Poisson-Gamma Model with Application to Microbiome Sequence Count Data
    Jiang, Roulan
    Zhan, Xiang
    Wang, Tianying
    JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2023, 118 (542) : 792 - 804
  • [47] Profile monitoring for count data using Poisson and Conway-Maxwell-Poisson regression-based control charts under multicollinearity problem
    Mammadova, Ulduz
    Ozkale, M. Revan
    JOURNAL OF COMPUTATIONAL AND APPLIED MATHEMATICS, 2021, 388
  • [48] A GLM-based zero-inflated generalized Poisson factor model for analyzing microbiome data
    Chi, Jinling
    Ye, Jimin
    Zhou, Ying
    FRONTIERS IN MICROBIOLOGY, 2024, 15
  • [49] Sample size calculations for clustered count data based on zero-inflated discrete Weibull regression models
    Yoo, Hanna
    COMMUNICATIONS FOR STATISTICAL APPLICATIONS AND METHODS, 2024, 31 (01) : 55 - 64
  • [50] COMPARISON OF PROPENSITY SCORE WITH ZIP MODELS IN ANALYZING ZERO-INFLATED COUNT DATA IN OBSERVATIONAL STUDIES
    Tu, C.
    Koh, W. Y.
    VALUE IN HEALTH, 2016, 19 (03) : A89 - A90