Analyzing clustered count data with a cluster-specific random effect zero-inflated Conway-Maxwell-Poisson distribution

被引:12
|
作者
Choo-Wosoba, Hyoyoung [1 ]
Datta, Somnath [2 ]
机构
[1] Univ Louisville, Dept Bioinformat & Biostat, Louisville, KY 40202 USA
[2] Univ Florida, Dept Biostat, Gainesville, FL USA
基金
美国国家卫生研究院;
关键词
Gaussian-Hermite (G-H) quadrature; mixed effects model; next-generation sequencing (NGS) data; Poisson distribution; under- and over-dispersions; REGRESSION; MODEL;
D O I
10.1080/02664763.2017.1312299
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
Count data analysis techniques have been developed in biological and medical research areas. In particular, zero-inflated versions of parametric count distributions have been used to model excessive zeros that are often present in these assays. The most common count distributions for analyzing such data are Poisson and negative binomial. However, a Poisson distribution can only handle equidispersed data and a negative binomial distribution can only cope with overdispersion. However, a Conway-Maxwell-Poisson (CMP) distribution [4] can handle a wide range of dispersion. We show, with an illustrative data set on next-generation sequencing of maize hybrids, that both underdispersion and overdispersion can be present in genomic data. Furthermore, the maize data set consists of clustered observations and, therefore, we develop inference procedures for a zero-inflated CMP regression that incorporates a cluster-specific random effect term. Unlike the Gaussian models, the underlying likelihood is computationally challenging. We use a numerical approximation via a Gaussian quadrature to circumvent this issue. A test for checking zero-inflation has also been developed in our setting. Finite sample properties of our estimators and test have been investigated by extensive simulations. Finally, the statistical methodology has been applied to analyze the maize data mentioned before.
引用
收藏
页码:799 / 814
页数:16
相关论文
共 50 条
  • [1] Marginal regression models for clustered count data based on zero-inflated Conway-Maxwell-Poisson distribution with applications
    Choo-Wosoba, Hyoyoung
    Levy, Steven M.
    Datta, Somnath
    BIOMETRICS, 2016, 72 (02) : 606 - 618
  • [2] Three-level zero-inflated Conway-Maxwell-Poisson regression model for analyzing dispersed clustered count data with extra zeros
    Gholiabad, Somayeh Ghorbani
    Moghimbeigi, Abbas
    Faradmal, Javad
    SANKHYA-SERIES B-APPLIED AND INTERDISCIPLINARY STATISTICS, 2021, 83 (SUPPL 2): : 415 - 439
  • [3] Analyzing longitudinal clustered count data with zero inflation: Marginal modeling using the Conway-Maxwell-Poisson distribution
    Kang, Tong
    Levy, Steven M.
    Datta, Somnath
    BIOMETRICAL JOURNAL, 2021, 63 (04) : 761 - 786
  • [4] Three-level zero-inflated Conway–Maxwell–Poisson regression model for analyzing dispersed clustered count data with extra zeros
    Somayeh Ghorbani Gholiabad
    Abbas Moghimbeigi
    Javad Faradmal
    Sankhya B, 2021, 83 : 415 - 439
  • [5] The zero-inflated Conway-Maxwell-Poisson distribution: Bayesian inference, regression modeling and influence diagnostic
    Barriga, Gladys D. C.
    Louzada, Francisco
    STATISTICAL METHODOLOGY, 2014, 21 : 23 - 34
  • [6] Zero-inflated Conway-Maxwell Poisson Distribution to Analyze Discrete Data
    Sim, Shin Zhu
    Gupta, Ramesh C.
    Ong, Seng Huat
    INTERNATIONAL JOURNAL OF BIOSTATISTICS, 2018, 14 (01):
  • [7] The Conway-Maxwell-Poisson model for analyzing crash data
    Lord, Dominique
    Guikema, Seth D.
    APPLIED STOCHASTIC MODELS IN BUSINESS AND INDUSTRY, 2012, 28 (02) : 122 - 127
  • [8] A multilevel zero-inflated Conway-Maxwell type negative binomial model for analysing clustered count data
    Gholiabad, Somayeh Ghorbani
    Moghimbeigi, Abbas
    Faradmal, Javad
    Baghestani, Ahmad Reza
    JOURNAL OF STATISTICAL COMPUTATION AND SIMULATION, 2021, 91 (09) : 1762 - 1781
  • [9] A Bayesian approach for analyzing zero-inflated clustered count data with dispersion
    Choo-Wosoba, Hyoyoung
    Gaskins, Jeremy
    Levy, Steven
    Datta, Somnath
    STATISTICS IN MEDICINE, 2018, 37 (05) : 801 - 812
  • [10] Conway-Maxwell-Poisson regression models for dispersed count data
    Sellers, Kimberly F.
    Premeaux, Bailey
    WILEY INTERDISCIPLINARY REVIEWS-COMPUTATIONAL STATISTICS, 2021, 13 (06)