Marginal zero-inflated regression models for count data

被引:12
|
作者
Martin, Jacob [1 ]
Hall, Daniel B. [1 ]
机构
[1] Univ Georgia, Dept Stat, Athens, GA 30602 USA
关键词
Exponential dispersion family; generalized linear models; zero inflation; marginal models; EM algorithm; BINOMIAL REGRESSION; POISSON REGRESSION;
D O I
10.1080/02664763.2016.1225018
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
Data sets with excess zeroes are frequently analyzed in many disciplines. A common framework used to analyze such data is the zero-inflated (ZI) regression model. It mixes a degenerate distribution with point mass at zero with a non-degenerate distribution. The estimates from ZI models quantify the effects of covariates on the means of latent random variables, which are often not the quantities of primary interest. Recently, marginal zero-inflated Poisson (MZIP; Long etal. [A marginalized zero-inflated Poisson regression model with overall exposure effects. Stat. Med. 33 (2014), pp.5151-5165]) and negative binomial (MZINB; Preisser et al., 2016) models have been introduced that model the mean response directly. These models yield covariate effects that have simple interpretations that are, for many applications, more appealing than those available from ZI regression. This paper outlines a general framework for marginal zero-inflated models where the latent distribution is a member of the exponential dispersion family, focusing on common distributions for count data. In particular, our discussion includes the marginal zero-inflated binomial (MZIB) model, which has not been discussed previously. The details of maximum likelihood estimation via the EM algorithm are presented and the properties of the estimators as well as Wald and likelihood ratio-based inference are examined via simulation. Two examples presented illustrate the advantages of MZIP, MZINB, and MZIB models for practical data analysis.
引用
收藏
页码:1807 / 1826
页数:20
相关论文
共 50 条
  • [31] Semiparametric analysis of zero-inflated count data
    Lam, K. F.
    Xue, Hongqi
    Cheung, Yin Bun
    BIOMETRICS, 2006, 62 (04) : 996 - 1003
  • [32] Modelling correlated zero-inflated count data
    Dobbie, MJ
    Welsh, AH
    AUSTRALIAN & NEW ZEALAND JOURNAL OF STATISTICS, 2001, 43 (04) : 431 - 444
  • [33] Random effect models for repeated measures of zero-inflated count data
    Min, YY
    Agresti, A
    STATISTICAL MODELLING, 2005, 5 (01) : 1 - 19
  • [34] Models for zero-inflated count data using the Neyman type A distribution
    Dobbie, Melissa J.
    Welsh, Alan H.
    STATISTICAL MODELLING, 2001, 1 (01) : 65 - 80
  • [35] Bayesian Analysis for the Zero-inflated Regression Models
    Jane, Hakjin
    Kang, Yunhee
    Lee, S.
    Kim, Seong W.
    KOREAN JOURNAL OF APPLIED STATISTICS, 2008, 21 (04) : 603 - 613
  • [36] The Zero-Inflated Poisson - Probit regression model: a new model for count data
    Pho, Kim-Hung
    Truong, Buu-Chau
    COMMUNICATIONS IN STATISTICS-SIMULATION AND COMPUTATION, 2024,
  • [37] Count Regression and Machine Learning Techniques for Zero-Inflated Overdispersed Count Data: Application to Ecological Data
    Sidumo B.
    Sonono E.
    Takaidza I.
    Annals of Data Science, 2024, 11 (03) : 803 - 817
  • [38] Sample size calculations for clustered count data based on zero-inflated discrete Weibull regression models
    Yoo, Hanna
    COMMUNICATIONS FOR STATISTICAL APPLICATIONS AND METHODS, 2024, 31 (01) : 55 - 64
  • [39] Zero-inflated Modified Borel-Tanner Regression Model for Count Data
    Hassan, Anwar
    Ahmad, Ishfaq S.
    Ahmad, Peer Bilal
    AUSTRIAN JOURNAL OF STATISTICS, 2022, 51 (02) : 28 - 39
  • [40] Infants' gut microbiome data: A Bayesian Marginal Zero-inflated Negative Binomial regression model for multivariate analyses of count data
    Hajihosseini, Morteza
    Amini, Payam
    Saidi-Mehrabad, Alireza
    Dinu, Irina
    COMPUTATIONAL AND STRUCTURAL BIOTECHNOLOGY JOURNAL, 2023, 21 : 1621 - 1629