Addressing overdispersion and zero-inflation for clustered count data via new multilevel heterogenous hurdle models

Cited by: 1
Authors
Altinisik, Yasin [1]
Affiliations
[1] Sinop Univ, Dept Stat, Fac Sci & Literature, Sinop, Turkey
Keywords
Multilevel modeling; count data; overdispersion; zero-inflation; Poisson-Lindley distribution; Poisson-Ailamujia distribution; POISSON REGRESSION; SEVERITY; MISSPECIFICATION; SELECTION; TOBIT;
DOI
10.1080/02664763.2022.2096875
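Chinese Library Classification (CLC) number
O21 [Probability Theory and Mathematical Statistics]; C8 [Statistics]
Discipline classification codes
020208; 070103; 0714
Abstract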
Unobserved heterogeneity, which causes overdispersion, and an excessive number of zeros occupy a prominent place in the methodological development of count modeling. Insight into the mechanisms that induce heterogeneity is required to better understand overdispersion. When the heterogeneity originates in the stochastic component of the model, using a heterogeneous Poisson distribution for this component is an elegant solution. The hierarchical design of a study also contributes to heterogeneity, because unobservable effects at various levels add to the overdispersion. Zero-inflation, heterogeneity and the multilevel nature of count data each present challenges of their own; the presence of all three in one study makes modeling harder still. This study is therefore designed to merge the attractive features of these separate strands of solutions in order to meet such a comprehensive challenge. It differs from previous attempts in its choice of two recently developed heterogeneous distributions, namely Poisson-Lindley (PL) and Poisson-Ailamujia (PA), for the truncated part. Using generalized linear mixed modeling settings, the predictive performance of the multilevel PL and PA models and their hurdle counterparts was assessed in a comprehensive simulation study in terms of bias, precision and accuracy measures. The multilevel models were applied to two separate real-world examples to assess the practical implications of the new models proposed in this study.
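As a rough illustration of the idea in the abstract, the sketch below evaluates the standard one-parameter Poisson-Lindley probability mass function and a hurdle version that uses a zero-truncated PL for the positive counts. It is not the paper's multilevel model: there are no covariates or random effects, and the names `pl_pmf`, `hurdle_pl_pmf`, `pi_zero` and the parameter values are assumptions chosen only for this example.

```python
def pl_pmf(x, theta):
    """Poisson-Lindley pmf: theta^2 * (x + theta + 2) / (theta + 1)^(x + 3)."""
    return theta**2 * (x + theta + 2) / (theta + 1) ** (x + 3)


def hurdle_pl_pmf(x, pi_zero, theta):
    """Hurdle pmf (illustrative): P(X = 0) = pi_zero, and positive counts
    follow a zero-truncated Poisson-Lindley scaled by (1 - pi_zero)."""
    if x == 0:
        return pi_zero
    return (1 - pi_zero) * pl_pmf(x, theta) / (1 - pl_pmf(0, theta))


# Hypothetical parameter values, chosen only for illustration.
theta = 1.5
pi_zero = 0.6

# The tail beyond 200 is negligible for this theta, so a finite sum suffices.
support = range(200)
total_pl = sum(pl_pmf(x, theta) for x in support)
total_hurdle = sum(hurdle_pl_pmf(x, pi_zero, theta) for x in support)

mean_pl = sum(x * pl_pmf(x, theta) for x in support)
var_pl = sum((x - mean_pl) ** 2 * pl_pmf(x, theta) for x in support)

print(f"PL sums to {total_pl:.6f}, hurdle PL sums to {total_hurdle:.6f}")
print(f"PL mean {mean_pl:.4f}, variance {var_pl:.4f}")
```

The variance exceeds the mean, which is the overdispersion that a plain Poisson distribution cannot capture, while the hurdle version sets the probability of a zero independently of the count distribution.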
Pages: 408-433
Page count: 26