Modelling zero inflated and under-reported count data

被引:2
|
作者
Sengupta, Debjit [1 ,2 ]
Roy, Surupa [1 ]
机构
[1] St Xaviers Coll, Dept Stat, Kolkata, India
[2] St Xaviers Coll, Kolkata 380009, India
关键词
Excess zero; undercount; surrogate; likelihood; bootstrap method; DIAGNOSTIC MISCLASSIFICATION; BAYESIAN-APPROACH; POISSON RATE; REGRESSION;
D O I
10.1080/00949655.2023.2182883
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Poisson distribution is a classic choice for modelling unbounded count data. However, count data arising in various fields of scientific research often have excess zeros and are under-reported. In such situations, Poisson distribution gives a poor fit and Poisson model based inferences lead to biased estimators and inaccurate confidence intervals. In this paper we develop a flexible model which can accommodate excess zeros and undercount. Internal validation data has been used for making likelihood based inferences. The impact of ignoring undercount and excess zeros are studied through extensive simulations. The finite sample behaviour of the estimators are investigated through bootstrap methodology. Finally, a real life data which is supposedly under-reported and known to have excess zeros is analysed.
引用
收藏
页码:2390 / 2409
页数:20
相关论文
共 50 条
  • [41] Modelling count data with excessive zeros: The need for class prediction in zero-inflated models and the issue of data generation in choosing between zero-inflated and generic mixture models for dental caries data
    Gilthorpe, Mark S.
    Frydenberg, Morten
    Cheng, Yaping
    Baelum, Vibeke
    STATISTICS IN MEDICINE, 2009, 28 (28) : 3539 - 3553
  • [42] A joint modeling of longitudinal zero-inflated count data and time to event data
    Kim, Donguk
    Chun, Jihun
    KOREAN JOURNAL OF APPLIED STATISTICS, 2016, 29 (07) : 1459 - 1473
  • [43] Count Regression and Machine Learning Techniques for Zero-Inflated Overdispersed Count Data: Application to Ecological Data
    Sidumo B.
    Sonono E.
    Takaidza I.
    Annals of Data Science, 2024, 11 (03) : 803 - 817
  • [45] Zero-Inflated Poisson Regression Models with Right Censored Count Data
    Saffari, Seyed Ehsan
    Adnan, Robiah
    MATEMATIKA, 2011, 27 (01) : 21 - 29
  • [46] Marginalized Zero-Inflated Bell Regression Models for Overdispersed Count Data
    Amani, Kouakou Mathias
    Kouakou, Konan Jean Geoffroy
    Hili, Ouagnina
    JOURNAL OF STATISTICAL THEORY AND PRACTICE, 2025, 19 (02)
  • [47] IRT-ZIP Modeling for Multivariate Zero-Inflated Count Data
    Wang, Lijuan
    JOURNAL OF EDUCATIONAL AND BEHAVIORAL STATISTICS, 2010, 35 (06) : 671 - 692
  • [48] Zero-inflated count distributions for capture-mark-reencounter data
    Riecke, Thomas V.
    Gibson, Daniel
    Sedinger, James S.
    Schaub, Michael
    ECOLOGY AND EVOLUTION, 2022, 12 (09):
  • [49] Detecting overdispersion in count data: A zero-inflated Poisson regression analysis
    Jamil, Siti Afiqah Muhamad
    Abdullah, M. Asrul Affendi
    Long, Kek Sie
    Nor, Maria Elena
    Mohamed, Maryati
    Ismail, Norradihah
    1ST INTERNATIONAL CONFERENCE ON APPLIED & INDUSTRIAL MATHEMATICS AND STATISTICS 2017 (ICOAIMS 2017), 2017, 890
  • [50] Bayesian semiparametric zero-inflated Poisson model for longitudinal count data
    Dagne, Getachew A.
    MATHEMATICAL BIOSCIENCES, 2010, 224 (02) : 126 - 130