Modelling zero inflated and under-reported count data

被引:2
|
作者
Sengupta, Debjit [1 ,2 ]
Roy, Surupa [1 ]
机构
[1] St Xaviers Coll, Dept Stat, Kolkata, India
[2] St Xaviers Coll, Kolkata 380009, India
关键词
Excess zero; undercount; surrogate; likelihood; bootstrap method; DIAGNOSTIC MISCLASSIFICATION; BAYESIAN-APPROACH; POISSON RATE; REGRESSION;
D O I
10.1080/00949655.2023.2182883
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Poisson distribution is a classic choice for modelling unbounded count data. However, count data arising in various fields of scientific research often have excess zeros and are under-reported. In such situations, Poisson distribution gives a poor fit and Poisson model based inferences lead to biased estimators and inaccurate confidence intervals. In this paper we develop a flexible model which can accommodate excess zeros and undercount. Internal validation data has been used for making likelihood based inferences. The impact of ignoring undercount and excess zeros are studied through extensive simulations. The finite sample behaviour of the estimators are investigated through bootstrap methodology. Finally, a real life data which is supposedly under-reported and known to have excess zeros is analysed.
引用
收藏
页码:2390 / 2409
页数:20
相关论文
共 50 条
  • [1] Modelling correlated zero-inflated count data
    Dobbie, MJ
    Welsh, AH
    AUSTRALIAN & NEW ZEALAND JOURNAL OF STATISTICS, 2001, 43 (04) : 431 - 444
  • [2] Generalized additive modelling and zero inflated count data
    Barry, SC
    Welsh, AH
    ECOLOGICAL MODELLING, 2002, 157 (2-3) : 179 - 188
  • [3] Estimation of mean using under-reported and overdispersed count data
    Sengupta, Debjit
    Roy, Surupa
    COMMUNICATIONS IN STATISTICS-SIMULATION AND COMPUTATION, 2024,
  • [4] Mediation analysis for count and zero-inflated count data
    Cheng, Jing
    Cheng, Nancy F.
    Guo, Zijian
    Gregorich, Steven
    Ismail, Amid I.
    Gansky, Stuart A.
    STATISTICAL METHODS IN MEDICAL RESEARCH, 2018, 27 (09) : 2756 - 2774
  • [5] MODELLING ZERO-INFLATED COUNT DATA WITH A SPECIAL CASE OF THE GENERALISED POISSON DISTRIBUTION
    Calderin-Ojeda, Enrique
    Gomez-Deniz, Emilio
    Barranco-Chamorro, Inmaculada
    ASTIN BULLETIN, 2019, 49 (03): : 689 - 707
  • [6] Semiparametric analysis of zero-inflated count data
    Lam, K. F.
    Xue, Hongqi
    Cheung, Yin Bun
    BIOMETRICS, 2006, 62 (04) : 996 - 1003
  • [7] A New Zero–One-Inflated Poisson–Lindley Distribution for Modelling Overdispersed Count Data
    Razik Ridzuan Mohd Tajuddin
    Noriszura Ismail
    Kamarulzaman Ibrahim
    Shaiful Anuar Abu Bakar
    Bulletin of the Malaysian Mathematical Sciences Society, 2022, 45 : 21 - 35
  • [8] The analysis of zero-inflated count data: Beyond zero-inflated Poisson regression.
    Loeys, Tom
    Moerkerke, Beatrijs
    De Smet, Olivia
    Buysse, Ann
    BRITISH JOURNAL OF MATHEMATICAL & STATISTICAL PSYCHOLOGY, 2012, 65 (01): : 163 - 180
  • [9] Modelling and coherent forecasting of zero-inflated count time series
    Maiti, Raju
    Biswas, Atanu
    Guha, Apratim
    Ong, Seng Huat
    STATISTICAL MODELLING, 2014, 14 (05) : 375 - 398
  • [10] Decision tree approaches for zero-inflated count data
    Lee, Seong-Keon
    Jin, Seohoon
    JOURNAL OF APPLIED STATISTICS, 2006, 33 (08) : 853 - 865