Modelling zero inflated and under-reported count data

Cited by: 2
Authors
Sengupta, Debjit [1 ,2 ]
Roy, Surupa [1 ]
Affiliations
[1] St Xaviers Coll, Dept Stat, Kolkata, India
[2] St Xaviers Coll, Kolkata 380009, India
Keywords
Excess zero; undercount; surrogate; likelihood; bootstrap method; DIAGNOSTIC MISCLASSIFICATION; BAYESIAN-APPROACH; POISSON RATE; REGRESSION;
DOI
10.1080/00949655.2023.2182883
Chinese Library Classification
TP39 [Computer applications];
Discipline classification codes
081203 ; 0835 ;
Abstract
The Poisson distribution is a classic choice for modelling unbounded count data. However, count data arising in various fields of scientific research often have excess zeros and are under-reported. In such situations, the Poisson distribution gives a poor fit, and Poisson-model-based inferences lead to biased estimators and inaccurate confidence intervals. In this paper we develop a flexible model that can accommodate excess zeros and undercount. Internal validation data are used for making likelihood-based inferences. The impact of ignoring undercount and excess zeros is studied through extensive simulations. The finite-sample behaviour of the estimators is investigated through bootstrap methodology. Finally, a real-life data set that is supposedly under-reported and known to have excess zeros is analysed.
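The effect described in the abstract can be illustrated with a small simulation. The sketch below is not the paper's model, which this record does not spell out. It assumes a zero-inflated Poisson for the true counts and independent reporting of each event with a fixed probability (binomial thinning). The names `sample_poisson`, `simulate_reported_counts`, `pi_zero` and `report_prob` are hypothetical and chosen for illustration only.

```python
import math
import random


def sample_poisson(rng, lam):
    """Draw one Poisson(lam) variate (Knuth's method; suitable for small lam)."""
    threshold = math.exp(-lam)
    k, p = 0, 1.0
    while True:
        p *= rng.random()
        if p <= threshold:
            return k
        k += 1


def simulate_reported_counts(n, lam, pi_zero, report_prob, seed=0):
    """Simulate n reported counts under the assumed data-generating process.

    Assumption: with probability pi_zero the true count is a structural zero,
    otherwise it is Poisson(lam); each true event is then reported
    independently with probability report_prob.
    """
    rng = random.Random(seed)
    reported = []
    for _ in range(n):
        if rng.random() < pi_zero:
            true_count = 0
        else:
            true_count = sample_poisson(rng, lam)
        # Binomial thinning: keep each event with probability report_prob.
        observed = sum(1 for _ in range(true_count) if rng.random() < report_prob)
        reported.append(observed)
    return reported


lam, pi_zero, report_prob = 4.0, 0.3, 0.7
y = simulate_reported_counts(20000, lam, pi_zero, report_prob)

# A plain Poisson fit estimates the rate by the sample mean of the reported counts.
naive_rate = sum(y) / len(y)
# Under the assumptions above, the reported mean is (1 - pi_zero) * lam * report_prob.
expected_mean = (1 - pi_zero) * lam * report_prob
# Share of zeros in the data versus the share a Poisson(naive_rate) would predict.
zero_share = sum(1 for v in y if v == 0) / len(y)
poisson_zero_share = math.exp(-naive_rate)

print(f"true rate {lam}, naive Poisson estimate {naive_rate:.3f}")
print(f"observed zero share {zero_share:.3f}, Poisson-implied {poisson_zero_share:.3f}")
```

Under these assumptions the naive Poisson rate estimate sits well below the true rate, and the fitted Poisson predicts far fewer zeros than are observed. This is the kind of bias and poor fit the abstract attributes to ignoring undercount and excess zeros.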
Pages: 2390-2409
Number of pages: 20