Using observation-level random effects to model overdispersion in count data in ecology and evolution

Cited by: 831

Authors:

Harrison, Xavier A. [1]

Affiliations:

[1] Zool Soc London, Inst Zool, London NW1 4RY, England

Source:

PEERJ | 2014 / Volume 2

Keywords:

Observation-level random effect; Explained variance; r-squared; Poisson-lognormal models; Quasi-Poisson; Generalized linear mixed models; INFERENCE;

DOI:

10.7717/peerj.616

Chinese Library Classification:

O [Mathematical Sciences and Chemistry]; P [Astronomy and Earth Sciences]; Q [Biological Sciences]; N [General Natural Sciences];

Discipline classification codes:

07 ; 0710 ; 09 ;

Abstract:

Overdispersion is common in models of count data in ecology and evolutionary biology, and can occur due to missing covariates, non-independent (aggregated) data, or an excess frequency of zeroes (zero-inflation). Accounting for overdispersion in such models is vital, as failing to do so can lead to biased parameter estimates and false conclusions regarding hypotheses of interest. Observation-level random effects (OLRE), where each data point receives a unique level of a random effect that models the extra-Poisson variation present in the data, are commonly employed to cope with overdispersion in count data. However, studies investigating the efficacy of observation-level random effects as a means to deal with overdispersion are scarce. Here I use simulations to show that in cases where overdispersion is caused by random extra-Poisson noise, or aggregation in the count data, observation-level random effects yield more accurate parameter estimates compared to when overdispersion is simply ignored. Conversely, OLRE fail to reduce bias in zero-inflated data, and in some cases increase bias at high levels of overdispersion. There was a positive relationship between the magnitude of overdispersion and the degree of bias in parameter estimates. Critically, the simulations reveal that failing to account for overdispersion in mixed models can erroneously inflate measures of explained variance (r²), which may lead to researchers overestimating the predictive power of variables of interest. This work suggests that observation-level random effects provide a simple and robust means to account for overdispersion in count data, but also that their ability to minimise bias is not uniform across all types of overdispersion, so they must be applied judiciously.
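The "random extra-Poisson noise" case described in the abstract can be illustrated with a minimal sketch. This is not the author's simulation code: the function names, sample size, intercept, and noise standard deviation below are all hypothetical choices made for illustration. The sketch draws counts from a plain Poisson model and from a Poisson-lognormal model, in which every observation gets its own normal deviate on the log scale (the data-generating process that an OLRE is meant to capture), and then compares their variance-to-mean ratios.

```python
import math
import random
import statistics


def poisson_draw(rng, lam):
    """Draw one Poisson(lam) variate using Knuth's multiplication method."""
    threshold = math.exp(-lam)
    k, p = 0, 1.0
    while True:
        p *= rng.random()
        if p <= threshold:
            return k
        k += 1


def simulate_counts(n, intercept, olre_sd, seed=1):
    """Simulate n counts whose log-mean is `intercept` plus an
    observation-level normal deviate with standard deviation `olre_sd`.
    Setting olre_sd to 0 gives plain Poisson data. (Hypothetical helper.)"""
    rng = random.Random(seed)
    counts = []
    for _ in range(n):
        eps = rng.gauss(0.0, olre_sd) if olre_sd > 0 else 0.0
        counts.append(poisson_draw(rng, math.exp(intercept + eps)))
    return counts


def dispersion_ratio(counts):
    """Variance-to-mean ratio: close to 1 for Poisson data,
    above 1 when the data are overdispersed."""
    return statistics.variance(counts) / statistics.mean(counts)


# Assumed, illustrative parameter values (not taken from the paper).
poisson_counts = simulate_counts(5000, intercept=1.5, olre_sd=0.0)
lognormal_counts = simulate_counts(5000, intercept=1.5, olre_sd=0.7)

print(f"Poisson dispersion ratio:           {dispersion_ratio(poisson_counts):.2f}")
print(f"Poisson-lognormal dispersion ratio: {dispersion_ratio(lognormal_counts):.2f}")
```

The sketch only generates and summarises data. Fitting a mixed model with an observation-level random effect would require a dedicated mixed-model package and is outside its scope.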

Pages: 19

50 items in total

[21] Model-based biclustering for overdispersed count data with application in microbial ecology
Aubert, Julie
Schbath, Sophie
Robin, Stephane
METHODS IN ECOLOGY AND EVOLUTION, 2021, 12 (06) : 1050 - 1061
[22] Modeling zero-inflated count data using a covariate-dependent random effect model
Wong, Kin-Yau
Lam, K. F.
STATISTICS IN MEDICINE, 2013, 32 (08) : 1283 - 1293
[23] Monitoring attributed social networks based on count data and random effects
Mogouie, H.
Ardali, Gh A. Raissi
Amiri, A.
Samani, E. Bahrami
SCIENTIA IRANICA, 2022, 29 (03) : 1581 - 1591
[24] How many people do you know in prison?: Using overdispersion in count data to estimate social structure in networks
Zheng, Tian
Salganik, Matthew J.
Gelman, Andrew
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2006, 101 (474) : 409 - 423
[25] Monitoring attributed social networks based on count data and random effects
Mogouie, H.
Ardali, Gh.A. Raissi
Amiri, A.
Samani, E. Bahrami
Scientia Iranica, 2022, 29 (3E) : 1581 - 1591
[26] A RANDOM EFFECTS MODEL FOR BINARY DATA
CONAWAY, MR
BIOMETRICS, 1990, 46 (02) : 317 - 328
[27] Analyzing Overdispersed Antenatal Care Count Data in Bangladesh: Mixed Poisson Regression with Individual-Level Random Effects
Hossain, Zakir
Maria
AUSTRIAN JOURNAL OF STATISTICS, 2021, 50 (04) : 78 - 90
[28] Patent data analysis using functional count data model
Kim, Jong-Min
Kim, Nak-Kyeong
Jung, Yoonsung
Jun, Sunghae
SOFT COMPUTING, 2019, 23 (18) : 8815 - 8826
[29] Patent data analysis using functional count data model
Jong-Min Kim
Nak-Kyeong Kim
Yoonsung Jung
Sunghae Jun
Soft Computing, 2019, 23 : 8815 - 8826
[30] Model misspecification effects in clustered count data analysis
Jowaheer, V
STATISTICS & PROBABILITY LETTERS, 2006, 76 (05) : 470 - 478
