Mitigating Bias in Generalized Linear Mixed Models: The Case for Bayesian Nonparametrics

被引:7
|
作者
Antonelli, Joseph [1 ]
Trippa, Lorenzo [2 ]
Haneuse, Sebastien [3 ]
机构
[1] Harvard Univ, Chan Sch Publ Hlth, Deparment Biostat, 655 Huntington Ave, Boston, MA 02115 USA
[2] Dana Farber Canc Inst, Ctr Life Sci, Dept Biostat, 3 Blackfan Circle, Boston, MA 02115 USA
[3] Harvard Univ, Chan Sch Publ Hlth, Dept Biostat, 655 Huntington Ave, Boston, MA 02115 USA
关键词
Dirichlet process prior; generalized linear mixed models; model misspecification; random effects; RANDOM-EFFECTS MISSPECIFICATION; MAXIMUM-LIKELIHOOD-ESTIMATION; II ERROR; MIXTURE; DISTRIBUTIONS; INFERENCE;
D O I
10.1214/15-STS533
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
Generalized linear mixed models are a common statistical tool for the analysis of clustered or longitudinal data where correlation is accounted for through cluster-specific random effects. In practice, the distribution of the random effects is typically taken to be a Normal distribution, although if this does not hold then the model is misspecified and standard estimation/inference may be invalid. An alternative is to perform a so-called nonparametric Bayesian analyses in which one assigns a Dirichlet process (DP) prior to the unknown distribution of the random effects. In this paper we examine operating characteristics for estimation of fixed effects and random effects based on such an analysis under a range of "true" random effects distributions. As part of this we investigate various approaches for selection of the precision parameter of the DP prior. In addition, we illustrate the use of the methods with an analysis of post-operative complications among n = 18,643 female Medicare beneficiaries who underwent a hysterectomy procedure at N = 503 hospitals in the US. Overall, we conclude that using the DP prior in modeling the random effect distribution results in large reductions of bias with little loss of efficiency. While no single choice for the precision parameter will be optimal in all settings, certain strategies such as importance sampling or empirical Bayes can be used to obtain reasonable results in a broad range of data scenarios.
引用
收藏
页码:80 / 95
页数:16
相关论文
共 50 条
  • [21] Bayesian Modeling of Random Effects Covariance Matrix for Generalized Linear Mixed Models
    Lee, Keunbaik
    COMMUNICATIONS FOR STATISTICAL APPLICATIONS AND METHODS, 2013, 20 (03) : 235 - 240
  • [22] Bayesian treed generalized linear models
    Chipman, HA
    George, EI
    McCulloch, RE
    BAYESIAN STATISTICS 7, 2003, : 85 - 103
  • [23] Bayesian Cholesky factor models in random effects covariance matrix for generalized linear mixed models
    Lee, Keunbaik
    Yoo, Jae Keun
    COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2014, 80 : 111 - 116
  • [24] Bayesian Generalized Horseshoe Estimation of Generalized Linear Models
    Schmidt, Daniel F.
    Makalic, Enes
    MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES, ECML PKDD 2019, PT II, 2020, 11907 : 598 - 613
  • [25] Selection bias in linear mixed models
    Grilli, Leonardo
    Rampichini, Carla
    METRON-INTERNATIONAL JOURNAL OF STATISTICS, 2010, 68 (03): : 309 - 329
  • [26] Robust nonparametrics in mixed-MANOCOVA models
    Sen, PK
    JOURNAL OF STATISTICAL PLANNING AND INFERENCE, 1999, 75 (02) : 433 - 451
  • [27] A bias correction regression calibration approach in generalized linear mixed measurement error models
    Wang, N
    Lin, XH
    Guttierrez, RG
    COMMUNICATIONS IN STATISTICS-THEORY AND METHODS, 1999, 28 (01) : 217 - 232
  • [28] Bayesian compositional generalized linear mixed models for disease prediction using microbiome data
    Zhang, Li
    Zhang, Xinyan
    Leach, Justin M.
    Rahman, A. K. M. F.
    Howell, Carrie R.
    Yi, Nengjun
    BMC BIOINFORMATICS, 2025, 26 (01):
  • [29] Bayesian inference for generalized linear mixed models: A comparison of different statstical software procedures
    Yimer, Belay Birlie
    Shkedy, Ziv
    COGENT MATHEMATICS & STATISTICS, 2021, 8
  • [30] A Bayesian framework for generalized linear mixed models in genome-wide association studies
    Wang, X.
    Philip, V.
    Carter, G.
    HUMAN GENOMICS, 2016, 10