Mitigating Bias in Generalized Linear Mixed Models: The Case for Bayesian Nonparametrics

被引:7
|
作者
Antonelli, Joseph [1 ]
Trippa, Lorenzo [2 ]
Haneuse, Sebastien [3 ]
机构
[1] Harvard Univ, Chan Sch Publ Hlth, Deparment Biostat, 655 Huntington Ave, Boston, MA 02115 USA
[2] Dana Farber Canc Inst, Ctr Life Sci, Dept Biostat, 3 Blackfan Circle, Boston, MA 02115 USA
[3] Harvard Univ, Chan Sch Publ Hlth, Dept Biostat, 655 Huntington Ave, Boston, MA 02115 USA
关键词
Dirichlet process prior; generalized linear mixed models; model misspecification; random effects; RANDOM-EFFECTS MISSPECIFICATION; MAXIMUM-LIKELIHOOD-ESTIMATION; II ERROR; MIXTURE; DISTRIBUTIONS; INFERENCE;
D O I
10.1214/15-STS533
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
Generalized linear mixed models are a common statistical tool for the analysis of clustered or longitudinal data where correlation is accounted for through cluster-specific random effects. In practice, the distribution of the random effects is typically taken to be a Normal distribution, although if this does not hold then the model is misspecified and standard estimation/inference may be invalid. An alternative is to perform a so-called nonparametric Bayesian analyses in which one assigns a Dirichlet process (DP) prior to the unknown distribution of the random effects. In this paper we examine operating characteristics for estimation of fixed effects and random effects based on such an analysis under a range of "true" random effects distributions. As part of this we investigate various approaches for selection of the precision parameter of the DP prior. In addition, we illustrate the use of the methods with an analysis of post-operative complications among n = 18,643 female Medicare beneficiaries who underwent a hysterectomy procedure at N = 503 hospitals in the US. Overall, we conclude that using the DP prior in modeling the random effect distribution results in large reductions of bias with little loss of efficiency. While no single choice for the precision parameter will be optimal in all settings, certain strategies such as importance sampling or empirical Bayes can be used to obtain reasonable results in a broad range of data scenarios.
引用
收藏
页码:80 / 95
页数:16
相关论文
共 50 条
  • [1] Bayesian inference for generalized linear mixed models
    Fong, Youyi
    Rue, Havard
    Wakefield, Jon
    BIOSTATISTICS, 2010, 11 (03) : 397 - 412
  • [2] General design Bayesian generalized linear mixed models
    Zhao, Y.
    Staudenmayer, J.
    Coull, B. A.
    Wand, M. P.
    STATISTICAL SCIENCE, 2006, 21 (01) : 35 - 51
  • [3] Bayesian covariance selection in generalized linear mixed models
    Cai, Bo
    Dunson, David B.
    BIOMETRICS, 2006, 62 (02) : 446 - 457
  • [4] Bayesian model selection for generalized linear mixed models
    Xu, Shuangshuang
    Ferreira, Marco A. R.
    Porter, Erica M.
    Franck, Christopher T.
    BIOMETRICS, 2023, 79 (04) : 3266 - 3278
  • [5] Reference Bayesian methods for generalized linear mixed models
    Natarajan, R
    Kass, RE
    JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2000, 95 (449) : 227 - 237
  • [6] Approximate Bayesian Inference in Spatial Generalized Linear Mixed Models
    Eidsvik, Jo
    Martino, Sara
    Rue, Havard
    SCANDINAVIAN JOURNAL OF STATISTICS, 2009, 36 (01) : 1 - 22
  • [7] Bayesian Nonparametrics for Stochastic Epidemic Models
    Kypraios, Theodore
    O'Neill, Philip D.
    STATISTICAL SCIENCE, 2018, 33 (01) : 44 - 56
  • [8] Bias correction in generalized linear mixed models with multiple components of dispersion
    Lin, XH
    Breslow, NE
    JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 1996, 91 (435) : 1007 - 1016
  • [9] Estimation of group means using Bayesian generalized linear mixed models
    LaLonde, Amy
    Qu, Yongming
    PHARMACEUTICAL STATISTICS, 2020, 19 (04) : 482 - 491
  • [10] A semi-parametric Bayesian approach to generalized linear mixed models
    Kleinman, KP
    Ibrahim, JG
    STATISTICS IN MEDICINE, 1998, 17 (22) : 2579 - 2596