Mitigating Bias in Generalized Linear Mixed Models: The Case for Bayesian Nonparametrics

被引:7
|
作者
Antonelli, Joseph [1 ]
Trippa, Lorenzo [2 ]
Haneuse, Sebastien [3 ]
机构
[1] Harvard Univ, Chan Sch Publ Hlth, Deparment Biostat, 655 Huntington Ave, Boston, MA 02115 USA
[2] Dana Farber Canc Inst, Ctr Life Sci, Dept Biostat, 3 Blackfan Circle, Boston, MA 02115 USA
[3] Harvard Univ, Chan Sch Publ Hlth, Dept Biostat, 655 Huntington Ave, Boston, MA 02115 USA
关键词
Dirichlet process prior; generalized linear mixed models; model misspecification; random effects; RANDOM-EFFECTS MISSPECIFICATION; MAXIMUM-LIKELIHOOD-ESTIMATION; II ERROR; MIXTURE; DISTRIBUTIONS; INFERENCE;
D O I
10.1214/15-STS533
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
Generalized linear mixed models are a common statistical tool for the analysis of clustered or longitudinal data where correlation is accounted for through cluster-specific random effects. In practice, the distribution of the random effects is typically taken to be a Normal distribution, although if this does not hold then the model is misspecified and standard estimation/inference may be invalid. An alternative is to perform a so-called nonparametric Bayesian analyses in which one assigns a Dirichlet process (DP) prior to the unknown distribution of the random effects. In this paper we examine operating characteristics for estimation of fixed effects and random effects based on such an analysis under a range of "true" random effects distributions. As part of this we investigate various approaches for selection of the precision parameter of the DP prior. In addition, we illustrate the use of the methods with an analysis of post-operative complications among n = 18,643 female Medicare beneficiaries who underwent a hysterectomy procedure at N = 503 hospitals in the US. Overall, we conclude that using the DP prior in modeling the random effect distribution results in large reductions of bias with little loss of efficiency. While no single choice for the precision parameter will be optimal in all settings, certain strategies such as importance sampling or empirical Bayes can be used to obtain reasonable results in a broad range of data scenarios.
引用
收藏
页码:80 / 95
页数:16
相关论文
共 50 条
  • [31] Bayesian Nonparametrics and Biostatistics: The Case of PET Imaging
    Fall, Mame Diarra
    INTERNATIONAL JOURNAL OF BIOSTATISTICS, 2019, 15 (02):
  • [32] BIAS CORRECTION IN GENERALIZED LINEAR-MODELS
    CORDEIRO, GM
    MCCULLAGH, P
    JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-METHODOLOGICAL, 1991, 53 (03): : 629 - 643
  • [33] Bayesian generalized linear models in a terabyte world
    Zoeter, Onno
    PROCEEDINGS OF THE 5TH INTERNATIONAL SYMPOSIUM ON IMAGE AND SIGNAL PROCESSING AND ANALYSIS, 2007, : 435 - 440
  • [34] Generalized, linear and mixed models.
    Land, KC
    SOCIOLOGICAL METHODS & RESEARCH, 2002, 30 (04) : 583 - 584
  • [35] GENERALIZED LINEAR MIXED MODELS IN BIOSTATISTICS
    CLAYTON, D
    STATISTICIAN, 1992, 41 (03): : 327 - 328
  • [36] Bayesian crossover designs for generalized linear models
    Singh, Satya Prakash
    Mukhopadhyay, Siuli
    COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2016, 104 : 35 - 50
  • [37] Bayesian inference for sparse generalized linear models
    Seeger, Matthias
    Gerwinn, Sebastian
    Bethge, Matthias
    MACHINE LEARNING: ECML 2007, PROCEEDINGS, 2007, 4701 : 298 - +
  • [38] Classification of territory risk by generalized linear and generalized linear mixed models
    Xie, Shengkun
    Gan, Chong
    JOURNAL OF MANAGEMENT ANALYTICS, 2023, 10 (02) : 223 - 246
  • [39] Generalized linear mixed model with bayesian rank likelihood
    Doroshenko, Lyubov
    Liseo, Brunero
    STATISTICAL METHODS AND APPLICATIONS, 2023, 32 (02): : 425 - 446
  • [40] Generalized linear mixed model with bayesian rank likelihood
    Lyubov Doroshenko
    Brunero Liseo
    Statistical Methods & Applications, 2023, 32 : 425 - 446