Bayesian mixture models (in)consistency for the number of clusters

被引:1
|
作者
Alamichel, Louise [1 ]
Bystrova, Daria [1 ,2 ]
Arbel, Julyan [1 ]
King, Guillaume Kon Kam [3 ]
机构
[1] Univ Grenoble Alpes, Inria, Grenoble INP, LJK,CNRS, Grenoble, France
[2] Univ Savoie Mont Blanc, CNRS, Lab Ecol Alpine, Univ Grenoble Alpes, Grenoble, France
[3] Univ Paris Saclay, INRAE, MaIAGE, Jouy En Josas, France
关键词
clustering; finite mixtures; finite-dimensional BNP representations; Gibbs-type process; GIBBS-TYPE PRIORS; PITMAN-YOR; NONPARAMETRIC-INFERENCE; DIRICHLET MIXTURES; DENSITY-ESTIMATION; CONVERGENCE-RATES; FINITE; CONSISTENCY;
D O I
10.1111/sjos.12739
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
Bayesian nonparametric mixture models are common for modeling complex data. While these models are well-suited for density estimation, recent results proved posterior inconsistency of the number of clusters when the true number of components is finite, for the Dirichlet process and Pitman-Yor process mixture models. We extend these results to additional Bayesian nonparametric priors such as Gibbs-type processes and finite-dimensional representations thereof. The latter include the Dirichlet multinomial process, the recently proposed Pitman-Yor, and normalized generalized gamma multinomial processes. We show that mixture models based on these processes are also inconsistent in the number of clusters and discuss possible solutions. Notably, we show that a postprocessing algorithm introduced for the Dirichlet process can be extended to more general models and provides a consistent method to estimate the number of components.
引用
收藏
页码:1619 / 1660
页数:42
相关论文
共 50 条
  • [1] Consistency of mixture models with a prior on the number of components
    Miller, Jeffrey W.
    DEPENDENCE MODELING, 2023, 11 (01):
  • [2] On posterior consistency of tail index for Bayesian kernel mixture models
    Li, Cheng
    Lin, Lizhen
    Dunson, David B.
    BERNOULLI, 2019, 25 (03) : 1999 - 2028
  • [3] Overfitting Bayesian Mixture Models with an Unknown Number of Components
    van Havre, Zoe
    White, Nicole
    Rousseau, Judith
    Mengersen, Kerrie
    PLOS ONE, 2015, 10 (07):
  • [4] Comparison of Criteria for Choosing the Number of Classes in Bayesian Finite Mixture Models
    Nasserinejad, Kazem
    van Rosmalen, Joost
    de Kort, Wim
    Lesaffre, Emmanuel
    PLOS ONE, 2017, 12 (01):
  • [5] Bayesian Analysis of Mixture Structural Equation Models With an Unknown Number of Components
    Liu, Hefei
    Song, Xin Yuan
    STRUCTURAL EQUATION MODELING-A MULTIDISCIPLINARY JOURNAL, 2018, 25 (01) : 41 - 55
  • [6] Bayesian consistency for stationary models
    Lijoi, Antonio
    Prunster, Igor
    Walker, Stephen G.
    ECONOMETRIC THEORY, 2007, 23 (04) : 749 - 759
  • [7] Bayesian Consistency for Markov Models
    Antoniano-Villalobos I.
    Walker S.G.
    Sankhya A, 2015, 77 (1): : 106 - 125
  • [8] Evaluate the number of clusters in finite mixture models with the penalized histogram difference criterion
    Lin, Weilu
    Wang, Yonghong
    Zhuang, Yingping
    Zhang, Siliang
    JOURNAL OF PROCESS CONTROL, 2013, 23 (08) : 1052 - 1062
  • [9] Bayesian mixture of autoregressive models
    Lau, John W.
    So, Mike K. P.
    COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2008, 53 (01) : 38 - 60
  • [10] Consistency of the MLE under Mixture Models
    Chen, Jiahua
    STATISTICAL SCIENCE, 2017, 32 (01) : 47 - 63