Choosing models in model-based clustering and discriminant analysis

被引:74
|
作者
Biernacki, C
Govaert, G
机构
[1] INRIA Rhone Alps, ZIRST, F-38330 St Martin, France
[2] Univ Technol Compiegne, CNRS, UMR 6599, F-60205 Compiegne, France
关键词
Gaussian mixture models; eigenvalue decomposition; cross-validation; information; Bayesian and classification criteria;
D O I
10.1080/00949659908811966
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Using an eigenvalue decomposition of variance matrices, Celeux and Govaert (1993) obtained numerous and powerful models for Gaussian model-based clustering and discriminant analysis. Through Monte Carlo simulations, we compare the performances of many classical criteria to select these models: information criteria as AIC, the Bayesian criterion BIG, classification criteria as NEC and cross-validation. In the clustering context, information criteria and BIC outperform the classification criteria. In the discriminant analysis context, cross-validation shows good performance but information criteria and BIC give satisfactory results as well with, by far, less time-computing.
引用
收藏
页码:49 / 71
页数:23
相关论文
共 50 条
  • [21] Model-Based Clustering
    Gormley, Isobel Claire
    Murphy, Thomas Brendan
    Raftery, Adrian E.
    ANNUAL REVIEW OF STATISTICS AND ITS APPLICATION, 2023, 10 : 573 - 595
  • [22] Model-Based Clustering
    McNicholas, Paul D.
    JOURNAL OF CLASSIFICATION, 2016, 33 (03) : 331 - 373
  • [23] Model-based clustering, classification, and discriminant analysis via mixtures of multivariate t-distributionsThe tEIGEN family
    Jeffrey L. Andrews
    Paul D. McNicholas
    Statistics and Computing, 2012, 22 : 1021 - 1029
  • [24] Mixture model-based functional discriminant analysis for curve classification
    Chamroukhi, Faicel
    Glotin, Herve
    2012 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2012,
  • [25] Parsimonious skew mixture models for model-based clustering and classification
    Vrbik, Irene
    McNicholas, Paul D.
    COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2014, 71 : 196 - 210
  • [26] Mixtures of ARMA models for model-based time series clustering
    Xiong, YM
    Yeung, DY
    2002 IEEE INTERNATIONAL CONFERENCE ON DATA MINING, PROCEEDINGS, 2002, : 717 - 720
  • [27] Model-based clustering and analysis of life history data
    Scott, Marc A.
    Mohan, Kaushik
    Gauthier, Jacques-Antoine
    JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES A-STATISTICS IN SOCIETY, 2020, 183 (03) : 1231 - 1251
  • [28] Model-based video scene clustering with noise analysis
    Lu, H
    Li, ZY
    Tan, YP
    2004 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS, VOL 2, PROCEEDINGS, 2004, : 105 - 108
  • [29] Model-based clustering with envelopes
    Wang, Wenjing
    Zhang, Xin
    Mai, Qing
    ELECTRONIC JOURNAL OF STATISTICS, 2020, 14 (01): : 82 - 109
  • [30] Challenges in model-based clustering
    Melnykov, Volodymyr
    WILEY INTERDISCIPLINARY REVIEWS-COMPUTATIONAL STATISTICS, 2013, 5 (02): : 135 - 148