On convergence and parameter selection of the EM and DA-EM algorithms for Gaussian mixtures

被引:29
|
作者
Yu, Jian [1 ]
Chaomurilige, Chaomu [1 ]
Yang, Miin-Shen [2 ]
机构
[1] Beijing Jiaotong Univ, Beijing Key Lab Traff Data Anal & Min, Beijing, Peoples R China
[2] Chung Yuan Christian Univ, Dept Appl Math, Chungli 32023, Taiwan
关键词
Expectation & maximization (EM) algorithm; Deterministic annealing EM (DA-EM); GAUSSIAN mixtures; Self-annealing; Convergence; Parameter selection; MAXIMUM-LIKELIHOOD; MODELS;
D O I
10.1016/j.patcog.2017.12.014
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The expectation & maximization (EM) for Gaussian mixtures is popular as a clustering algorithm. However, the EM algorithm is sensitive to initial values, and so Ueda and Nakano [4] proposed the deterministic annealing EM (DA-EM) algorithm to improve it. In this paper, we investigate theoretical behaviors of the EM and DA-EM algorithms. We first derive a general Jacobian matrix of the DA-EM algorithm with respect to posterior probabilities. We then propose a theoretical lower bound for initialization of the annealing parameter in the DA-EM algorithm. On the other hand, some researches mentioned that the EM algorithm exhibits a self-annealing behavior, that is, the equal posterior probability with small random perturbations can avoid the EM algorithm to output the mass center for Gaussian mixtures. However, there is no theoretical analysis on this self-annealing property. Since the DA-EM will become the EM when the annealing parameter is 1, according to the Jacobian matrix of the DA-EM, we can prove the self-annealing property of the EM algorithm for Gaussian mixtures. Based on these results, we give not only convergence behaviors of the equal posterior probabilities and initialization lower bound of the temperature parameter of the DA-EM, but also a theoretical explanation why the EM algorithm for Gaussian mixtures exhibits a self-annealing behavior. (C) 2017 Elsevier Ltd. All rights reserved.
引用
收藏
页码:188 / 203
页数:16
相关论文
共 50 条
  • [31] EM algorithms of Gaussian Mixture Model and Hidden Markov Model
    Xuan, GR
    Zhang, W
    Chai, PQ
    2001 INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, VOL I, PROCEEDINGS, 2001, : 145 - 148
  • [32] Comparative convergence analysis of EM and SAGE algorithms in DOA estimation
    Chung, PJ
    Böhme, JF
    IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2001, 49 (12) : 2940 - 2949
  • [33] Sufficient conditions for ergodicity and convergence of MH, SA, and EM algorithms
    Dorea, CCY
    Neto, DSBM
    Pereira, AGC
    STOCHASTIC MODELS, 2004, 20 (02) : 193 - 204
  • [34] CONVERGENCE OF EM IMAGE-RECONSTRUCTION ALGORITHMS WITH GIBBS SMOOTHING
    LANGE, K
    IEEE TRANSACTIONS ON MEDICAL IMAGING, 1990, 9 (04) : 439 - 446
  • [35] Comparative convergence analysis of EM and SAGE algorithms in DOA estimation
    Chung, PJ
    Böhme, JF
    2001 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-VI, PROCEEDINGS: VOL I: SPEECH PROCESSING 1; VOL II: SPEECH PROCESSING 2 IND TECHNOL TRACK DESIGN & IMPLEMENTATION OF SIGNAL PROCESSING SYSTEMS NEURALNETWORKS FOR SIGNAL PROCESSING; VOL III: IMAGE & MULTIDIMENSIONAL SIGNAL PROCESSING MULTIMEDIA SIGNAL PROCESSING, 2001, : 2993 - 2996
  • [36] Constrained monotone EM algorithms for mixtures of multivariate t distributions
    Greselin, F.
    Ingrassia, S.
    STATISTICS AND COMPUTING, 2010, 20 (01) : 9 - 22
  • [37] Constrained monotone EM algorithms for mixtures of multivariate t distributions
    F. Greselin
    S. Ingrassia
    Statistics and Computing, 2010, 20 : 9 - 22
  • [38] Parameter estimation for grouped data using EM and MCEM algorithms
    AghahosseinaliShirazi, Zahra
    da Silva, Joao Pedro A. R.
    de Souza, Camila P. E.
    COMMUNICATIONS IN STATISTICS-SIMULATION AND COMPUTATION, 2024, 53 (08) : 3616 - 3637
  • [39] The EM Algorithm is Adaptively-Optimal for Unbalanced Symmetric Gaussian Mixtures
    Weinberger, Nir
    Bresler, Guy
    Journal of Machine Learning Research, 2022, 23
  • [40] Degeneracy of the EM algorithm for the MLE of multivariate Gaussian mixtures and dynamic constraints
    Ingrassia, Salvatore
    Rocci, Roberto
    COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2011, 55 (04) : 1715 - 1725