Estimation and computations for Gaussian mixtures with uniform noise under separation constraints

被引:2
|
作者
Coretto, Pietro [1 ]
机构
[1] Univ Salerno, Dept Econ & Stat, Via Giovanni Paolo II 132, I-84084 Fisciano, SA, Italy
来源
STATISTICAL METHODS AND APPLICATIONS | 2022年 / 31卷 / 02期
关键词
Mixture models; Noise component; Robustness; Model-based clustering; EM algorithm; Outlier identification; Density estimation; MAXIMUM-LIKELIHOOD; MODEL; CONSISTENCY; SELECTION;
D O I
10.1007/s10260-021-00578-2
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
In this paper we study a finite Gaussian mixture model with an additional uniform component that has the role to catch points in the tails of the data distribution. An adaptive constraint enforces a certain level of separation between the Gaussian mixture components and the uniform component representing noise and outliers in the tail of the distribution. The latter makes the proposed tool particularly useful for robust estimation and outlier identification. A constrained ML estimator is introduced for which existence and consistency is shown. One of the attractive features of the methodology is that the noise level is estimated from data. We also develop an EM-type algorithm with proven convergence. Based on numerical evidence we show how the methods developed in this paper are useful for several fundamental data analysis tasks: outlier identification, robust location-scale estimation, clustering, and density estimation.
引用
收藏
页码:427 / 458
页数:32
相关论文
共 50 条