Measuring the component overlapping in the Gaussian mixture model

被引:1
|
作者
Haojun Sun
Shengrui Wang
机构
[1] Shantou University,College of Engineering
[2] Université de Sherbrooke,Départment d’informatique
来源
Data Mining and Knowledge Discovery | 2011年 / 23卷
关键词
Mixture model; Ridge curve; Overlap rate; Cluster analysis;
D O I
暂无
中图分类号
学科分类号
摘要
The ability of a clustering algorithm to deal with overlapping clusters is a major indicator of its efficiency. However, the phenomenon of cluster overlapping is still not mathematically well characterized, especially in multivariate cases. In this paper, we are interested in the overlap phenomenon between Gaussian clusters, since the Gaussian mixture is a fundamental data distribution model suitable for many clustering algorithms. We introduce the novel concept of the ridge curve and establish a theory on the degree of overlap between two components. Based on this theory, we develop an algorithm for calculating the overlap rate. As an example, we use this algorithm to calculate the overlap rates between the classes in the IRIS data set and clear up some of the confusion as to the true number of classes in the data set. We investigate factors that affect the value of the overlap rate, and show how the theory can be used to generate “truthed data” as well as to measure the overlap rate between a given pair of clusters or components in a mixture. Finally, we show an example of application of the theory to evaluate the well known clustering algorithms.
引用
收藏
页码:479 / 502
页数:23
相关论文
共 50 条
  • [1] Measuring the component overlapping in the Gaussian mixture model
    Sun, Haojun
    Wang, Shengrui
    DATA MINING AND KNOWLEDGE DISCOVERY, 2011, 23 (03) : 479 - 502
  • [2] A Gaussian mixture model for underdetermined independent component analysis
    Zhang, Yingyu
    Shi, Xizhi
    Chen, Chi Hau
    SIGNAL PROCESSING, 2006, 86 (07) : 1538 - 1549
  • [3] SHARP OPTIMAL RECOVERY IN THE TWO COMPONENT GAUSSIAN MIXTURE MODEL
    Ndaoud, Mohamed
    ANNALS OF STATISTICS, 2022, 50 (04): : 2096 - 2126
  • [4] Fall Detection Using Gaussian Mixture Model and Principle Component Analysis
    Poonsri, Arisa
    Chiracharit, Werapon
    2017 9TH INTERNATIONAL CONFERENCE ON INFORMATION TECHNOLOGY AND ELECTRICAL ENGINEERING (ICITEE), 2017,
  • [5] Component Reduction for Gaussian Mixture Models
    Maebashi, Kumiko
    Suematsu, Nobuo
    Hayashi, Akira
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2008, E91D (12) : 2846 - 2853
  • [6] Multi-model switching based on Gaussian mixture model and principal component analysis
    Pang, Qiang
    Zou, Tao
    Cong, Qiumei
    Li, Yongmin
    Huagong Xuebao/CIESC Journal, 2013, 64 (08): : 2938 - 2946
  • [7] Diffusion model conditioning on Gaussian mixture model and negative Gaussian mixture gradient
    Lu, Weiguo
    Wu, Xuan
    Ding, Deng
    Duan, Jinqiao
    Zhuang, Jirong
    Yuan, Gangnan
    NEUROCOMPUTING, 2025, 614
  • [8] Damage Detection of Refractory Based on Principle Component Analysis and Gaussian Mixture Model
    Liu, Changming
    Zhou, Di
    Wang, Zhigang
    Yang, Dan
    Song, Gangbing
    COMPLEXITY, 2018,
  • [9] Mixture Gaussian process model with Gaussian mixture distribution for big data
    Guan, Yaonan
    He, Shaoying
    Ren, Shuangshuang
    Liu, Shuren
    Li, Dewei
    CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS, 2024, 253
  • [10] Iterative Mixture Component Pruning Algorithm for Gaussian Mixture PHD Filter
    Yan, Xiaoxi
    MATHEMATICAL PROBLEMS IN ENGINEERING, 2014, 2014