A novel member enhancement-based clustering ensemble algorithm

Cited by: 0
Authors
He, Yulin [1 ,2 ,3 ]
Yang, Jin [2 ]
Cheng, Yingchao [1 ]
Du, Xueqin [2 ]
Huang, Joshua Zhexue [1 ,2 ]
Affiliations
[1] Guangdong Lab Artificial Intelligence & Digital Ec, Shenzhen, Peoples R China
[2] Shenzhen Univ, Coll Comp Sci & Software Engn, Shenzhen, Peoples R China
[3] Guangdong Lab Artificial Intelligence & Digital Ec, Shenzhen 518107, Peoples R China
Source
Funding
National Natural Science Foundation of China;
Keywords
ensemble clustering; heterocluster; homocluster; MMD; neighborhood density; COMBINING MULTIPLE CLUSTERINGS; SELECTION; PARTITIONS; STABILITY; QUALITY;
DOI
10.1002/cpe.7992
Chinese Library Classification number
TP31 [Computer software];
Discipline classification code
081202 ; 0835 ;
Abstract
Clustering ensemble is a popular approach that combines the results of multiple base clustering algorithms to produce more accurate and robust data clusters. However, the performance of clustering ensemble algorithms depends heavily on the quality of the ensemble members. To address this problem, this paper proposes a member enhancement-based clustering ensemble (MECE) algorithm that selects ensemble members by considering their distribution consistency. MECE has two main components: heterocluster splitting and homocluster merging. The first component estimates two probability density functions (p.d.f.s) on the sample points of a heterocluster and represents them using a Gaussian distribution and a Gaussian mixture model. If the random numbers generated from these two p.d.f.s follow different probability distributions, the heterocluster is split into smaller clusters. The second component merges clusters that have high neighborhood densities into a homocluster, where the neighborhood density is measured using a novel evaluation criterion. In addition, a co-association matrix is presented, which serves as a summary of the ensemble of diverse clusters. A series of experiments was conducted to evaluate the feasibility and effectiveness of the proposed ensemble member generation algorithm. The results show that MECE can select high-quality ensemble members and, as a result, yields better clusterings than six state-of-the-art ensemble clustering algorithms: the cluster-based similarity partitioning algorithm (CSPA), meta-clustering algorithm (MCLA), hybrid bipartite graph formulation (HBGF), evidence accumulation clustering (EAC), locally weighted evidence accumulation (LWEA), and locally weighted graph partition (LWGP). Specifically, MECE achieves nearly 23% higher average NMI, 27% higher average ARI, 15% higher average FMI, and 10% higher average purity than the CSPA, MCLA, HBGF, EAC, LWEA, and LWGP algorithms. The experimental results demonstrate that MECE is a valid approach to clustering ensemble problems.
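The abstract does not give the exact form of MECE's co-association matrix or of its distribution comparison, so the following Python sketch only illustrates the generic ideas under stated assumptions. `co_association` is the standard evidence-accumulation matrix (the fraction of base clusterings that place two points in the same cluster), which may differ from the paper's version. `mmd2_rbf` is a biased squared maximum mean discrepancy estimate with an RBF kernel; linking it to the heterocluster test is an assumption based on "MMD" appearing in the keywords. Both function names and the `gamma` parameter are hypothetical.

```python
import numpy as np


def co_association(labelings):
    """Standard co-association matrix (assumed form, not necessarily MECE's).

    labelings: array-like of shape (n_clusterings, n_points) holding the
    cluster label of each point in each base clustering.
    Entry (i, j) is the fraction of clusterings in which points i and j
    share a cluster label.
    """
    labelings = np.asarray(labelings)
    # same[m, i, j] is True when clustering m puts i and j together.
    same = labelings[:, :, None] == labelings[:, None, :]
    return same.mean(axis=0)


def mmd2_rbf(x, y, gamma=1.0):
    """Biased estimate of squared MMD between samples x and y (RBF kernel).

    x, y: arrays of shape (n_samples, n_features). A value near zero
    suggests the two samples come from similar distributions; how MECE
    thresholds such a statistic is not stated in the abstract.
    """
    x = np.asarray(x, dtype=float)
    y = np.asarray(y, dtype=float)

    def kernel(a, b):
        # Pairwise squared Euclidean distances, then the RBF kernel.
        d2 = ((a[:, None, :] - b[None, :, :]) ** 2).sum(axis=-1)
        return np.exp(-gamma * d2)

    return kernel(x, x).mean() + kernel(y, y).mean() - 2.0 * kernel(x, y).mean()
```

In a splitting test along these lines, `x` and `y` would be samples drawn from the fitted Gaussian and the fitted Gaussian mixture model, and a large `mmd2_rbf(x, y)` would indicate that the cluster is better described by several components.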
Pages: 23