A novel member enhancement-based clustering ensemble algorithm

Authors
He, Yulin [1, 2, 3]
Yang, Jin [2]
Cheng, Yingchao [1]
Du, Xueqin [2]
Huang, Joshua Zhexue [1, 2]
Affiliations
[1] Guangdong Lab Artificial Intelligence & Digital Ec, Shenzhen, Peoples R China
[2] Shenzhen Univ, Coll Comp Sci & Software Engn, Shenzhen, Peoples R China
[3] Guangdong Lab Artificial Intelligence & Digital Ec, Shenzhen 518107, Peoples R China
Source
Funding
National Natural Science Foundation of China;
Keywords
ensemble clustering; heterocluster; homocluster; MMD; neighborhood density; COMBINING MULTIPLE CLUSTERINGS; SELECTION; PARTITIONS; STABILITY; QUALITY;
DOI
10.1002/cpe.7992
Chinese Library Classification
TP31 [Computer Software];
Discipline classification code
081202; 0835;
Abstract
Clustering ensemble is a popular approach that combines the results of multiple base clustering algorithms to produce more accurate and robust data clusters. However, the performance of clustering ensemble algorithms depends heavily on the quality of the ensemble members. To address this problem, this paper proposes a member enhancement-based clustering ensemble (MECE) algorithm that selects ensemble members by considering their distribution consistency. MECE has two main components: heterocluster splitting and homocluster merging. The first component estimates two probability density functions (p.d.f.s) from the sample points of a heterocluster, represented by a Gaussian distribution and a Gaussian mixture model, respectively. If the random numbers generated from these two p.d.f.s follow different probability distributions, the heterocluster is split into smaller clusters. The second component merges clusters that have high neighborhood densities into a homocluster, where the neighborhood density is measured using a novel evaluation criterion. In addition, a co-association matrix is presented, which serves as a summary of the ensemble of diverse clusters. A series of experiments was conducted to evaluate the feasibility and effectiveness of the proposed ensemble member generation algorithm. The results show that MECE selects high-quality ensemble members and, as a result, yields better clusterings than six state-of-the-art ensemble clustering algorithms: cluster-based similarity partitioning algorithm (CSPA), meta-clustering algorithm (MCLA), hybrid bipartite graph formulation (HBGF), evidence accumulation clustering (EAC), locally weighted evidence accumulation (LWEA), and locally weighted graph partition (LWGP). Specifically, MECE achieves nearly 23% higher average NMI, 27% higher average ARI, 15% higher average FMI, and 10% higher average purity than CSPA, MCLA, HBGF, EAC, LWEA, and LWGP. These experimental results demonstrate that MECE is a valid approach to clustering ensemble problems.
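The abstract mentions a co-association matrix that summarizes the ensemble but does not give its construction. As an illustration only, the sketch below uses the standard evidence-accumulation form, in which entry (i, j) is the fraction of base partitions that place points i and j in the same cluster. The function name `co_association` and the toy label vectors are hypothetical, and the matrix used in MECE may be defined differently.

```python
from typing import List, Sequence


def co_association(partitions: Sequence[Sequence[int]]) -> List[List[float]]:
    """Return the n x n co-association matrix of an ensemble.

    Each element of `partitions` is one base clustering, given as a list of
    cluster labels (one label per data point). This is the generic
    evidence-accumulation definition, assumed here for illustration; it is
    not taken from the MECE paper.
    """
    if not partitions:
        raise ValueError("need at least one base partition")
    n = len(partitions[0])
    if any(len(labels) != n for labels in partitions):
        raise ValueError("all partitions must label the same number of points")

    # Count, for every pair of points, how many partitions co-cluster them.
    counts = [[0] * n for _ in range(n)]
    for labels in partitions:
        for i in range(n):
            for j in range(n):
                if labels[i] == labels[j]:
                    counts[i][j] += 1

    # Normalize the counts by the ensemble size to obtain values in [0, 1].
    m = len(partitions)
    return [[count / m for count in row] for row in counts]


# Toy ensemble: three base clusterings of four points (hypothetical data).
ensemble = [
    [0, 0, 1, 1],
    [0, 0, 0, 1],
    [1, 1, 0, 0],
]
matrix = co_association(ensemble)
```

In this toy ensemble, points 0 and 1 share a cluster in all three partitions, so `matrix[0][1]` is 1.0, while points 0 and 3 never do, so `matrix[0][3]` is 0.0. A consensus clustering is typically obtained by treating such a matrix as a similarity matrix and applying, for example, hierarchical clustering to it.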
Pages: 23