A novel member enhancement-based clustering ensemble algorithm

被引:0
|
作者
He, Yulin [1 ,2 ,3 ]
Yang, Jin [2 ]
Cheng, Yingchao [1 ]
Du, Xueqin [2 ]
Huang, Joshua Zhexue [1 ,2 ]
机构
[1] Guangdong Lab Artificial Intelligence & Digital Ec, Shenzhen, Peoples R China
[2] Shenzhen Univ, Coll Comp Sci & Software Engn, Shenzhen, Peoples R China
[3] Guangdong Lab Artificial Intelligence & Digital Ec, Shenzhen 518107, Peoples R China
来源
基金
中国国家自然科学基金;
关键词
ensemble clustering; heterocluster; homocluster; MMD; neighborhood density; COMBINING MULTIPLE CLUSTERINGS; SELECTION; PARTITIONS; STABILITY; QUALITY;
D O I
10.1002/cpe.7992
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Clustering ensemble is a popular approach for identifying data clusters that combines the clustering results from multiple base clustering algorithms to produce more accurate and robust data clusters. However, the performance of clustering ensemble algorithms is highly dependent on the quality of clustering members. To address this problem, this paper proposes a member enhancement-based clustering ensemble (MECE) algorithm that selects the ensemble members by considering their distribution consistency. MECE has two main components, called heterocluster splitting and homocluster merging. The first component estimates two probability density functions (p.d.f.s) estimated on the sample points of an heterocluster and represents them using a Gaussian distribution and a Gaussian mixture model. If the random numbers generated by these two p.d.f.s have different probability distributions, the heterocluster is then split into smaller clusters. The second component merges the clusters that have high neighborhood densities into a homocluster, where the neighborhood density is measured using a novel evaluation criterion. In addition, a co-association matrix is presented, which serves as a summary for the ensemble of diverse clusters. A series of experiments were conducted to evaluate the feasibility and effectiveness of the proposed ensemble member generation algorithm. Results show that the proposed MECE algorithm can select high quality ensemble members and as a result yield the better clusterings than six state-of-the-art ensemble clustering algorithms, that is, cluster-based similarity partitioning algorithm (CSPA), meta-clustering algorithm (MCLA), hybrid bipartite graph formulation (HBGF), evidence accumulation clustering (EAC), locally weighted evidence accumulation (LWEA), and locally weighted graph partition (LWGP). Specifically, MECE algorithm has the nearly 23% higher average NMI, 27% higher average ARI, 15% higher average FMI, and 10% higher average purity than CSPA, MCLA, HBGF, EAC, LWEA, and LWGA algorithms. The experimental results demonstrate that MECE algorithm is a valid approach to deal with the clustering ensemble problems.
引用
收藏
页数:23
相关论文
共 50 条
  • [41] A decentralized algorithm for distributed ensemble clustering
    Rosato, Antonello
    Altilio, Rosa
    Panella, Massimo
    INFORMATION SCIENCES, 2021, 578 : 417 - 434
  • [42] A New Selective Clustering Ensemble Algorithm
    Liu Limin
    Fan Xiaoping
    2012 NINTH IEEE INTERNATIONAL CONFERENCE ON E-BUSINESS ENGINEERING (ICEBE), 2012, : 45 - 49
  • [43] RESEARCH ADVANCE OF CLUSTERING ENSEMBLE ALGORITHM
    Zhan, Jin-Mei
    Chen, Jun-Tao
    Xing, Jie-Qing
    2017 INTERNATIONAL CONFERENCE ON WAVELET ANALYSIS AND PATTERN RECOGNITION (ICWAPR), 2017, : 109 - 114
  • [44] GrpClassifierEC: a novel classification approach based on the ensemble clustering space
    Loai Abdallah
    Malik Yousef
    Algorithms for Molecular Biology, 15
  • [45] GrpClassifierEC: a novel classification approach based on the ensemble clustering space
    Abdallah, Loai
    Yousef, Malik
    ALGORITHMS FOR MOLECULAR BIOLOGY, 2020, 15 (01)
  • [46] Textual Enhancement-Based Grammar Instructional Design for English Students
    Roza, Veni
    Radjab, Desmawati
    Syarif, Hermawati
    Zaim, M.
    PROCEEDINGS OF THE FIFTH INTERNATIONAL SEMINAR ON ENGLISH LANGUAGE AND TEACHING (ISELT 2017), 2017, 110 : 1 - 7
  • [47] An Ensemble Clustering Framework Based on Hierarchical Clustering Ensemble Selection and Clusters Clustering
    Li, Wenjun
    Wang, Zikang
    Sun, Wei
    Bahrami, Sara
    CYBERNETICS AND SYSTEMS, 2023, 54 (05) : 741 - 766
  • [48] Semi-supervised clustering ensemble based on genetic algorithm model
    Bi, Sheng
    Li, Xiangli
    MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 83 (18) : 55851 - 55865
  • [49] Spectral Clustering Algorithm Based on Weighted Ensemble Nyström Sampling
    Qiu Y.
    Liu C.
    Moshi Shibie yu Rengong Zhineng/Pattern Recognition and Artificial Intelligence, 2019, 32 (05): : 420 - 428
  • [50] An ensemble image quality assessment algorithm based on deep feature clustering
    Bian, Tianliang
    SIGNAL PROCESSING-IMAGE COMMUNICATION, 2020, 81