A novel member enhancement-based clustering ensemble algorithm

Authors
He, Yulin [1, 2, 3]
Yang, Jin [2]
Cheng, Yingchao [1]
Du, Xueqin [2]
Huang, Joshua Zhexue [1, 2]
Affiliations
[1] Guangdong Lab Artificial Intelligence & Digital Ec, Shenzhen, Peoples R China
[2] Shenzhen Univ, Coll Comp Sci & Software Engn, Shenzhen, Peoples R China
[3] Guangdong Lab Artificial Intelligence & Digital Ec, Shenzhen 518107, Peoples R China
Source
Funding
National Natural Science Foundation of China;
Keywords
ensemble clustering; heterocluster; homocluster; MMD; neighborhood density; COMBINING MULTIPLE CLUSTERINGS; SELECTION; PARTITIONS; STABILITY; QUALITY;
DOI
10.1002/cpe.7992
Chinese Library Classification number
TP31 [Computer software];
Discipline classification codes
081202; 0835;
Abstract
Clustering ensemble is a popular approach for identifying data clusters: it combines the clustering results of multiple base clustering algorithms to produce more accurate and robust clusters. However, the performance of clustering ensemble algorithms depends heavily on the quality of the ensemble members. To address this problem, this paper proposes a member enhancement-based clustering ensemble (MECE) algorithm that selects the ensemble members by considering their distribution consistency. MECE has two main components, heterocluster splitting and homocluster merging. The first component estimates two probability density functions (p.d.f.s) on the sample points of a heterocluster, one represented by a Gaussian distribution and the other by a Gaussian mixture model. If the random numbers generated from these two p.d.f.s follow different probability distributions, the heterocluster is split into smaller clusters. The second component merges clusters that have high neighborhood densities into a homocluster, where the neighborhood density is measured using a novel evaluation criterion. In addition, a co-association matrix is presented, which serves as a summary of the ensemble of diverse clusters. A series of experiments was conducted to evaluate the feasibility and effectiveness of the proposed ensemble member generation algorithm. Results show that the proposed MECE algorithm can select high-quality ensemble members and, as a result, yields better clusterings than six state-of-the-art ensemble clustering algorithms, namely the cluster-based similarity partitioning algorithm (CSPA), meta-clustering algorithm (MCLA), hybrid bipartite graph formulation (HBGF), evidence accumulation clustering (EAC), locally weighted evidence accumulation (LWEA), and locally weighted graph partition (LWGP). Specifically, MECE achieves nearly 23% higher average NMI, 27% higher average ARI, 15% higher average FMI, and 10% higher average purity than the CSPA, MCLA, HBGF, EAC, LWEA, and LWGP algorithms. The experimental results demonstrate that MECE is a valid approach to clustering ensemble problems.
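As a rough illustration of two ideas named in the abstract, the following Python sketch shows a generic co-association matrix and a maximum mean discrepancy (MMD) comparison between a single-Gaussian sample and a two-component mixture sample. It is not the authors' implementation. The function names `co_association` and `mmd2_rbf`, the RBF kernel, the `gamma` value, and the synthetic data are all assumptions made for illustration, and the use of MMD for the distribution comparison is inferred only from the keyword list.

```python
import numpy as np


def co_association(labelings):
    """Fraction of base clusterings that place each pair of points together.

    labelings: array-like of shape (n_clusterings, n_points) with cluster labels.
    Returns an (n_points, n_points) matrix with entries in [0, 1].
    """
    labelings = np.asarray(labelings)
    # same[m, i, j] is True when clustering m puts points i and j in one cluster.
    same = labelings[:, :, None] == labelings[:, None, :]
    return same.mean(axis=0)


def mmd2_rbf(x, y, gamma=1.0):
    """Biased estimate of the squared MMD between samples x and y.

    x, y: arrays of shape (n_samples, n_features).
    gamma: RBF kernel parameter (hypothetical default, not from the paper).
    """
    def kernel(a, b):
        sq_dist = ((a[:, None, :] - b[None, :, :]) ** 2).sum(axis=-1)
        return np.exp(-gamma * sq_dist)

    return kernel(x, x).mean() + kernel(y, y).mean() - 2.0 * kernel(x, y).mean()


# Synthetic 1-D data (assumed for illustration only).
rng = np.random.default_rng(0)
gauss_a = rng.normal(0.0, 3.0, size=(200, 1))   # sample from one wide Gaussian
gauss_b = rng.normal(0.0, 3.0, size=(200, 1))   # second sample, same Gaussian
centers = rng.choice([-3.0, 3.0], size=(200, 1))
mixture = centers + rng.normal(0.0, 0.5, size=(200, 1))  # two-component mixture

same_dist = mmd2_rbf(gauss_a, gauss_b)   # small: both samples share a distribution
diff_dist = mmd2_rbf(gauss_a, mixture)   # larger: unimodal vs bimodal sample

# Two base clusterings of four points, summarized as a co-association matrix.
coassoc = co_association([[0, 0, 1, 1], [0, 1, 1, 1]])
```

In this sketch, a large discrepancy between the single-Gaussian sample and the mixture sample would suggest that the cluster is not well described by one Gaussian, which is the kind of evidence the abstract associates with splitting a heterocluster. The threshold for "large" is left open here because the abstract does not state one.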
Pages: 23