Fuzzy Divergence Weighted Ensemble Clustering With Spectral Learning Based on Random Projections for Big Data

Cited by: 1
Authors
Lahmar, Ines [1]
Zaier, Aida [2]
Yahia, Mohamed [3]
Ali, Tarig [4]
Boaullegue, Ridha [2]
Affiliations
[1] Univ Gabes, MACS Lab, Gabes 6029, Tunisia
[2] Univ Carthage Tunis, InnovCom Lab, Tunis 1002, Tunisia
[3] Univ Tunis El Manar, ENIT, SYSCOM Lab, Tunis 1002, Tunisia
[4] Amer Univ Sharjah, GIS & Mapping Lab, Sharjah, U Arab Emirates
Keywords
Matrix converters; Entropy; Clustering algorithms; Uncertainty; Reliability; Weight measurement; Sparse matrices; Ensemble learning; Fuzzy systems; Spectral analysis; Fuzzy ensemble clustering; high-dimensional data; random projection; Kullback-Leibler divergence entropy; spectral learning; FACE-RECOGNITION
DOI
10.1109/ACCESS.2024.3359299
Chinese Library Classification (CLC) number
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract
In many real-world applications, data are described by high-dimensional feature spaces, which poses new challenges for current ensemble clustering methods. Ensemble clustering combines sets of base clusters to improve clustering accuracy, but the combined result is then susceptible to low-quality base clusters, and the reliability of existing ensemble clustering methods on high-dimensional data still needs improvement. In this context, we propose a new fuzzy divergence-weighted ensemble clustering method based on random projection and spectral learning. First, random projection (RP) is used to generate representations of the data at various dimensions, and membership matrices are obtained via fuzzy c-means (FCM). Second, the fuzzy partitions of the random projections are ranked using entropy-based local weighting together with Kullback-Leibler (KL) divergence to detect uncertainty; this ranking is then used to evaluate the weight of each cluster. Finally, we create regularized graphs from these membership matrices and use spectral matrices to estimate the affinity matrices of these graphs using fuzzy KL divergence anchor graphs. Obtaining the final clustering is then formulated as an optimization problem, whose solution gives the ensemble clustering result. Experimental results on high-dimensional data demonstrate the efficiency of our method compared to state-of-the-art methods.
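The first two stages described in the abstract (random projection followed by FCM, then an uncertainty-based weight per partition) can be illustrated with a minimal NumPy sketch. This is not the authors' algorithm: the function names (`random_projection`, `fuzzy_c_means`, `partition_confidence`, `weighted_rp_fcm`), the Gaussian projection matrix, and the specific weighting rule are all assumptions chosen for illustration. The weight used here is the mean KL divergence of each membership row from the uniform distribution, which equals log(c) minus the row's Shannon entropy, so crisper partitions get larger weights. The spectral-learning stage is omitted.

```python
import numpy as np

def random_projection(X, k, rng):
    """Project X (n x d) down to k dimensions with a Gaussian random matrix."""
    d = X.shape[1]
    R = rng.standard_normal((d, k)) / np.sqrt(k)
    return X @ R

def fuzzy_c_means(X, c, m=2.0, n_iter=100, tol=1e-6, rng=None):
    """Standard FCM; returns an n x c membership matrix whose rows sum to 1."""
    rng = np.random.default_rng() if rng is None else rng
    U = rng.random((X.shape[0], c))
    U /= U.sum(axis=1, keepdims=True)
    for _ in range(n_iter):
        W = U ** m
        centers = (W.T @ X) / W.sum(axis=0)[:, None]
        # Squared Euclidean distance from every point to every center.
        d2 = ((X[:, None, :] - centers[None, :, :]) ** 2).sum(axis=2)
        d2 = np.maximum(d2, 1e-12)
        inv = d2 ** (-1.0 / (m - 1.0))
        U_new = inv / inv.sum(axis=1, keepdims=True)
        converged = np.abs(U_new - U).max() < tol
        U = U_new
        if converged:
            break
    return U

def partition_confidence(U, eps=1e-12):
    """Mean KL(u_i || uniform) over rows, scaled to [0, 1].

    KL(u || uniform) = log(c) - H(u), so this is 1 minus the normalised
    Shannon entropy. Illustrative weighting rule only (an assumption).
    """
    c = U.shape[1]
    H = -(U * np.log(np.clip(U, eps, 1.0))).sum(axis=1)
    return float((np.log(c) - H).mean() / np.log(c))

def weighted_rp_fcm(X, c, dims, seed=0):
    """Run FCM on several random projections and weight each fuzzy partition."""
    rng = np.random.default_rng(seed)
    memberships = [fuzzy_c_means(random_projection(X, k, rng), c, rng=rng)
                   for k in dims]
    conf = np.array([partition_confidence(U) for U in memberships])
    total = conf.sum()
    # Fall back to equal weights if every partition is completely uncertain.
    weights = conf / total if total > 0 else np.full(len(dims), 1.0 / len(dims))
    return memberships, weights
```

A call such as `weighted_rp_fcm(X, c=2, dims=[5, 10, 20])` returns one membership matrix per projection dimension plus a weight vector that sums to 1; those weights could then scale each partition's contribution when the partitions are combined.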
Pages: 20197-20208
Number of pages: 12
Related papers
50 in total
  • [1] Fuzzy ensemble clustering based on random projections for DNA microarray data analysis
    Avogadri, Roberto
    Valentini, Giorgio
    ARTIFICIAL INTELLIGENCE IN MEDICINE, 2009, 45 (2-3) : 173 - 183
  • [2] Random projections fuzzy c-means (RPFCM) for big data clustering
    Popescu, Mihail
    Keller, James
    Bezdek, James
    Zare, Alina
    2015 IEEE INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS (FUZZ-IEEE 2015), 2015,
  • [3] Ensemble Fuzzy Clustering Using Cumulative Aggregation on Random Projections
    Rathore, Punit
    Bezdek, James C.
    Erfani, Sarah M.
    Rajasegarar, Sutharshan
    Palaniswami, Marimuthu
    IEEE TRANSACTIONS ON FUZZY SYSTEMS, 2018, 26 (03) : 1510 - 1524
  • [4] RSPCA: Random Sample Partition and Clustering Approximation for ensemble learning of big data
    Mahmud, Mohammad Sultan
    Zheng, Hua
    Garcia-Gil, Diego
    Garcia, Salvador
    Huang, Joshua Zhexue
    PATTERN RECOGNITION, 2025, 161
  • [5] Fuzzy c-Means and Cluster Ensemble with Random Projection for Big Data Clustering
    Ye, Mao
    Liu, Wenfen
    Wei, Jianghong
    Hu, Xuexian
    MATHEMATICAL PROBLEMS IN ENGINEERING, 2016, 2016
  • [6] Adaptive weighted fuzzy clustering based on intra-cluster data divergence
    Wu, Ziheng
    Zhao, Yuan
    Wang, Wenyan
    Li, Cong
    NEUROCOMPUTING, 2023, 552
  • [7] Robust and fuzzy ensemble framework via spectral learning for random projection-based fuzzy-c-means clustering
    Shi, Zhaoyin
    Chen, Long
    Duan, Junwei
    Chen, Guangyong
    Zhao, Kai
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2023, 117
  • [8] Random Sample Partition-Based Clustering Ensemble Algorithm for Big Data
    Du, Xueqin
    He, Yulin
    Huang, Joshua Zhexue
    2021 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2021, : 5885 - 5887
  • [9] Multi-view Iterative Random Projections on Big Data Clustering
    Bettoumi, Safa
    Jlassi, Chiraz
    Arous, Najet
    IMAGE AND SIGNAL PROCESSING (ICISP 2018), 2018, 10884 : 215 - 224
  • [10] Ensemble Learning for Spectral Clustering
    Li, Hongmin
    Ye, Xiucai
    Imakura, Akira
    Sakurai, Tetsuya
    20TH IEEE INTERNATIONAL CONFERENCE ON DATA MINING (ICDM 2020), 2020, : 1094 - 1099