A boosted clustering algorithm for distributed homogeneous data mining

被引:0
|
作者
Li, Chengan [1 ]
Wu, Tiejun [1 ]
机构
[1] Zhejiang Univ, Inst Intelligent Syst & Decis Making, Hangzhou 310027, Zhejiang, Peoples R China
关键词
cluster ensembles; distributed clustering; unsupervised learning; boosting strategy; partition schemes;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
A new distributed clustering algorithm based on boosting techniques is present to efficiently integrate multiple partitions constructed over very large and distributed homogeneous databases that cannot be merged at a single location. In the proposed method, the individual clustering solutions are first produced from disjoint datasets at each boosting round and then the cluster prototypes rather than matrices of partitions are transferred to a site to generate a global cluster prototype which is broadcasted to all distributed sites and used to partition data in each site. Finally, all the individual solutions are combined into a weighted voting ensemble on each disjoint data set. Experimental results demonstrate that the proposed distributed clustering method can effectively achieve clustering accuracy comparable to or slightly better than the algorithms in which boosting techniques are applied to the centralized data. In addition, communication cost of the proposed algorithm is very small.
引用
收藏
页码:5952 / 5956
页数:5
相关论文
共 50 条
  • [1] Distributed Clustering Algorithm for Spatial Data Mining
    Bendechache, Malika
    Kechadi, M-Tahar
    PROCEEDINGS 2015 SECOND IEEE INTERNATIONAL CONFERENCE ON SPATIAL DATA MINING AND GEOGRAPHICAL KNOWLEDGE SERVICES (ICSDM 2015), 2015, : 60 - 65
  • [2] Privacy Preserving Data Mining in Terms of DBSCAN Clustering Algorithm in Distributed Systems
    Anikin, Igor V.
    Gazimov, Rinat M.
    2018 INTERNATIONAL CONFERENCE ON INDUSTRIAL ENGINEERING, APPLICATIONS AND MANUFACTURING (ICIEAM), 2018,
  • [3] A modified clustering algorithm for data mining
    Xu, ZJ
    Wang, LS
    Luo, JC
    Zhang, JQ
    IGARSS 2005: IEEE International Geoscience and Remote Sensing Symposium, Vols 1-8, Proceedings, 2005, : 741 - 744
  • [4] An Effective Clustering Algorithm for Data Mining
    Vijendra, Singh
    Ashwini, Kelkar
    Laxman, Sahoo
    PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON DATA STORAGE AND DATA ENGINEERING (DSDE 2010), 2010, : 250 - 253
  • [5] The intrusion data mining method for distributed network based on fuzzy kernel clustering algorithm
    Li, Linlin
    INTERNATIONAL JOURNAL OF AUTONOMOUS AND ADAPTIVE COMMUNICATIONS SYSTEMS, 2022, 15 (01) : 32 - 45
  • [6] FedCLUS: Federated Clustering from Distributed Homogeneous Data
    Gupta, Sargam
    Sharma, Shachi
    SOFT COMPUTING AND ITS ENGINEERING APPLICATIONS, ICSOFTCOMP 2022, 2023, 1788 : 29 - 41
  • [7] Hierarchical Sequence Clustering Algorithm for Data Mining
    Chezhian, V. Umadevi
    Subash, Thanappan
    Samy, M. Ragavan
    WORLD CONGRESS ON ENGINEERING, WCE 2011, VOL III, 2011, : 1861 - 1864
  • [8] Dual clustering algorithm for spatial data mining
    Zhou, Jiaogen
    Guan, Jihong
    Bian, Fuling
    Journal of Computational Information Systems, 2006, 2 (04): : 1405 - 1410
  • [9] ITERATE: A conceptual clustering algorithm for data mining
    Biswas, Gautam
    Weinberg, Jerry B.
    Fisher, Douglas H.
    IEEE Transactions on Systems, Man & Cybernetics Part C: Applications and Reviews, 1998, 28 (02): : 219 - 230
  • [10] ITERATE: A conceptual clustering algorithm for data mining
    Biswas, G
    Weinberg, JB
    Fisher, DH
    IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART C-APPLICATIONS AND REVIEWS, 1998, 28 (02): : 219 - 230