A boosted clustering algorithm for distributed homogeneous data mining

被引:0
|
作者
Li, Chengan [1 ]
Wu, Tiejun [1 ]
机构
[1] Zhejiang Univ, Inst Intelligent Syst & Decis Making, Hangzhou 310027, Zhejiang, Peoples R China
来源
WCICA 2006: SIXTH WORLD CONGRESS ON INTELLIGENT CONTROL AND AUTOMATION, VOLS 1-12, CONFERENCE PROCEEDINGS | 2006年
关键词
cluster ensembles; distributed clustering; unsupervised learning; boosting strategy; partition schemes;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
A new distributed clustering algorithm based on boosting techniques is present to efficiently integrate multiple partitions constructed over very large and distributed homogeneous databases that cannot be merged at a single location. In the proposed method, the individual clustering solutions are first produced from disjoint datasets at each boosting round and then the cluster prototypes rather than matrices of partitions are transferred to a site to generate a global cluster prototype which is broadcasted to all distributed sites and used to partition data in each site. Finally, all the individual solutions are combined into a weighted voting ensemble on each disjoint data set. Experimental results demonstrate that the proposed distributed clustering method can effectively achieve clustering accuracy comparable to or slightly better than the algorithms in which boosting techniques are applied to the centralized data. In addition, communication cost of the proposed algorithm is very small.
引用
收藏
页码:5952 / 5956
页数:5
相关论文
共 50 条
  • [21] Neural Network Data Mining Clustering Optimization Algorithm
    Jiao, Guie
    Li, Wang
    IETE JOURNAL OF RESEARCH, 2021,
  • [22] The Application of Data Mining Clustering Algorithm in Fuzzy Control
    Li Guodong
    Xia Kewen
    SOFTWARE ENGINEERING AND KNOWLEDGE ENGINEERING: THEORY AND PRACTICE, VOL 1, 2012, 114 : 105 - 113
  • [23] An ant-based clustering algorithm in data mining
    Tang, Y
    Ma, YK
    SHAPING BUSINESS STRATEGY IN A NETWORKED WORLD, VOLS 1 AND 2, PROCEEDINGS, 2004, : 1101 - 1105
  • [24] A Fuzzy Clustering Algorithm of Data Mining Based on IWO
    Zhao Xiao-qiang
    Zhou Jin-Hu
    Yang Jia-Min
    2013 32ND CHINESE CONTROL CONFERENCE (CCC), 2013, : 7988 - 7993
  • [25] A clustering algorithm for data mining based on swarm intelligence
    Jin, Peng
    Zhu, Vun-Long
    Hu, Kun-Yuan
    PROCEEDINGS OF 2007 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-7, 2007, : 803 - 807
  • [26] Incremental Clustering Algorithm for Earth Science Data Mining
    Vatsavi, Ranga Raju
    COMPUTATIONAL SCIENCE - ICCS 2009, 2009, 5545 : 375 - 384
  • [27] Analysis and Application of Data Mining Based on Clustering Algorithm
    Lai Honghui
    Lai Xiao Tao
    PROCEEDINGS OF THE 2015 INFORMATION TECHNOLOGY AND MECHATRONICS ENGINEERING CONFERENCE, 2015, 7 : 129 - 133
  • [28] An algorithm for time series data mining based on clustering
    Wu, Shaozhi
    Wu, Yue
    Wang, Ying
    Ye, Yalan
    2006 INTERNATIONAL CONFERENCE ON COMMUNICATIONS, CIRCUITS AND SYSTEMS PROCEEDINGS, VOLS 1-4: VOL 1: SIGNAL PROCESSING, 2006, : 2155 - +
  • [29] Mining of Data Stream Using "DDenStream" Clustering Algorithm
    Kumar, Manoj
    Sharma, Ashish
    PROCEEDINGS OF THE 2013 IEEE INTERNATIONAL CONFERENCE IN MOOC, INNOVATION AND TECHNOLOGY IN EDUCATION (MITE), 2013, : 315 - 320
  • [30] Parallel and distributed clustering framework for big spatial data mining
    Bendechache, Malika
    Tari, A-Kamel
    Kechadi, M-Tahar
    INTERNATIONAL JOURNAL OF PARALLEL EMERGENT AND DISTRIBUTED SYSTEMS, 2019, 34 (06) : 671 - 689