An Improved Affinity Propagation Clustering Algorithm for Large-scale Data Sets

被引:0
|
作者
Liu, Xiaonan [1 ]
Yin, Meijuan [1 ]
Luo, Junyong [1 ]
Chen, Wuping [2 ]
机构
[1] State Key Lab Math Engn & Adv Comp, Zhengzhou, Peoples R China
[2] Sci & Technol Informat Assurance Lab, Beijing, Peoples R China
关键词
Data clustering; affinity propagation; hierarchical; selection; clustering center;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Affinity Propagation (AP) clustering does not need to set the number of clusters, and has advantages on efficiency and accuracy, but is not suitable for large-scale data clustering. To ensure both a low time complexity and a good accuracy for the clustering method of affinity propagation on large-scale data clustering, an improved AP clustering algorithm named hierarchical affinity propagation (HAP) is proposed, which clusters data points by using AP algorithm several times on different level data. The data set to be clustered is firstly divided into several subsets, each of which can be efficiently clustered by AP algorithm. Then, the AP algorithm is performed on each subset to respectively select cluster centers of each subset. Further, AP clustering was again implemented on all the local cluster centers to select well-suited global exemplars of whole data set. Finally, to efficiently and accurately cluster data points in a large-scale, all the data points are clustered by the similarities between each data point and the global exemplars. The experimental results on real and simulated data sets show that, compared with the traditional AP and adaptive AP algorithm, the HAP algorithm can greatly reduce the clustering time consumption with a relatively better clustering results.
引用
收藏
页码:894 / 899
页数:6
相关论文
共 50 条
  • [41] A distributed and incremental algorithm for large-scale graph clustering
    Inoubli, Wissem
    Aridhi, Sabeur
    Mezni, Haithem
    Maddouri, Mondher
    Nguifo, Engelbert Mephu
    FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2022, 134 : 334 - 347
  • [42] Fast algorithm for large-scale subspace clustering by LRR
    Xie, Deyan
    Nie, Feiping
    Gao, Quanxue
    Xiao, Song
    IET IMAGE PROCESSING, 2020, 14 (08) : 1475 - 1480
  • [43] A Novel Clustering Algorithm for Large-Scale Graph Processing
    Qu, Zhaoyang
    Ding, Wei
    Qu, Nan
    Yan, Jia
    Wang, Ling
    INTELLIGENT COMPUTING METHODOLOGIES, ICIC 2016, PT III, 2016, 9773 : 349 - 358
  • [44] A distributed clustering algorithm for large-scale dynamic networks
    Bernard, Thibault
    Bui, Alain
    Pilard, Laurence
    Sohier, Devan
    CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS, 2012, 15 (04): : 335 - 350
  • [45] A distributed clustering algorithm for large-scale dynamic networks
    Thibault Bernard
    Alain Bui
    Laurence Pilard
    Devan Sohier
    Cluster Computing, 2012, 15 : 335 - 350
  • [46] A fast fuzzy clustering algorithm for large-scale datasets
    Shi, LK
    He, PL
    ADVANCED DATA MINING AND APPLICATIONS, PROCEEDINGS, 2005, 3584 : 203 - 208
  • [47] UNSUPERVISED CLASSIFICATION OF POLSAR DATA BASED ON THE IMPROVED AFFINITY PROPAGATION CLUSTERING
    Wang, Shuang
    Liu, Yachao
    Liu, Kun
    Hou, Xiaojin
    Hou, Biao
    2013 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM (IGARSS), 2013, : 3207 - 3210
  • [48] A EM Probabilistic Clustering Algorithm for Large Scale Data Sets based on Partial Constraints Information
    Yan S.
    Shunlin S.
    Yuquan Z.
    Advances in Information Sciences and Service Sciences, 2011, 3 (10): : 20 - 29
  • [49] A Spark-based Artificial Bee Colony Algorithm for Large-scale Data Clustering
    Wang, Yanjie
    Qian, Quan
    IEEE 20TH INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING AND COMMUNICATIONS / IEEE 16TH INTERNATIONAL CONFERENCE ON SMART CITY / IEEE 4TH INTERNATIONAL CONFERENCE ON DATA SCIENCE AND SYSTEMS (HPCC/SMARTCITY/DSS), 2018, : 1213 - 1218
  • [50] A K-partitioning algorithm for clustering large-scale spatio-textual data
    Choi, Dong-Wan
    Chung, Chin-Wan
    INFORMATION SYSTEMS, 2017, 64 : 1 - 11