Affinity propagation clustering algorithm based on large-scale data-set

被引:38
|
作者
Wang L. [1 ]
Zheng K. [1 ]
Tao X. [2 ]
Han X. [3 ]
机构
[1] School of Management Science and Information Engineering, Jilin University Finance and Economics, Changchun
[2] School of Library Information and Archives Management Engineering, Jilin University, Changchun
[3] School of Computer Science and Engineering, Changchun University of Technology, Jiiln
基金
中国国家自然科学基金;
关键词
affinity propagation algorithm; density peak algorithm; Large-scale data-sets; structural similarity;
D O I
10.1080/1206212X.2018.1425184
中图分类号
学科分类号
摘要
Affinity Propagation (AP) algorithm is not effective in processing large-scale data-sets, so the paper purposed an affinity propagation clustering algorithm based on large scale data-set, called LD-AP. First, we use the idea of grid clustering to divide large data-sets into small datasets and running AP in them to ensure the center of clustering. Then, we introduced the structure similarity matrix to calculate the distance of the cluster center. At last, we used Density peak Clustering Algorithm (DP) algorithm to cluster the cluster again. The experimental results show that the improved algorithm is better than the original algorithm in the clustering effect and computation speed. © 2018, © 2018 Informa UK Limited, trading as Taylor & Francis Group.
引用
收藏
页码:1 / 6
页数:5
相关论文
共 50 条
  • [1] CLUSTERING LARGE-SCALE DATA BASED ON MODIFIED AFFINITY PROPAGATION ALGORITHM
    Serdah, Ahmed M.
    Ashour, Wesam M.
    JOURNAL OF ARTIFICIAL INTELLIGENCE AND SOFT COMPUTING RESEARCH, 2016, 6 (01) : 23 - 33
  • [2] An Improved Affinity Propagation Clustering Algorithm for Large-scale Data Sets
    Liu, Xiaonan
    Yin, Meijuan
    Luo, Junyong
    Chen, Wuping
    2013 NINTH INTERNATIONAL CONFERENCE ON NATURAL COMPUTATION (ICNC), 2013, : 894 - 899
  • [3] The Research on Large Scale Data Set Clustering Algorithm Based on Tag Set
    Chen, Qiang
    COMPUTATIONAL INTELLIGENCE AND INTELLIGENT SYSTEMS, (ISICA 2015), 2016, 575 : 365 - 372
  • [4] A stratified sampling based clustering algorithm for large-scale data
    Zhao, Xingwang
    Liang, Jiye
    Dang, Chuangyin
    KNOWLEDGE-BASED SYSTEMS, 2019, 163 : 416 - 428
  • [5] Exploring gendered cycling behaviours within a large-scale behavioural data-set
    Beecham, Roger
    Wood, Jo
    TRANSPORTATION PLANNING AND TECHNOLOGY, 2014, 37 (01) : 83 - 97
  • [6] Fuzzy clustering algorithm based on multiple medoids for large-scale data
    Chen A.-G.
    Wang S.-T.
    Kongzhi yu Juece/Control and Decision, 2016, 31 (12): : 2122 - 2130
  • [8] Local and global approaches of affinity propagation clustering for large scale data
    Ding-yin Xia
    Fei Wu
    Xu-qing Zhang
    Yue-ting Zhuang
    Journal of Zhejiang University-SCIENCE A, 2008, 9 : 1373 - 1381
  • [9] Local and global approaches of affinity propagation clustering for large scale data
    Xia, Ding-yin
    Wu, Fei
    Zhang, Xu-qing
    Zhuang, Yue-ting
    JOURNAL OF ZHEJIANG UNIVERSITY-SCIENCE A, 2008, 9 (10): : 1373 - 1381
  • [10] A Local Approach of Adaptive Affinity Propagation Clustering for Large Scale Data
    Sun, Changyin
    Wang, Chenghong
    Song, Su
    Wang, Yifan
    IJCNN: 2009 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, VOLS 1- 6, 2009, : 161 - +