Affinity propagation clustering algorithm based on large-scale data-set

被引:38
|
作者
Wang L. [1 ]
Zheng K. [1 ]
Tao X. [2 ]
Han X. [3 ]
机构
[1] School of Management Science and Information Engineering, Jilin University Finance and Economics, Changchun
[2] School of Library Information and Archives Management Engineering, Jilin University, Changchun
[3] School of Computer Science and Engineering, Changchun University of Technology, Jiiln
基金
中国国家自然科学基金;
关键词
affinity propagation algorithm; density peak algorithm; Large-scale data-sets; structural similarity;
D O I
10.1080/1206212X.2018.1425184
中图分类号
学科分类号
摘要
Affinity Propagation (AP) algorithm is not effective in processing large-scale data-sets, so the paper purposed an affinity propagation clustering algorithm based on large scale data-set, called LD-AP. First, we use the idea of grid clustering to divide large data-sets into small datasets and running AP in them to ensure the center of clustering. Then, we introduced the structure similarity matrix to calculate the distance of the cluster center. At last, we used Density peak Clustering Algorithm (DP) algorithm to cluster the cluster again. The experimental results show that the improved algorithm is better than the original algorithm in the clustering effect and computation speed. © 2018, © 2018 Informa UK Limited, trading as Taylor & Francis Group.
引用
收藏
页码:1 / 6
页数:5
相关论文
共 50 条
  • [31] On the Clustering of Large-scale Data: A Matrix-based Approach
    Wang, Lijun
    Dong, Ming
    2011 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2011, : 139 - 144
  • [32] Affinity Propagation Clustering Algorithm based on Spark Platform
    Zhang, Lijia
    Cheng, Lianglun
    PROCEEDINGS OF THE 2016 2ND WORKSHOP ON ADVANCED RESEARCH AND TECHNOLOGY IN INDUSTRY APPLICATIONS, 2016, 81 : 532 - 535
  • [33] CLUSTERING STUDY BASED ON A LARGE DATA SET OF QUANTUM GENETIC SPECTRAL CLUSTERING ALGORITHM
    Jiang Yong
    Tan Huailiang
    Li Guangwen
    Zhou Hengwei
    2011 INTERNATIONAL CONFERENCE ON INSTRUMENTATION, MEASUREMENT, CIRCUITS AND SYSTEMS (ICIMCS 2011), VOL 3: COMPUTER-AIDED DESIGN, MANUFACTURING AND MANAGEMENT, 2011, : 435 - 440
  • [34] A visual word clustering algorithm based on affinity propagation
    Zhao, Jian
    Sun, Cheng
    Ma, Miao
    Xie, Yu
    2012 7TH INTERNATIONAL CONFERENCE ON SYSTEM OF SYSTEMS ENGINEERING (SOSE), 2012, : 14 - 17
  • [35] A Stable Clustering Algorithm based on Affinity Propagation for VANETs
    Shahwani, Hamayoun
    Toan Duc Bui
    Jeong, Jaehoon
    Shin, Jitae
    2017 19TH INTERNATIONAL CONFERENCE ON ADVANCED COMMUNICATIONS TECHNOLOGY (ICACT) - OPENING NEW ERA OF SMART SOCIETY, 2017, : 501 - 504
  • [36] An improved PSO clustering algorithm based on affinity propagation
    Zheng, Yuyan
    Qu, Jianhua
    Zhou, Yang
    1600, World Scientific and Engineering Academy and Society, Ag. Ioannou Theologou 17-23, Zographou, Athens, 15773, Greece (12): : 447 - 456
  • [37] Parallel SVM for large data-set mining
    Qian, L
    Hung, T
    DATA MINING IV, 2004, 7 : 661 - 670
  • [38] A Sampling-Based Graph Clustering Algorithm for Large-Scale Networks
    Zhang J.-P.
    Chen H.-C.
    Wang K.
    Zhu K.-J.
    Wang Y.-W.
    Tien Tzu Hsueh Pao/Acta Electronica Sinica, 2019, 47 (08): : 1731 - 1737
  • [39] A Parallel Affinity Propagation Clustering Algorithm in Biological Data Processing
    Wang, Minchao
    Zhang, Wu
    Dai, Dongbo
    Zhang, Huiran
    Xie, Jiang
    2014 INTERNATIONAL CONFERENCE ON BIOLOGICAL ENGINEERING AND BIOMEDICAL (BEAB 2014), 2014, : 248 - 254
  • [40] A fast hierarchical clustering algorithm for large-scale protein sequence data sets
    Szilagyi, Sandor M.
    Szilagyi, Laszlo
    COMPUTERS IN BIOLOGY AND MEDICINE, 2014, 48 : 94 - 101