Research on differential privacy preserving clustering algorithm based on spark platform

被引:0
|
作者
Meng Q. [1 ]
Zhou L. [1 ]
机构
[1] Department of Information Engineering College, Capital Normal University, Beijing
关键词
Differential evolution; Differential privacy; K-means; Opposition-based learning; Spark;
D O I
10.3966/199115992018012901005
中图分类号
学科分类号
摘要
Differential privacy is a kind of privacy protection model based on data distortion proposed by Dwork. As the model does not need to assume the prior knowledge of the attacker, it has been a research hot spot in the field of privacy protection. Aimed at the problem that the traditional differential privacy K-means algorithm is more sensitive to the selection of the initial center points, which reduces the usability of clustering results, an improved differential privacy preserving clustering algorithm (DEDP K-means) is proposed by introducing adaptive opposition-based learning technique and differential evolution algorithm. At the same time, the improved algorithm is parallelized based on the Spark platform. It was also demonstrated that the improved algorithm can optimize the selection of the initial centers, improve the usability of clustering results and have a good speedup when dealing with massive data by parallel experiments. © 2018 Computer Society of the Republic of China. All rights reserved.
引用
收藏
页码:47 / 62
页数:15
相关论文
共 50 条
  • [1] A-PAM Clustering Algorithm Based on Differential Privacy Preserving
    Shao, Rong-min
    Zhang, Lin
    Liu, Yan
    Huang, Da-guang
    2015 INTERNATIONAL CONFERENCE ON SOFTWARE, MULTIMEDIA AND COMMUNICATION ENGINEERING (SMCE 2015), 2015, : 183 - 190
  • [2] Density Peak Clustering Algorithm Based on Differential Privacy Preserving
    Chen, Yun
    Du, Yunlan
    Cao, Xiaomei
    SCIENCE OF CYBER SECURITY, SCISEC 2019, 2019, 11933 : 20 - 32
  • [3] DPHKMS: An Efficient Hybrid Clustering Preserving Differential Privacy in Spark
    Gao, Zhi-Qiang
    Zhang, Long-Jun
    ADVANCES IN INTERNETWORKING, DATA & WEB TECHNOLOGIES, EIDWT-2017, 2018, 6 : 367 - 377
  • [4] The Research of Privacy-preserving Clustering Algorithm
    Shen, Yanguang
    Han, Junrui
    Shan, Huifang
    2010 THIRD INTERNATIONAL SYMPOSIUM ON INTELLIGENT INFORMATION TECHNOLOGY AND SECURITY INFORMATICS (IITSI 2010), 2010, : 324 - 327
  • [5] Differential Privacy-Preserving Recommendation Algorithm Based on Bhattacharyya Coefficient Clustering
    Wang Y.
    Yin E.-M.
    Ran X.
    Beijing Youdian Daxue Xuebao/Journal of Beijing University of Posts and Telecommunications, 2021, 44 (02): : 81 - 88
  • [6] Affinity Propagation Clustering Algorithm based on Spark Platform
    Zhang, Lijia
    Cheng, Lianglun
    PROCEEDINGS OF THE 2016 2ND WORKSHOP ON ADVANCED RESEARCH AND TECHNOLOGY IN INDUSTRY APPLICATIONS, 2016, 81 : 532 - 535
  • [7] Privacy preserving DBSCAN algorithm for clustering
    Anil Kumar, K.
    Pandu Rangan, C.
    Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 2007, 4632 : 57 - 68
  • [8] Privacy preserving DBSCAN algorithm for clustering
    Kumar, K. Anil
    Rangan, C. Pandu
    ADVANCED DATA MINING AND APPLICATIONS, PROCEEDINGS, 2007, 4632 : 57 - +
  • [9] Research on the Parallelization of the DBSCAN Clustering Algorithm for Spatial Data Mining Based on the Spark Platform
    Huang, Fang
    Zhu, Qiang
    Zhou, Ji
    Tao, Jian
    Zhou, Xiaocheng
    Jin, Du
    Tan, Xicheng
    Wang, Lizhe
    REMOTE SENSING, 2017, 9 (12)
  • [10] A Local Differential Privacy Based Privacy-Preserving Grid Clustering Method
    Zhang D.-Y.
    Ni W.-W.
    Zhang S.
    Fu N.
    Hou L.-H.
    Jisuanji Xuebao/Chinese Journal of Computers, 2023, 46 (02): : 422 - 435