Research on differential privacy preserving clustering algorithm based on spark platform

被引:0
|
作者
Meng Q. [1 ]
Zhou L. [1 ]
机构
[1] Department of Information Engineering College, Capital Normal University, Beijing
关键词
Differential evolution; Differential privacy; K-means; Opposition-based learning; Spark;
D O I
10.3966/199115992018012901005
中图分类号
学科分类号
摘要
Differential privacy is a kind of privacy protection model based on data distortion proposed by Dwork. As the model does not need to assume the prior knowledge of the attacker, it has been a research hot spot in the field of privacy protection. Aimed at the problem that the traditional differential privacy K-means algorithm is more sensitive to the selection of the initial center points, which reduces the usability of clustering results, an improved differential privacy preserving clustering algorithm (DEDP K-means) is proposed by introducing adaptive opposition-based learning technique and differential evolution algorithm. At the same time, the improved algorithm is parallelized based on the Spark platform. It was also demonstrated that the improved algorithm can optimize the selection of the initial centers, improve the usability of clustering results and have a good speedup when dealing with massive data by parallel experiments. © 2018 Computer Society of the Republic of China. All rights reserved.
引用
收藏
页码:47 / 62
页数:15
相关论文
共 50 条
  • [31] Design of a privacy-preserving algorithm for peer-to-peer network based on differential privacy
    Yu J.
    Ingenierie des Systemes d'Information, 2019, 24 (04): : 433 - 437
  • [32] Privacy preserving clustering
    Jha, S
    Kruger, L
    McDaniel, P
    COMPUTER SECURITY - ESORICS 2005, PROCEEDINGS, 2005, 3679 : 397 - 417
  • [33] Research on Local Fingerprint Image Differential Privacy Protection Method Based on Clustering Algorithm and Regression Algorithm Segmentation Image
    Liu, Chao
    Zhi, Zhaolong
    Zhao, Weinan
    He, Zhicheng
    IEEE ACCESS, 2024, 12 : 27127 - 27146
  • [34] A Novel Differential Privacy Protection Model Based on Spectral Clustering Algorithm
    Zhang, Yudi
    International Journal of Network Security, 2023, 25 (04) : 713 - 720
  • [35] Privacy Preserving EM-Based Clustering
    Luong The Dung
    Ho Tu Bao
    2009 IEEE-RIVF INTERNATIONAL CONFERENCE ON COMPUTING AND COMMUNICATION TECHNOLOGIES: RESEARCH, INNOVATION AND VISION FOR THE FUTURE, 2009, : 111 - +
  • [36] A differential privacy preserving algorithm for greedy decision tree
    Yang, Shudan
    Li, Nan
    Sun, Daozhu
    Du, Qiming
    Liu, Wenfu
    2021 2ND INTERNATIONAL CONFERENCE ON BIG DATA & ARTIFICIAL INTELLIGENCE & SOFTWARE ENGINEERING (ICBASE 2021), 2021, : 229 - 237
  • [37] A Neural-Network Clustering-Based Algorithm for Privacy Preserving Data Mining
    Tsiafoulis, S.
    Zorkadis, V. C.
    Karras, D. A.
    GRID AND DISTRIBUTED COMPUTING, CONTROL AND AUTOMATION, 2010, 121 : 269 - +
  • [38] A privacy-preserving data publishing algorithm for clustering application
    Chong, Zhihong
    Ni, Weiwei
    Liu, Tengteng
    Zhang, Yong
    Jisuanji Yanjiu yu Fazhan/Computer Research and Development, 2010, 47 (12): : 2083 - 2089
  • [39] Privacy Preserving Distributed Cell-based K-means Clustering Algorithm
    Su, Fang
    Zu, Yun-xiao
    Li, Wei-hai
    INTERNATIONAL CONFERENCE ON MATHEMATICS, MODELLING AND SIMULATION TECHNOLOGIES AND APPLICATIONS (MMSTA 2017), 2017, 215 : 377 - 383
  • [40] A reversible privacy-preserving clustering technique based on k-means algorithm
    Lin, Chen-Yi
    APPLIED SOFT COMPUTING, 2020, 87