Research on differential privacy preserving clustering algorithm based on Spark platform

Authors
Meng Q. [1]
Zhou L. [1]
Affiliations
[1] Department of Information Engineering College, Capital Normal University, Beijing
Keywords
Differential evolution; Differential privacy; K-means; Opposition-based learning; Spark
DOI
10.3966/199115992018012901005
Abstract
Differential privacy is a privacy protection model based on data distortion, proposed by Dwork. Because the model makes no assumptions about the attacker's prior knowledge, it has become a research hot spot in the field of privacy protection. The traditional differential privacy K-means algorithm is sensitive to the selection of the initial center points, which reduces the usability of the clustering results. To address this problem, an improved differential privacy preserving clustering algorithm (DEDP K-means) is proposed, which introduces an adaptive opposition-based learning technique and a differential evolution algorithm. The improved algorithm is also parallelized on the Spark platform. Parallel experiments demonstrate that the improved algorithm optimizes the selection of the initial centers, improves the usability of the clustering results, and achieves good speedup when processing massive data. © 2018 Computer Society of the Republic of China. All rights reserved.
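As a rough illustration of the ideas named in the abstract, the sketch below shows a generic Laplace-noise K-means update step and the basic opposition-based reflection of a point. It is not the DEDP K-means algorithm from the paper: the function names, the assumption that data are scaled to [0, 1]^d, and the per-iteration budget `epsilon` are all illustrative assumptions, and the adaptive opposition-based learning, differential evolution, and Spark parallelization described by the authors are not reproduced here.

```python
import random


def laplace(scale, rng):
    """Draw Laplace(0, scale) noise as the difference of two exponentials."""
    return rng.expovariate(1.0 / scale) - rng.expovariate(1.0 / scale)


def opposite_point(x, low, high):
    """Basic opposition-based learning: reflect x within [low, high]."""
    return [low + high - v for v in x]


def dp_centroid_update(points, centers, epsilon, rng):
    """One noisy K-means step for points assumed to lie in [0, 1]^d.

    Laplace noise with scale (d + 1) / epsilon is added to each cluster's
    coordinate sums and count; d + 1 is the L1 sensitivity of (sum, count)
    when one point is added or removed.
    """
    d = len(centers[0])
    k = len(centers)
    sums = [[0.0] * d for _ in range(k)]
    counts = [0] * k
    for p in points:
        # Assign each point to its nearest current center.
        j = min(range(k),
                key=lambda c: sum((a - b) ** 2 for a, b in zip(p, centers[c])))
        counts[j] += 1
        for i in range(d):
            sums[j][i] += p[i]
    scale = (d + 1) / epsilon
    new_centers = []
    for j in range(k):
        noisy_count = counts[j] + laplace(scale, rng)
        if noisy_count < 1.0:
            # Too few points to estimate a mean; keep the previous center.
            new_centers.append(list(centers[j]))
            continue
        noisy = [(sums[j][i] + laplace(scale, rng)) / noisy_count
                 for i in range(d)]
        # Clamp into the data domain (post-processing, no extra privacy cost).
        new_centers.append([min(1.0, max(0.0, v)) for v in noisy])
    return new_centers
```

In a sketch like this, a smaller `epsilon` gives stronger privacy but noisier centers, which is one reason the quality of the initial centers matters: fewer iterations are needed, so less of the privacy budget is spent.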
Pages: 47-62
Number of pages: 15