An Improved Density Peak Clustering Algorithm Based on Chebyshev Inequality and Differential Privacy

被引:4
|
作者
Chen, Hua [1 ]
Zhou, Yuan [1 ,2 ]
Mei, Kehui [1 ]
Wang, Nan [1 ]
Tang, Mengdi [1 ]
Cai, Guangxing [1 ]
机构
[1] Hubei Univ Technol, Sch Sci, Wuhan 430068, Peoples R China
[2] Wuhan Univ Bioengn, Sch Comp Sci & Technol, Wuhan 430060, Peoples R China
来源
APPLIED SCIENCES-BASEL | 2023年 / 13卷 / 15期
基金
中国国家自然科学基金;
关键词
DPC algorithm; differential privacy; cosine distance; dichotomy method; Chebyshev inequality; BIG DATA;
D O I
10.3390/app13158674
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
Featured Application Privacy protection and data mining. This study aims to improve the quality of the clustering results of the density peak clustering (DPC) algorithm and address the privacy protection problem in the clustering analysis process. To achieve this, a DPC algorithm based on Chebyshev inequality and differential privacy (DP-CDPC) is proposed. Firstly, the distance matrix is calculated using cosine distance instead of Euclidean distance when dealing with high-dimensional datasets, and the truncation distance is automatically calculated using the dichotomy method. Secondly, to solve the difficulty in selecting suitable clustering centers in the DPC algorithm, statistical constraints are constructed from the perspective of the decision graph using Chebyshev inequality, and the selection of clustering centers is achieved by adjusting the constraint parameters. Finally, to address the privacy leakage problem in the cluster analysis, the Laplace mechanism is applied to introduce noise to the local density in the process of cluster analysis, enabling the privacy protection of the algorithm. The experimental results demonstrate that the DP-CDPC algorithm can effectively select the clustering centers, improve the quality of clustering results, and provide good privacy protection performance.
引用
收藏
页数:20
相关论文
共 50 条
  • [31] Quantum Density Peak Clustering Algorithm
    Wu, Zhihao
    Song, Tingting
    Zhang, Yanbing
    ENTROPY, 2022, 24 (02)
  • [32] Accelerating Density Peak Clustering Algorithm
    Lin, Jun-Lin
    SYMMETRY-BASEL, 2019, 11 (07):
  • [33] An Adaptive Density Peak Clustering Algorithm
    Ma S.-H.
    You H.-R.
    Tang L.
    He P.
    Dongbei Daxue Xuebao/Journal of Northeastern University, 2022, 43 (06): : 761 - 768
  • [34] Survey on Density Peak Clustering Algorithm
    Chen Y.
    Shen L.
    Zhong C.
    Wang T.
    Chen Y.
    Du J.
    Jisuanji Yanjiu yu Fazhan/Computer Research and Development, 2020, 57 (02): : 378 - 394
  • [35] A-PAM Clustering Algorithm Based on Differential Privacy Preserving
    Shao, Rong-min
    Zhang, Lin
    Liu, Yan
    Huang, Da-guang
    2015 INTERNATIONAL CONFERENCE ON SOFTWARE, MULTIMEDIA AND COMMUNICATION ENGINEERING (SMCE 2015), 2015, : 183 - 190
  • [36] Unsupervised clustering algorithm for databases based on density peak optimisation
    Pu, Xiaochuan
    Seo, Wonchul
    Ruan, Qingqiang
    INTERNATIONAL JOURNAL OF AUTONOMOUS AND ADAPTIVE COMMUNICATIONS SYSTEMS, 2023, 16 (03) : 313 - 326
  • [37] Density peak clustering algorithm based on interval shadowed sets
    Chen Y.
    Zhang Q.
    Yang J.
    Moshi Shibie yu Rengong Zhineng/Pattern Recognition and Artificial Intelligence, 2019, 32 (06): : 531 - 544
  • [38] A domain density peak clustering algorithm based on natural neighbor
    Chen, Di
    Du, Tao
    Zhou, Jin
    Shen, Tianyu
    INTELLIGENT DATA ANALYSIS, 2023, 27 (02) : 443 - 462
  • [39] An Improved Density Peaks Clustering Algorithm Based On Density Ratio
    Zou, Yujuan
    Wang, Zhijian
    Xu, Pengfei
    Lv, Taizhi
    COMPUTER JOURNAL, 2024, 67 (07): : 2515 - 2528
  • [40] Density Peak Clustering Algorithm Based on Space Vector Search
    Ma, Zhenming
    An, Junxiu
    Computer Engineering and Applications, 2023, 59 (15): : 123 - 131