An Improved Density Peak Clustering Algorithm Based on Chebyshev Inequality and Differential Privacy

Cited by: 4
Authors
Chen, Hua [1]
Zhou, Yuan [1, 2]
Mei, Kehui [1]
Wang, Nan [1]
Tang, Mengdi [1]
Cai, Guangxing [1]
Affiliations
[1] Hubei Univ Technol, Sch Sci, Wuhan 430068, Peoples R China
[2] Wuhan Univ Bioengn, Sch Comp Sci & Technol, Wuhan 430060, Peoples R China
Source
APPLIED SCIENCES-BASEL | 2023, Vol. 13, Issue 15
Funding
National Natural Science Foundation of China;
Keywords
DPC algorithm; differential privacy; cosine distance; dichotomy method; Chebyshev inequality; BIG DATA;
DOI
10.3390/app13158674
Chinese Library Classification (CLC) number
O6 [Chemistry];
Discipline classification code
0703 ;
Abstract
Featured Application: Privacy protection and data mining. This study aims to improve the clustering quality of the density peak clustering (DPC) algorithm and to address privacy protection during cluster analysis. To this end, a DPC algorithm based on the Chebyshev inequality and differential privacy (DP-CDPC) is proposed. First, for high-dimensional datasets, the distance matrix is computed with cosine distance instead of Euclidean distance, and the truncation distance is determined automatically by the dichotomy (bisection) method. Second, because suitable cluster centers are difficult to select in the DPC algorithm, statistical constraints are constructed on the decision graph using the Chebyshev inequality, and cluster centers are selected by adjusting the constraint parameters. Finally, to prevent privacy leakage during cluster analysis, the Laplace mechanism adds noise to the local density, which gives the algorithm its privacy protection. The experimental results show that the DP-CDPC algorithm selects cluster centers effectively, improves the quality of the clustering results, and provides good privacy protection.
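The steps named in the abstract can be sketched roughly as follows. This is an illustrative sketch only, not the authors' implementation: the abstract does not give the bisection criterion, the noise sensitivity, or the exact form of the Chebyshev constraint, so the target neighbor fraction, the sensitivity of 1, the rule "gamma > mean + k * std", and all function names below are assumptions.

```python
import numpy as np

def cosine_distance_matrix(X):
    """Pairwise cosine distance 1 - cos(x_i, x_j); rows must be non-zero."""
    unit = X / np.linalg.norm(X, axis=1, keepdims=True)
    D = 1.0 - unit @ unit.T
    np.fill_diagonal(D, 0.0)
    return np.clip(D, 0.0, 2.0)

def local_density(D, dc):
    """Cutoff-kernel density: number of other points closer than dc (dc > 0)."""
    return (D < dc).sum(axis=1) - 1  # subtract the point itself

def bisect_cutoff(D, target=0.02, tol=1e-4, max_iter=100):
    """Bisection on dc until the mean neighbor fraction reaches `target`.
    The target fraction is an assumed criterion, not taken from the paper."""
    n = D.shape[0]
    lo, hi = 0.0, float(D.max())
    for _ in range(max_iter):
        mid = 0.5 * (lo + hi)
        frac = local_density(D, mid).mean() / (n - 1)
        if frac < target:
            lo = mid
        else:
            hi = mid
        if hi - lo < tol:
            break
    return hi

def laplace_density(rho, epsilon, rng, sensitivity=1.0):
    """Laplace mechanism: add noise of scale sensitivity / epsilon.
    Sensitivity 1 is an assumption for a count-based density."""
    return rho + rng.laplace(0.0, sensitivity / epsilon, size=rho.shape)

def min_dist_to_denser(D, rho):
    """delta_i: distance from point i to the nearest point ranked denser."""
    order = np.argsort(-rho, kind="stable")
    delta = np.empty(len(rho))
    delta[order[0]] = D[order[0]].max()  # densest point gets its max distance
    for rank in range(1, len(order)):
        i = order[rank]
        delta[i] = D[i, order[:rank]].min()
    return delta

def chebyshev_centers(rho, delta, k=2.0):
    """Assumed constraint: keep points whose gamma = rho * delta lies more
    than k standard deviations above the mean. By the Chebyshev inequality
    at most a fraction 1 / k**2 of points can deviate that far."""
    gamma = rho * delta
    return np.flatnonzero(gamma > gamma.mean() + k * gamma.std())

# Small demonstration on synthetic data (two groups of directions).
rng = np.random.default_rng(0)
X = np.vstack([rng.normal([5, 1, 0], 0.1, (20, 3)),
               rng.normal([0, 1, 5], 0.1, (20, 3))])
D = cosine_distance_matrix(X)
dc = bisect_cutoff(D, target=0.1)
rho = laplace_density(local_density(D, dc).astype(float), epsilon=1.0, rng=rng)
centers = chebyshev_centers(rho, min_dist_to_denser(D, rho), k=2.0)
print("cutoff:", dc, "candidate centers:", centers)
```

A smaller `epsilon` gives stronger privacy but noisier densities, and a larger `k` tightens the constraint so that fewer points qualify as candidate centers.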
Pages: 20
Related papers
50 in total
  • [41] Grid density peak clustering algorithm based on Zipf distribution
    Ma F.-M.
    Gong T.
    Yang F.
    Zhang T.-F.
    Kongzhi yu Juece/Control and Decision, 2024, 39 (02) : 577 - 587
  • [42] A Collaborative Filtering Recommendation Algorithm Based on Density Peak Clustering
    Wang, Zhihe
    Zhang, Teng
    Du, Hui
    2019 15TH INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND SECURITY (CIS 2019), 2019, : 45 - 49
  • [43] A Recommendation Algorithm Based on Density Peak Clustering and Key Users
    Wang, Pei-Pei
    Liu, Pei-Yu
    Wang, Ru
    Zhu, Zhen-Fang
    2017 13TH INTERNATIONAL CONFERENCE ON NATURAL COMPUTATION, FUZZY SYSTEMS AND KNOWLEDGE DISCOVERY (ICNC-FSKD), 2017, : 468 - 473
  • [44] HaloDPC: An Improved Recognition Method on Halo Node for Density Peak Clustering Algorithm
    Jiang, Jianhua
    Zhou, Wei
    Wang, Limin
    Tao, Xin
    Li, Keqin
    INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2019, 33 (08)
  • [45] Microfeature Segmentation Algorithm for Biological Images Using Improved Density Peak Clustering
    Li, Man
    Sha, Haiyin
    Liu, Hongying
    COMPUTATIONAL AND MATHEMATICAL METHODS IN MEDICINE, 2022, 2022
  • [46] An Improved Algorithm Based on Fast Search and Find of Density Peak Clustering for High-Dimensional Data
    Du, Hui
    Ni, Yiyang
    Wang, Zhihe
    WIRELESS COMMUNICATIONS & MOBILE COMPUTING, 2021, 2021
  • [47] An Improved Algorithm Based on Fast Search and Find of Density Peak Clustering for High-Dimensional Data
    Du, Hui
    Ni, Yiyang
    Wang, Zhihe
    Wireless Communications and Mobile Computing, 2021, 2021
  • [48] Analysis method for factors influencing gear hobbing quality based on density peak clustering and improved multi-objective differential evolution algorithm
    Guo, You
    Yan, Ping
    Wu, Dayuan
    Zhou, Han
    Shi, Yancheng
    Yi, Runzhong
    INTERNATIONAL JOURNAL OF COMPUTER INTEGRATED MANUFACTURING, 2021, 34 (04) : 385 - 406
  • [49] Improved density peaks clustering based on firefly algorithm
    Zhao, Jia
    Tang, Jingjing
    Shi, Aiye
    Fan, Tanghuai
    Xu, Lizhong
    INTERNATIONAL JOURNAL OF BIO-INSPIRED COMPUTATION, 2020, 15 (01) : 24 - 42
  • [50] Improved density peaks clustering based on firefly algorithm
    Zhao J.
    Tang J.
    Shi A.
    Fan T.
    Xu L.
    Xu, Lizhong (lxu0530@126.com), 1600, Inderscience Enterprises Ltd. (15): : 24 - 42