A High-Availability K-modes Clustering Method Based on Differential Privacy

被引:1
|
作者
Zhang, Shaobo [1 ,2 ,3 ]
Yuan, Liujie [1 ,2 ]
Li, Yuxing [1 ,2 ]
Chen, Wenli [1 ,2 ]
Ding, Yifei [1 ,2 ]
机构
[1] Hunan Univ Sci & Technol, Sch Comp Sci & Engn, Xiangtan 411201, Peoples R China
[2] Hunan Key Lab Serv Comp & New Software Serv Techn, Xiangtan 411201, Peoples R China
[3] Natl Univ Def Technol, Coll Comp, Key Lab Software Engn Complex Syst, Changsha 410073, Peoples R China
关键词
Privacy protection; Categorical data mining; Differential privacy; K-modes clustering; ALGORITHM;
D O I
10.1007/978-3-030-95388-1_18
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
In categorical data mining, the K-modes algorithm is a classic algorithm that has been widely used. However, the data analyzed by the K-modes algorithm usually contains sensitive user information. If these data are leaked, it will seriously threaten the privacy of users. In response to this problem, the existing method that combines differential privacy with the K-modes algorithm can effectively prevent privacy leakage. Nevertheless, differential privacy adds noise to the data while protecting data privacy, which will reduce the availability of clustering results. In this paper, we propose a high-availability K-modes clustering mechanism based on differential privacy(HAKC). In this mechanism, based on the use of differential privacy to protect data privacy, we select the initial centroid of the clustering by calculation, and improve the calculation method of the distance between the data point and the centroid in the iterative process.
引用
收藏
页码:274 / 283
页数:10
相关论文
共 50 条
  • [31] DP-k-modes: A self-tuning k-modes clustering algorithm
    Xie, Juanying
    Wang, Mingzhao
    Lu, Xiaoxiao
    Liu, Xinglin
    Grant, Philip W.
    Pattern Recognition Letters, 2022, 158 : 117 - 124
  • [32] Cluster center initialization algorithm for K-modes clustering
    Khan, Shehroz S.
    Ahmad, Amir
    EXPERT SYSTEMS WITH APPLICATIONS, 2013, 40 (18) : 7444 - 7456
  • [33] BINARY CODES K-MODES CLUSTERING FOR HSI SEGMENTATION
    Berthier, Michel
    El Asmar, Saadallah
    Frelicot, Carl
    2016 IEEE 12TH IMAGE, VIDEO, AND MULTIDIMENSIONAL SIGNAL PROCESSING WORKSHOP (IVMSP), 2016,
  • [34] A load clustering algorithm based on discrete wavelet transform and fuzzy K-modes
    Zhang J.
    Zhang Y.
    Hong J.
    Gao H.
    Liu J.
    Dianli Zidonghua Shebei/Electric Power Automation Equipment, 2019, 39 (02): : 100 - 106and122
  • [35] Clustering of Categorical Data Using Intuitionistic Fuzzy k-modes
    Mehta, Darshan
    Tripathy, B. K.
    PROCEEDINGS OF SIXTH INTERNATIONAL CONFERENCE ON SOFT COMPUTING FOR PROBLEM SOLVING (SOCPROS 2016), VOL 1, 2017, 546 : 254 - 263
  • [36] Computation of Initial Modes for K-modes Clustering Algorithm using Evidence Accumulation
    Khan, Shehroz S.
    Kant, Shri
    20TH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2007, : 2784 - 2789
  • [37] Categorical data clustering: 25 years beyond K-modes
    Dinh, Tai
    Wong, Hauchi
    Fournier-Viger, Philippe
    Lisik, Daniil
    Ha, Minh-Quyet
    Dam, Hieu-Chi
    Huynh, Van-Nam
    EXPERT SYSTEMS WITH APPLICATIONS, 2025, 272
  • [38] A weighting k-modes algorithm for subspace clustering of categorical data
    Cao, Fuyuan
    Liang, Jiye
    Li, Deyu
    Zhao, Xingwang
    NEUROCOMPUTING, 2013, 108 : 23 - 30
  • [39] Initialization of K-modes clustering using outlier detection techniques
    Jiang, Feng
    Liu, Guozhu
    Du, Junwei
    Sui, Yuefei
    INFORMATION SCIENCES, 2016, 332 : 167 - 183
  • [40] Semantically Enhanced Clustering in Retail Using Possibilistic K-Modes
    Ammar, Asma
    Elouedi, Zied
    Lingras, Pawan
    ROUGH SETS AND KNOWLEDGE TECHNOLOGY, RSKT 2014, 2014, 8818 : 753 - 764