Privacy Aware K-Means Clustering with High Utility

被引:5
|
作者
Thanh Dai Nguyen [1 ]
Gupta, Sunil [1 ]
Rana, Santu [1 ]
Venkatesh, Svetha [1 ]
机构
[1] Deakin Univ, Ctr Pattern Recognit & Data Analyt, Geelong, Vic 3216, Australia
关键词
D O I
10.1007/978-3-319-31750-2_31
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Privacy-preserving data mining aims to keep data safe, yet useful. But algorithms providing strong guarantees often end up with low utility. We propose a novel privacy preserving framework that thwarts an adversary from inferring an unknown data point by ensuring that the estimation error is almost invariant to the inclusion/exclusion of the data point. By focusing directly on the estimation error of the data point, our framework is able to significantly lower the perturbation required. We use this framework to propose a new privacy aware K-means clustering algorithm. Using both synthetic and real datasets, we demonstrate that the utility of this algorithm is almost equal to that of the unperturbed K-means, and at strict privacy levels, almost twice as good as compared to the differential privacy counterpart.
引用
收藏
页码:388 / 400
页数:13
相关论文
共 50 条
  • [1] Privacy Preserving Approximate K-means Clustering
    Biswas, Chandan
    Ganguly, Debasis
    Roy, Dwaipayan
    Bhattacharya, Ujjwal
    PROCEEDINGS OF THE 28TH ACM INTERNATIONAL CONFERENCE ON INFORMATION & KNOWLEDGE MANAGEMENT (CIKM '19), 2019, : 1321 - 1330
  • [2] K-Means Clustering with Local Distance Privacy
    Yang, Mengmeng
    Huang, Longxia
    Tang, Chenghua
    BIG DATA MINING AND ANALYTICS, 2023, 6 (04) : 433 - 442
  • [3] Efficient Privacy Preserving K-Means Clustering
    Upmanyu, Maneesh
    Namboodiri, Anoop M.
    Srinathan, Kannan
    Jawahar, C. V.
    INTELLIGENCE AND SECURITY INFORMATICS, PROCEEDINGS, 2010, 6122 : 154 - 166
  • [4] Privacy Preserving K-means Clustering: A Survey Research
    Meskine, Fatima
    Bahloul, Safia Nait
    INTERNATIONAL ARAB JOURNAL OF INFORMATION TECHNOLOGY, 2012, 9 (02) : 194 - 200
  • [5] Privacy Preservation in k-Means Clustering by Cluster Rotation
    Dhiraj, S. S. Shivaji
    Khan, Ameer M. Asif
    Khan, Wajhiulla
    Challagalla, Ajay
    TENCON 2009 - 2009 IEEE REGION 10 CONFERENCE, VOLS 1-4, 2009, : 1437 - 1443
  • [6] k-Means Clustering with Distance-Based Privacy
    Epasto, Alessandro
    Mirrokni, Vahab
    Narayanan, Shyam
    Zhong, Peilin
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
  • [7] Privacy Preserving Clustering: A k-Means Type Extension
    Li, Wenye
    NEURAL INFORMATION PROCESSING (ICONIP 2014), PT II, 2014, 8835 : 319 - 326
  • [8] Distributed K-Means clustering guaranteeing local differential privacy
    Xia, Chang
    Hua, Jingyu
    Tong, Wei
    Zhong, Sheng
    COMPUTERS & SECURITY, 2020, 90
  • [9] Privacy of outsourced two-party k-means clustering
    Cai, Yunlu
    Tang, Chunming
    CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2021, 33 (08):
  • [10] Research on k-means Clustering Algorithm An Improved k-means Clustering Algorithm
    Shi Na
    Liu Xumin
    Guan Yong
    2010 THIRD INTERNATIONAL SYMPOSIUM ON INTELLIGENT INFORMATION TECHNOLOGY AND SECURITY INFORMATICS (IITSI 2010), 2010, : 63 - 67