The Effect of Clustering on Data Privacy

被引:6
|
作者
Canbay, Pelin [1 ]
Sever, Hayri [1 ]
机构
[1] Hacettepe Univ, Dept Comp Engn, Ankara, Turkey
关键词
data privacy; privacy preserving; anonymization; clustering; data diversity;
D O I
10.1109/ICMLA.2015.198
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
The data obtained by various organizations provide opportunities for generating solutions in the future. It is essential that, the accurate data must be sharable with research communities and scientists in order to improve quality of life. However, accurate records of personal data include sensitive information about individuals. Hence sharing these records without applying any anonymization criteria paves the way for disclosure of personal privacy. In an effort to protect personal privacy, Privacy-Preserving Data Mining (PPDM) and Privacy-Preserving Data Publishing (PPDP) approaches have been studied extensively. Numerous works have been dedicated to diversifying techniques for de-identification or anonymization of identifiable datasets, but there is an important trade-off between data loss and data privacy. While original data anonymized, it exposed to information loss. In order to minimize information loss, the anonymization algorithms discard keeping diversity. In this study, we proposed an approach that uses a clustering algorithm as a pre-process for privacy preserving methods to improve the diversity of anonymized data. In addition, the effect of clustering on anonymization was evaluated by using original and clustered form of a real world dataset. The results are evaluated with the aspect of usability in scientific works and it was observed that a clustering algorithm and an affective anonymization algorithm must be used in privacy preservation approaches in order to keep diversity of the original datasets.
引用
收藏
页码:277 / 282
页数:6
相关论文
共 50 条
  • [31] K-Means Clustering With Local dχ-Privacy for Privacy-Preserving Data Analysis
    Yang, Mengmeng
    Tjuawinata, Ivan
    Lam, Kwok-Yan
    IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, 2022, 17 : 2524 - 2537
  • [32] Hybrid privacy preserving clustering for big data while ensuring security
    Pushphavathi T.P.
    Murthy P.V.R.
    International Journal of Cloud Computing, 2021, 10 (04) : 370 - 389
  • [33] Privacy-preserving DBSCAN clustering over vertically partitioned data
    Xu Wei-jiang
    Huang Liu-sheng
    Luo Yong-long
    Yao Yi-fei
    Jing Wei-wei
    MUE: 2007 INTERNATIONAL CONFERENCE ON MULTIMEDIA AND UBIQUITOUS ENGINEERING, PROCEEDINGS, 2007, : 850 - 856
  • [34] Privacy preserving clustering
    Jha, S
    Kruger, L
    McDaniel, P
    COMPUTER SECURITY - ESORICS 2005, PROCEEDINGS, 2005, 3679 : 397 - 417
  • [35] Clustering-Anonymity Method for Privacy Preserving Table Data Sharing
    Liu, Liping
    Piao, Chunhui
    Cao, Huirui
    ADVANCES IN E-BUSINESS ENGINEERING FOR UBIQUITOUS COMPUTING, 2020, 41 : 405 - 420
  • [36] Privacy Preserving Collaborative Clustering Using SOM for Horizontal Data Distribution
    Gadepaka, Latha
    Surampudi, Bapi Raju
    PROCEEDINGS OF THE FIFTH INTERNATIONAL CONFERENCE ON FUZZY AND NEURO COMPUTING (FANCCO - 2015), 2015, 415 : 273 - 284
  • [37] PPPCT: Privacy-Preserving framework for Parallel Clustering Transcriptomics data
    Tadi A.A.
    Alhadidi D.
    Rueda L.
    Computers in Biology and Medicine, 2024, 173
  • [38] A k-anonymity clustering method for effective data privacy preservation
    Chin, Chuang-Cheng
    Tsai, Chieh-Yuan
    ADVANCED DATA MINING AND APPLICATIONS, PROCEEDINGS, 2007, 4632 : 89 - 99
  • [39] Segment Clustering Based Privacy Preserving Algorithm for Trajectory Data Publishing
    Li Fengyun
    Xue Junchao
    Sun Dawei
    Gao Yanfang
    WIRELESS SENSOR NETWORKS (CWSN 2017), 2018, 812 : 211 - 221
  • [40] Global Combination and Clustering Based Differential Privacy Mixed Data Publishing
    Chen, Lanxiang
    Zeng, Lingfang
    Mu, Yi
    Chen, Leilei
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2023, 35 (11) : 11437 - 11448