The Effect of Clustering on Data Privacy

被引:6
|
作者
Canbay, Pelin [1 ]
Sever, Hayri [1 ]
机构
[1] Hacettepe Univ, Dept Comp Engn, Ankara, Turkey
关键词
data privacy; privacy preserving; anonymization; clustering; data diversity;
D O I
10.1109/ICMLA.2015.198
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
The data obtained by various organizations provide opportunities for generating solutions in the future. It is essential that, the accurate data must be sharable with research communities and scientists in order to improve quality of life. However, accurate records of personal data include sensitive information about individuals. Hence sharing these records without applying any anonymization criteria paves the way for disclosure of personal privacy. In an effort to protect personal privacy, Privacy-Preserving Data Mining (PPDM) and Privacy-Preserving Data Publishing (PPDP) approaches have been studied extensively. Numerous works have been dedicated to diversifying techniques for de-identification or anonymization of identifiable datasets, but there is an important trade-off between data loss and data privacy. While original data anonymized, it exposed to information loss. In order to minimize information loss, the anonymization algorithms discard keeping diversity. In this study, we proposed an approach that uses a clustering algorithm as a pre-process for privacy preserving methods to improve the diversity of anonymized data. In addition, the effect of clustering on anonymization was evaluated by using original and clustered form of a real world dataset. The results are evaluated with the aspect of usability in scientific works and it was observed that a clustering algorithm and an affective anonymization algorithm must be used in privacy preservation approaches in order to keep diversity of the original datasets.
引用
收藏
页码:277 / 282
页数:6
相关论文
共 50 条
  • [21] Privacy Preserving Clustering over Horizontal and Vertical Partitioned Data
    Sheikhalishahi, Mina
    Martinelli, Fabio
    2017 IEEE SYMPOSIUM ON COMPUTERS AND COMMUNICATIONS (ISCC), 2017, : 1237 - 1244
  • [22] Shearing Based Data Transformation Approach For Privacy Preserving Clustering
    Manikandan, G.
    Sairam, N.
    Sudhan, R.
    Vaishnavi, B.
    2012 THIRD INTERNATIONAL CONFERENCE ON COMPUTING COMMUNICATION & NETWORKING TECHNOLOGIES (ICCCNT), 2012,
  • [23] Privacy preserving unsupervised clustering over vertically partitioned data
    Tasoulis, D. K.
    Laskari, E. C.
    Meletiou, G. C.
    Vrahatis, M. N.
    COMPUTATIONAL SCIENCE AND ITS APPLICATIONS - ICCSA 2006, PT 5, 2006, 3984 : 635 - 643
  • [24] Clustering-oriented privacy-preserving data publishing
    Ni, Weiwei
    Chong, Zhihong
    KNOWLEDGE-BASED SYSTEMS, 2012, 35 : 264 - 270
  • [25] Privacy-Awareness of Distributed Data Clustering Algorithms Revisited
    da Silva, Josenildo C.
    Klusch, Matthias
    Lodi, Stefano
    ADVANCES IN INTELLIGENT DATA ANALYSIS XV, 2016, 9897 : 261 - 272
  • [26] A privacy-preserving data publishing algorithm for clustering application
    Chong, Zhihong
    Ni, Weiwei
    Liu, Tengteng
    Zhang, Yong
    Jisuanji Yanjiu yu Fazhan/Computer Research and Development, 2010, 47 (12): : 2083 - 2089
  • [27] Clustering Approach for User Location Data Privacy in Telecommunication Services
    Vukovic, Marin
    Kordic, Mario
    Jevtic, Dragan
    2016 39TH INTERNATIONAL CONVENTION ON INFORMATION AND COMMUNICATION TECHNOLOGY, ELECTRONICS AND MICROELECTRONICS (MIPRO), 2016, : 1459 - 1463
  • [28] Research on Clustering-Differential Privacy for Express Data Release
    Chen, Tianying
    Kang, Haiyan
    INFORMATION AND COMMUNICATIONS SECURITY, ICICS 2017, 2018, 10631 : 427 - 437
  • [29] Clustering-assisted privacy perseveration model for data mining
    Mohana, S.
    Nithya, T. M.
    Bushra, Sardar Khan Nikkath
    Vasanthi, Ramakrishnan
    Guruprakash, K. S.
    Rajesh, Sudha
    INTERNATIONAL JOURNAL OF AD HOC AND UBIQUITOUS COMPUTING, 2024, 47 (02) : 108 - 125
  • [30] Privacy-Preserving Data Mining in Homogeneous Collaborative Clustering
    Ouda, Mohamed
    Salem, Sameh
    Ali, Ihab
    Saad, El-Sayed
    INTERNATIONAL ARAB JOURNAL OF INFORMATION TECHNOLOGY, 2015, 12 (06) : 604 - 612