The Effect of Clustering on Data Privacy

被引:6
|
作者
Canbay, Pelin [1 ]
Sever, Hayri [1 ]
机构
[1] Hacettepe Univ, Dept Comp Engn, Ankara, Turkey
关键词
data privacy; privacy preserving; anonymization; clustering; data diversity;
D O I
10.1109/ICMLA.2015.198
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
The data obtained by various organizations provide opportunities for generating solutions in the future. It is essential that, the accurate data must be sharable with research communities and scientists in order to improve quality of life. However, accurate records of personal data include sensitive information about individuals. Hence sharing these records without applying any anonymization criteria paves the way for disclosure of personal privacy. In an effort to protect personal privacy, Privacy-Preserving Data Mining (PPDM) and Privacy-Preserving Data Publishing (PPDP) approaches have been studied extensively. Numerous works have been dedicated to diversifying techniques for de-identification or anonymization of identifiable datasets, but there is an important trade-off between data loss and data privacy. While original data anonymized, it exposed to information loss. In order to minimize information loss, the anonymization algorithms discard keeping diversity. In this study, we proposed an approach that uses a clustering algorithm as a pre-process for privacy preserving methods to improve the diversity of anonymized data. In addition, the effect of clustering on anonymization was evaluated by using original and clustered form of a real world dataset. The results are evaluated with the aspect of usability in scientific works and it was observed that a clustering algorithm and an affective anonymization algorithm must be used in privacy preservation approaches in order to keep diversity of the original datasets.
引用
收藏
页码:277 / 282
页数:6
相关论文
共 50 条
  • [41] Privacy preserving spatio-temporal clustering on horizontally partitioned data
    Inan, Ali
    Saygin, Yucel
    DATA WAREHOUSING AND KNOWLEDGE DISCOVERY, PROCEEDINGS, 2006, 4081 : 459 - 468
  • [42] Clustering-anonymity method for data-publishing privacy preservation
    Jiang Huowen
    PROCEEDINGS OF THE 2015 2ND INTERNATIONAL CONFERENCE ON ELECTRICAL, COMPUTER ENGINEERING AND ELECTRONICS (ICECEE 2015), 2015, 24 : 34 - 37
  • [43] Privacy-preserving data publishing based on de-clustering
    Wei, Qiong
    Lu, Yansheng
    Lou, Qiang
    7TH IEEE/ACIS INTERNATIONAL CONFERENCE ON COMPUTER AND INFORMATION SCIENCE IN CONJUNCTION WITH 2ND IEEE/ACIS INTERNATIONAL WORKSHOP ON E-ACTIVITY, PROCEEDINGS, 2008, : 152 - +
  • [44] Hierarchical PSO Clustering on MapReduce for Scalable Privacy Preservation in Big Data
    Wai, Ei Nyein Chan
    Tsai, Pei-Wei
    Pan, Jeng-Shyang
    GENETIC AND EVOLUTIONARY COMPUTING, 2017, 536 : 36 - 44
  • [45] Enhancing Privacy and Availability for Data Clustering in Intelligent Electrical Service of IoT
    Xiong, Jinbo
    Ren, Jun
    Chen, Lei
    Yao, Zhiqiang
    Lin, Mingwei
    Wu, Dapeng
    Niu, Ben
    IEEE INTERNET OF THINGS JOURNAL, 2019, 6 (02): : 1530 - 1540
  • [46] The Effect of Providing Visualizations in Privacy Policies on Trust in Data Privacy and Security
    Becker, Joerg
    Heddier, Marcel
    Oeksuez, Ayten
    Knackstedt, Ralf
    2014 47TH HAWAII INTERNATIONAL CONFERENCE ON SYSTEM SCIENCES (HICSS), 2014, : 3224 - 3233
  • [47] Clustering-based privacy preserving anonymity approach for table data sharing
    Piao, Chunhui
    Liu, Liping
    Shi, Yajuan
    Jiang, Xuehong
    Song, Ning
    INTERNATIONAL JOURNAL OF SYSTEM ASSURANCE ENGINEERING AND MANAGEMENT, 2020, 11 (04) : 768 - 773
  • [48] Clustering-Based Federated Learning for Enhancing Data Privacy in Internet of Vehicles
    Jin, Zilong
    Wang, Jin
    Zhang, Lejun
    KSII TRANSACTIONS ON INTERNET AND INFORMATION SYSTEMS, 2024, 18 (06): : 1462 - 1477
  • [49] Privacy-preserving clustering federated learning for non-IID data
    Luo, Guixun
    Chen, Naiyue
    He, Jiahuan
    Jin, Bingwei
    Zhang, Zhiyuan
    Li, Yidong
    FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2024, 154 : 384 - 395
  • [50] Fuzzy Co-clustering of Vertically Partitioned Cooccurrence Data With Privacy Consideration
    Honda, Katsuhiro
    Oda, Toshiya
    Notsu, Akira
    2014 IEEE INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS (FUZZ-IEEE), 2014, : 2500 - 2504