Differential Privacy and k -Anonymity-Based Privacy Preserving Data Publishing Scheme With Minimal Loss of Statistical Information

被引:1
|
作者
Majeed, Abdul [1 ]
Hwang, Seong Oun [1 ]
机构
[1] Gachon Univ, Dept Comp Engn, Seongnam 13120, South Korea
关键词
k-anonymity; differential privacy (DP); privacy; privacy-preserving data publishing (PPDP); statistical information; utility; ANONYMIZATION;
D O I
10.1109/TCSS.2023.3320141
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Though anonymization mechanisms have made huge progress in fostering the secondary use of data, it is still very challenging to obtain adequate knowledge from anonymized data while preserving privacy. Most existing mechanisms anonymize entire sections of data and fail to maximally preserve the structure/values of real data. Consequently, the performance of those mechanisms and the output (i.e., the anonymized data) remain problematic in real-life scenarios due to the extensive and unneeded anonymization applied. To address these issues, we propose and implement a hybrid (differential privacy (DP) and k-anonymity) anonymization scheme that produces supreme-quality anonymized data that offers knowledge similar to real data without compromising privacy. Specifically, we implement a pair of algorithms that divide the dataset into privacy-violating and nonprivacy-violating partitions. Afterward, in a nonprivacy-violating partition, a relaxed privacy budget epsilon is applied to numerical attributes, but most of the categorical attributes are retained (as is) for informative analysis. In privacy-violating partitions, fewer changes are applied to the data by using a reasonable value for epsilon and by exploiting the diversity in sensitive information. Experiments are conducted on three real-life datasets to prove the feasibility of our scheme for futuristic AI applications. Compared with state-of-the-art (SOTA) methods, our scheme preserves 60.81% of the originality in the anonymized data. The privacy risks are reduced by 20.05%, and utility is enhanced by 54.01% and 15.33% based on information loss (IL) and accuracy metrics. Furthermore, the time overhead is 3.13x lower than the SOTA methods.
引用
收藏
页码:3753 / 3765
页数:13
相关论文
共 50 条
  • [21] Differential privacy preserving data publishing based on Bayesian network
    Qi, Xuejian
    Ma, Xuebin
    Bai, Xiangyu
    Li, Wuyungerile
    2020 IEEE 19TH INTERNATIONAL CONFERENCE ON TRUST, SECURITY AND PRIVACY IN COMPUTING AND COMMUNICATIONS (TRUSTCOM 2020), 2020, : 1718 - 1726
  • [22] Privacy preserving pattern based anonymity (k, P) anonymity in time series data
    Mayil Vel Kumar, P.
    Karthikeyan, M.
    Journal of Computational and Theoretical Nanoscience, 2015, 12 (12) : 5524 - 5529
  • [23] A Coalitional Game Theoretic Mechanism for Privacy Preserving Publishing Based on k-Anonymity
    Chakravarthy, Srinivasa L.
    Kumari, Valli V.
    Sarojini, Ch
    2ND INTERNATIONAL CONFERENCE ON COMMUNICATION, COMPUTING & SECURITY [ICCCS-2012], 2012, 1 : 889 - 896
  • [24] Enhanced P-Sensitive K-Anonymity Models for Privacy Preserving Data Publishing
    Sun, Xiaoxun
    Wang, Hua
    Li, Jiuyong
    Truta, Traian Marius
    TRANSACTIONS ON DATA PRIVACY, 2008, 1 (02) : 53 - 66
  • [25] K-anonymity privacy-preserving data sharing for a dynamic game scheme
    Cao, Laicheng
    Hou, Yangning
    Feng, Tao
    Guo, Xian
    Xi'an Dianzi Keji Daxue Xuebao/Journal of Xidian University, 2024, 51 (04): : 170 - 179
  • [26] A differential privacy-based privacy-preserving data publishing algorithm for transit smart card data
    Li, Yang
    Yang, Dasen
    Hu, Xianbiao
    TRANSPORTATION RESEARCH PART C-EMERGING TECHNOLOGIES, 2020, 115
  • [27] Privacy-preserving governmental data publishing: A fog-computing-based differential privacy approach
    Piao, Chunhui
    Shi, Yajuan
    Yan, Jiaqi
    Zhang, Changyou
    Liu, Liping
    FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2019, 90 : 158 - 174
  • [28] A Non-Grouping Anonymity Model for Preserving Privacy in Health Data Publishing
    Nagendrakumar, S.
    Aparna, R.
    Ramesh, Srejith
    2014 INTERNATIONAL CONFERENCE ON SCIENCE ENGINEERING AND MANAGEMENT RESEARCH (ICSEMR), 2014,
  • [29] Systematic Evaluation of Probabilistic k-Anonymity for Privacy Preserving Micro-data Publishing and Analysis
    Senavirathne, Navoda
    Torra, Vicenc
    SECRYPT 2021: PROCEEDINGS OF THE 18TH INTERNATIONAL CONFERENCE ON SECURITY AND CRYPTOGRAPHY, 2021, : 307 - 320
  • [30] Lightweight Privacy-Preserving Raw Data Publishing Scheme
    Chen, Jingxue
    Liu, Gao
    Liu, Yining
    IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTING, 2021, 9 (04) : 2170 - 2174