A Supermodularity-Based Differential Privacy Preserving Algorithm for Data Anonymization

Cited by: 20
Authors
Fouad, Mohamed R. [1]
Elbassioni, Khaled [2]
Bertino, Elisa [1]
Affiliations
[1] Purdue Univ, W Lafayette, IN 47907 USA
[2] Max Planck Inst Informat, D-66123 Saarbrucken, Germany
Keywords
Differential privacy; security; risk management; data sharing; data utility; anonymity; scalability
DOI
10.1109/TKDE.2013.107
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
Maximizing data usage and minimizing privacy risk are two conflicting goals. Organizations always apply a set of transformations on their data before releasing it. While determining the best set of transformations has been the focus of extensive work in the database community, most of this work suffered from one or both of the following major problems: scalability and privacy guarantee. Differential Privacy provides a theoretical formulation for privacy that ensures that the system essentially behaves the same way regardless of whether any individual is included in the database. In this paper, we address both scalability and privacy risk of data anonymization. We propose a scalable algorithm that meets differential privacy when applying a specific random sampling. The contribution of the paper is two-fold: 1) we propose a personalized anonymization technique based on an aggregate formulation and prove that it can be implemented in polynomial time; and 2) we show that combining the proposed aggregate formulation with specific sampling gives an anonymization algorithm that satisfies differential privacy. Our results rely heavily on exploring the supermodularity properties of the risk function, which allow us to employ techniques from convex optimization. Through experimental studies we compare our proposed algorithm with other anonymization schemes in terms of both time and privacy risk.
Pages: 1591 - 1601
Number of pages: 11
Related papers
50 in total
  • [1] Privacy-preserving Searchable Encryption Based on Anonymization and Differential privacy
    Ma, Caixia
    Jia, Chunfu
    Du, Ruizhong
    Ha, Guanxiong
    Li, Mingyue
    2024 IEEE INTERNATIONAL CONFERENCE ON WEB SERVICES, ICWS 2024, 2024, : 371 - 382
  • [2] (k, ε, δ)-Anonymization: privacy-preserving data release based on k-anonymity and differential privacy
    Tsou, Yao-Tung
    Alraja, Mansour Naser
    Chen, Li-Sheng
    Chang, Yu-Hsiang
    Hu, Yung-Li
    Huang, Yennun
    Yu, Chia-Mu
    Tsai, Pei-Yuan
    SERVICE ORIENTED COMPUTING AND APPLICATIONS, 2021, 15 (03) : 175 - 185
  • [3] Anonymization-Based Attacks in Privacy-Preserving Data Publishing
    Wong, Raymond Chi-Wing
    Fu, Ada Wai-Chee
    Wang, Ke
    Pei, Jian
    ACM TRANSACTIONS ON DATABASE SYSTEMS, 2009, 34 (02):
  • [4] Privacy Preserving Data Publishing and Data Anonymization Approaches: A Review
    Goswami, Puneet
    Madan, Suman
    2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTING, COMMUNICATION AND AUTOMATION (ICCCA), 2017, : 139 - 142
  • [5] Clustering based privacy preserving of big data using fuzzification and anonymization operation
    Khan S.
    Iqbal K.
    Faizullah S.
    Fahad M.
    Ali J.
    Ahmed W.
    International Journal of Advanced Computer Science and Applications, 2019, 10 (12) : 282 - 289
  • [6] Clustering based Privacy Preserving of Big Data using Fuzzification and Anonymization Operation
    Khan, Saira
    Iqbal, Khalid
    Faizullah, Safi
    Fahad, Muhammad
    Ali, Jawad
    Ahmed, Waqas
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2019, 10 (12) : 282 - 289
  • [7] A differential privacy-based privacy-preserving data publishing algorithm for transit smart card data
    Li, Yang
    Yang, Dasen
    Hu, Xianbiao
    TRANSPORTATION RESEARCH PART C-EMERGING TECHNOLOGIES, 2020, 115
  • [8] Stipulation-Based Anonymization with Sensitivity Flags for Privacy Preserving Data Publishing
    Ashoka, K.
    Poornima, B.
    RECENT FINDINGS IN INTELLIGENT COMPUTING TECHNIQUES, VOL 1, 2019, 707 : 445 - 454
  • [9] Flexible Anonymization For Privacy Preserving Data Publishing: A Systematic Search Based Approach
    Hore, Bijit
    Jammalamadaka, Ravi Chandra
    Mehrotra, Sharad
    PROCEEDINGS OF THE SEVENTH SIAM INTERNATIONAL CONFERENCE ON DATA MINING, 2007, : 497 - 502
  • [10] Privacy and utility preserving data clustering for data anonymization and distribution on Hadoop
    Nayahi, J. Jesu Vedha
    Kavitha, V.
    FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2017, 74 : 393 - 408