Efficient and flexible anonymization of transaction data

被引:0
|
作者
Grigorios Loukides
Aris Gkoulalas-Divanis
Jianhua Shao
机构
[1] Cardiff University,School of Computer Science and Informatics
[2] IBM Research-Ireland,Smarter Cities Technology Centre
来源
关键词
Anonymity; Privacy; Transaction data; Privacy requirements; Identity disclosure; Sensitive information disclosure; Efficiency; Scalability;
D O I
暂无
中图分类号
学科分类号
摘要
Transaction data are increasingly used in applications, such as marketing research and biomedical studies. Publishing these data, however, may risk privacy breaches, as they often contain personal information about individuals. Approaches to anonymizing transaction data have been proposed recently, but they may produce excessively distorted and inadequately protected solutions. This is because these approaches do not consider privacy requirements that are common in real-world applications in a realistic and flexible manner, and attempt to safeguard the data only against either identity disclosure or sensitive information inference. In this paper, we propose a new approach that overcomes these limitations. We introduce a rule-based privacy model that allows data publishers to express fine-grained protection requirements for both identity and sensitive information disclosure. Based on this model, we also develop two anonymization algorithms. Our first algorithm works in a top-down fashion, employing an efficient strategy to recursively generalize data with low information loss. Our second algorithm uses sampling and a combination of top-down and bottom-up generalization heuristics, which greatly improves scalability while maintaining low information loss. Extensive experiments show that our algorithms significantly outperform the state-of-the-art in terms of retaining data utility, while achieving good protection and scalability.
引用
收藏
页码:153 / 210
页数:57
相关论文
共 50 条
  • [41] Big Data Privacy and Anonymization
    Torra, Vicenc
    Navarro-Arribas, Guillermo
    PRIVACY AND IDENTITY MANAGEMENT: FACING UP TO NEXT STEPS, 2016, 498 : 15 - 26
  • [42] Mobile Sensor Data Anonymization
    Malekzadeh, Mohammad
    Clegg, Richard G.
    Cavallaro, Andrea
    Haddadi, Hamed
    PROCEEDINGS OF THE 2019 INTERNATIONAL CONFERENCE ON INTERNET OF THINGS DESIGN AND IMPLEMENTATION (IOTDI '19), 2019, : 49 - 58
  • [43] A Review of Anonymization for Healthcare Data
    Olatunji, Iyiola E.
    Rauch, Jens
    Katzensteiner, Matthias
    Khosla, Megha
    BIG DATA, 2022,
  • [44] Scalable Distributed Data Anonymization
    di Vimercati, Sabrina De Capitani
    Facchinetti, Dario
    Foresti, Sara
    Oldani, Gianluca
    Paraboschi, Stefano
    Rossi, Matthew
    Samarati, Pierangela
    2021 IEEE INTERNATIONAL CONFERENCE ON PERVASIVE COMPUTING AND COMMUNICATIONS WORKSHOPS AND OTHER AFFILIATED EVENTS (PERCOM WORKSHOPS), 2021, : 401 - 403
  • [45] Big Data Anonymization with Spark
    Canbay, Yavuz
    Sagiroglu, Seref
    2017 INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND ENGINEERING (UBMK), 2017, : 833 - 838
  • [46] Anonymization in the time of big data
    Domingo-Ferrer J.
    Soria-Comas J.
    Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 2016, 9867 LNCS : 57 - 68
  • [47] Interactive Anonymization of Sensitive Data
    Xiao, Xiaokui
    Wang, Guozhang
    Gehrke, Johannes
    ACM SIGMOD/PODS 2009 CONFERENCE, 2009, : 1051 - 1053
  • [48] Anonymization in the Time of Big Data
    Domingo-Ferrer, Josep
    Soria-Comas, Jordi
    PRIVACY IN STATISTICAL DATABASES: UNESCO CHAIR IN DATA PRIVACY, 2016, 9867 : 57 - 68
  • [49] Data Anonymization With Diversity Constraints
    Milani, Mostafa
    Huang, Yu
    Chiang, Fei
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2023, 35 (04) : 3603 - 3618
  • [50] Consensus Robustness and Transaction De-Anonymization in the Ripple Currency Exchange System
    Di Luzio, Adriano
    Mei, Alessandro
    Stefa, Julinda
    2017 IEEE 37TH INTERNATIONAL CONFERENCE ON DISTRIBUTED COMPUTING SYSTEMS (ICDCS 2017), 2017, : 140 - 150