On Optimizing the Trade-off between Privacy and Utility in Data Provenance

被引:5
|
作者
Deutch, Daniel [1 ]
Frankenthal, Ariel [1 ]
Gilad, Amir [2 ]
Moskovitch, Yuval [3 ]
机构
[1] Tel Aviv Univ, Tel Aviv, Israel
[2] Duke Univ, Durham, NC 27706 USA
[3] Univ Michigan, Ann Arbor, MI 48109 USA
关键词
SECURE; AGGREGATION; VIEWS;
D O I
10.1145/3448016.3452835
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Organizations that collect and analyze data may wish or be mandated by regulation to justify and explain their analysis results. At the same time, the logic that they have followed to analyze the data, i.e., their queries, may be proprietary and confidential. Data provenance, a record of the transformations that data underwent, was extensively studied as means of explanations. In contrast, only a few works have studied the tension between disclosing provenance and hiding the underlying query. This tension is the focus of the present paper, where we formalize and explore for the first time the tradeoff between the utility of presenting provenance information and the breach of privacy it poses with respect to the underlying query. Intuitively, our formalization is based on the notion of provenance abstraction, where the representation of some tuples in the provenance expressions is abstracted in a way that makes multiple tuples indistinguishable. The privacy of a chosen abstraction is then measured based on how many queries match the obfuscated provenance, in the same vein as k-anonymity. The utility is measured based on the entropy of the abstraction, intuitively how much information is lost with respect to the actual tuples participating in the provenance. Our formalization yields a novel optimization problem of choosing the best abstraction in terms of this tradeoff. We show that the problem is intractable in general, but design greedy heuristics that exploit the provenance structure towards a practically efficient exploration of the search space. We experimentally prove the effectiveness of our solution using the TPC-H benchmark and the IMDB dataset.
引用
收藏
页码:379 / 391
页数:13
相关论文
共 50 条
  • [1] Local Differential Privacy on Metric Spaces: optimizing the trade-off with utility
    Alvim, Mario S.
    Chatzikokolakis, Konstantinos
    Palamidessi, Catuscia
    Pazii, Anna
    IEEE 31ST COMPUTER SECURITY FOUNDATIONS SYMPOSIUM (CSF 2018), 2018, : 262 - 267
  • [2] Differential privacy: On the trade-off between utility and information leakage
    INRIA, LIX, Ecole Polytechnique, France
    不详
    Lect. Notes Comput. Sci., (39-54):
  • [3] Who are you? The trade-off between information utility and privacy
    Miller, Jim
    IEEE INTERNET COMPUTING, 2008, 12 (04) : 93 - 96
  • [4] PULP: Achieving Privacy and Utility Trade-off in User Mobility Data
    Cerf, Sophie
    Primault, Vincent
    Boutet, Antoine
    Ben Mokhtar, Sonia
    Birke, Robert
    Bouchenak, Sara
    Chen, Lydia Y.
    Marchand, Nicolas
    Robu, Bogdan
    2017 IEEE 36TH INTERNATIONAL SYMPOSIUM ON RELIABLE DISTRIBUTED SYSTEMS (SRDS), 2017, : 164 - 173
  • [5] On the Trade-Off Between Privacy and Utility in Mobile Services: A Qualitative Study
    Liu, Yang
    Simpson, Andrew
    COMPUTER SECURITY, ESORICS 2019, 2020, 11980 : 261 - 278
  • [6] AI in Healthcare Data Privacy-Preserving: Enhanced Trade-Off Between Security and Utility
    Peng, Lian
    Qiu, Meikang
    KNOWLEDGE SCIENCE, ENGINEERING AND MANAGEMENT, PT III, KSEM 2024, 2024, 14886 : 349 - 360
  • [7] Utility/privacy trade-off as regularized optimal transport
    Boursier, Etienne
    Perchet, Vianney
    MATHEMATICAL PROGRAMMING, 2024, 203 (1-2) : 703 - 726
  • [8] Utility/privacy trade-off as regularized optimal transport
    Etienne Boursier
    Vianney Perchet
    Mathematical Programming, 2024, 203 : 703 - 726
  • [9] On the Privacy-Utility Trade-Off With and Without Direct Access to the Private Data
    Zamani, Amirreza
    Oechtering, Tobias J.
    Skoglund, Mikael
    IEEE TRANSACTIONS ON INFORMATION THEORY, 2024, 70 (03) : 2177 - 2200
  • [10] Data privacy and utility trade-off based on mutual information neural estimator
    Wu, Qihong
    Tang, Jinchuan
    Dang, Shuping
    Chen, Gaojie
    EXPERT SYSTEMS WITH APPLICATIONS, 2022, 207