Summary Statistic Privacy in Data Sharing

Authors
Lin Z. [1]
Wang S. [2]
Sekar V. [2]
Fanti G. [2]
Affiliations
[1] Algorithms Group, Microsoft Research, Redmond, WA 98052
[2] Department of Electrical and Computer Engineering, Carnegie Mellon University, Pittsburgh, PA 15213
Keywords
data privacy; privacy; synthetic data
DOI
10.1109/JSAIT.2024.3403811
Abstract
We study a setting where a data holder wishes to share data with a receiver without revealing certain summary statistics of the data distribution (e.g., mean, standard deviation). The data holder achieves this by passing the data through a randomization mechanism. We propose summary statistic privacy, a metric for quantifying the privacy risk of such a mechanism based on the worst-case probability that an adversary guesses the distributional secret to within some threshold. Defining distortion as the worst-case Wasserstein-1 distance between the real and released data, we prove lower bounds on the tradeoff between privacy and distortion. We then propose a class of quantization mechanisms that can be adapted to different data distributions. We show that the quantization mechanism's privacy-distortion tradeoff matches our lower bounds under certain regimes, up to small constant factors. Finally, we demonstrate on real-world datasets that the proposed quantization mechanisms achieve better privacy-distortion tradeoffs than alternative privacy mechanisms. © 2020 IEEE.
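The abstract does not spell out the quantization mechanism, so the following is only a minimal sketch of the general idea, assuming the secret is the sample mean and that the mechanism shifts the data so the released mean sits at the midpoint of a fixed-width bin. The function names `quantize_mean` and `wasserstein1`, the `bin_width` parameter, and the Gaussian test data are all hypothetical choices for illustration, not the authors' construction.

```python
import math
import random
import statistics


def quantize_mean(samples, bin_width):
    """Shift samples so their mean lands on the midpoint of its bin.

    Assumption: the secret is the mean, and bins are [k*w, (k+1)*w).
    """
    mean = statistics.fmean(samples)
    bin_index = math.floor(mean / bin_width)
    released_mean = (bin_index + 0.5) * bin_width
    shift = released_mean - mean
    return [x + shift for x in samples]


def wasserstein1(xs, ys):
    """Empirical Wasserstein-1 distance between equal-size 1-D samples."""
    if len(xs) != len(ys):
        raise ValueError("samples must have the same size")
    # In one dimension the optimal coupling matches sorted values.
    return sum(abs(a - b) for a, b in zip(sorted(xs), sorted(ys))) / len(xs)


# Hypothetical data: 1000 draws from a Gaussian with a "secret" mean.
rng = random.Random(0)
original = [rng.gauss(3.2, 1.0) for _ in range(1000)]
released = quantize_mean(original, bin_width=1.0)
distortion = wasserstein1(original, released)
```

In this sketch every original mean inside the same bin yields the same released mean, so the receiver learns the mean only up to the bin. Because the release is a pure shift, the empirical Wasserstein-1 distortion equals the size of the shift, which is at most half the bin width; widening the bins hides more and distorts more.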
Pages: 369-384
Page count: 15