Turbo: Effective Caching in Differentially-Private Databases

Cited by: 0
Authors
Kostopoulou, Kelly [1]
Tholoniat, Pierre [1]
Cidon, Asaf [1]
Geambasu, Roxana [1]
Lecuyer, Mathias [2]
Affiliations
[1] Columbia Univ, New York, NY 10027 USA
[2] Univ British Columbia, Vancouver, BC V5Z 1M9, Canada
Keywords
QUERIES
DOI
10.1145/3600006.3613174
Chinese Library Classification number
TP3 [Computing technology, computer technology]
Discipline classification code
0812
Abstract
Differentially-private (DP) databases allow for privacy-preserving analytics over sensitive datasets or data streams. In these systems, user privacy is a limited resource that must be conserved with each query. We propose Turbo, a novel, state-of-the-art caching layer for linear query workloads over DP databases. Turbo builds upon private multiplicative weights (PMW), a DP mechanism that is powerful in theory but ineffective in practice, and transforms it into a highly-effective caching mechanism, PMW-Bypass, that uses prior query results obtained through an external DP mechanism to train a PMW to answer arbitrary future linear queries accurately and "for free" from a privacy perspective. Our experiments on public Covid and CitiBike datasets show that Turbo with PMW-Bypass conserves 1.7-15.9x more budget compared to vanilla PMW and simpler cache designs, a significant improvement. Moreover, Turbo provides support for range query workloads, such as timeseries or streams, where opportunities exist to further conserve privacy budget through DP parallel composition and warm-starting of PMW state. Our work provides a theoretical foundation and general system design for effective caching in DP databases.
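The idea in the abstract of training a histogram from externally obtained DP answers can be illustrated with a minimal Python sketch. It is not the authors' implementation, and it leaves out the sparse-vector accuracy check and the bypass decision that a full PMW-based cache would need. The names `laplace_answer` and `HistogramCache`, the learning rate, and the privacy model (public dataset size, neighboring datasets that differ in one record's value) are assumptions made for illustration.

```python
import math
import random


def laplace_noise(scale, rng):
    """Sample Laplace(0, scale) as the difference of two exponential draws."""
    return rng.expovariate(1.0 / scale) - rng.expovariate(1.0 / scale)


def laplace_answer(query, data_hist, epsilon, rng):
    """Hypothetical external DP mechanism: exact linear query plus Laplace noise.

    data_hist holds record counts per domain value; query holds weights in [0, 1].
    Assumption: the dataset size n is public and neighbors differ in one record,
    so the normalized query has sensitivity 1/n and the noise scale is 1/(n*epsilon).
    """
    n = sum(data_hist)
    exact = sum(q * c for q, c in zip(query, data_hist)) / n
    return exact + laplace_noise(1.0 / (n * epsilon), rng)


class HistogramCache:
    """Histogram estimate of the data distribution, trained by multiplicative weights."""

    def __init__(self, domain_size, learning_rate=0.5):
        # Start from the uniform distribution over the data domain.
        self.weights = [1.0 / domain_size] * domain_size
        self.learning_rate = learning_rate

    def estimate(self, query):
        """Answer a linear query from the histogram alone (no privacy budget spent)."""
        return sum(q * w for q, w in zip(query, self.weights))

    def update(self, query, noisy_answer):
        """Multiplicative-weights step toward an already-released DP answer."""
        # Raise the weight of bins the query covers if the estimate was too low,
        # lower it if the estimate was too high, then renormalize.
        sign = 1.0 if noisy_answer > self.estimate(query) else -1.0
        scaled = [
            w * math.exp(sign * self.learning_rate * q)
            for q, w in zip(query, self.weights)
        ]
        total = sum(scaled)
        self.weights = [w / total for w in scaled]
```

Because `update` only consumes an answer that was already released under DP, it is post-processing and spends no additional budget in this sketch; only calls to `laplace_answer` do.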
Pages: 579 / +
Page count: 25