Differentially Private Multi-Party High-Dimensional Data Publishing

Authors
Su, Sen [1]
Tang, Peng [1]
Cheng, Xiang [1]
Chen, Rui [2]
Wu, Zequn [1]
Affiliations
[1] Beijing Univ Posts & Telecommun, State Key Lab Networking & Switching Technol, Beijing, Peoples R China
[2] Samsung Res Amer, Mountain View, CA USA
Chinese Library Classification
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
In this paper, we study the novel problem of publishing high-dimensional data in a distributed multi-party environment under differential privacy. In particular, with the assistance of a semi-trusted curator, the involved parties (i.e., local data owners) collectively generate a synthetic integrated dataset while satisfying ε-differential privacy for any local dataset. To solve this problem, we present a differentially private sequential update of Bayesian network (DP-SUBN) solution. In DP-SUBN, the parties and the curator collaboratively and sequentially identify the Bayesian network N that best fits the integrated dataset D, from which a synthetic dataset can then be generated. The fundamental advantage of the sequential update is that each party can treat the statistical results provided by previous parties as prior knowledge that directs how it learns N. The core of DP-SUBN is the construction of the search frontier, which serves as prior knowledge guiding the parties as they update N. To improve the fitness of N and reduce the communication cost, we introduce a correlation-aware search frontier construction (CSFC) approach, in which attribute pairs with strong correlations are used to construct the search frontier. In particular, to privately quantify the correlations of attribute pairs without introducing too much noise, we first propose a non-overlapping covering design (NOCD) method, and then introduce a dynamic programming method that finds the optimal NOCD parameters so that the injected noise is minimized. Through formal privacy analysis, we show that DP-SUBN satisfies ε-differential privacy for any local dataset. Extensive experiments on a real dataset demonstrate that DP-SUBN offers desirable data utility with low communication cost.
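To make the idea of privately quantifying the correlation of an attribute pair concrete, the sketch below perturbs a two-attribute contingency table with the standard Laplace mechanism and then computes mutual information from the noisy counts. This is not the NOCD or CSFC method from the paper, whose details the abstract does not give. It is a generic baseline under stated assumptions: neighboring datasets differ by adding or removing one record (so the histogram has L1 sensitivity 1), mutual information is used as the correlation measure, and all function names and parameters are hypothetical.

```python
import math
import random
from collections import Counter, defaultdict


def laplace_noise(scale, rng):
    # The difference of two i.i.d. exponentials with mean `scale`
    # follows a Laplace(0, scale) distribution.
    return rng.expovariate(1.0 / scale) - rng.expovariate(1.0 / scale)


def noisy_pair_counts(rows, a, b, domain_a, domain_b, epsilon, rng):
    """Laplace-perturbed contingency table for attributes a and b.

    Assumption: neighbors differ by one added/removed record, so the
    histogram has L1 sensitivity 1 and Laplace(1/epsilon) noise per
    cell gives epsilon-differential privacy for this one table.
    """
    counts = Counter((r[a], r[b]) for r in rows)
    scale = 1.0 / epsilon
    # Clamping negatives to zero is post-processing and keeps the guarantee.
    return {
        (x, y): max(counts[(x, y)] + laplace_noise(scale, rng), 0.0)
        for x in domain_a
        for y in domain_b
    }


def mutual_information(table):
    """Mutual information (in bits) of a non-negative 2-D count table."""
    total = sum(table.values())
    if total <= 0:
        return 0.0
    px, py = defaultdict(float), defaultdict(float)
    for (x, y), c in table.items():
        px[x] += c
        py[y] += c
    mi = 0.0
    for (x, y), c in table.items():
        if c > 0:
            mi += (c / total) * math.log2(c * total / (px[x] * py[y]))
    return mi


if __name__ == "__main__":
    rng = random.Random(0)
    # Hypothetical local dataset: two perfectly correlated binary attributes.
    rows = [(0, 0)] * 100 + [(1, 1)] * 100
    table = noisy_pair_counts(rows, 0, 1, [0, 1], [0, 1], epsilon=1.0, rng=rng)
    print("noisy mutual information (bits):", mutual_information(table))
```

Measuring every attribute pair this way splits the privacy budget across many tables, and the resulting noise grows with the number of attributes. The abstract's NOCD method and its dynamic-programming parameter search are aimed at limiting that noise.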
Pages: 205-216
Number of pages: 12