Key-value data collection and statistical analysis with local differential privacy

被引:1
|
作者
Zhu, Hui [1 ]
Tang, Xiaohu [1 ]
Yang, Laurence Tianruo [2 ,3 ,4 ]
Fu, Chao [5 ]
Peng, Shuangrong [1 ]
机构
[1] Southwest Jiaotong Univ, Sch Informat Sci & Technol, Chengdu, Peoples R China
[2] Hainan Univ, Sch Comp Sci & Technol, Haikou, Peoples R China
[3] St Francis Xavier Univ, Dept Comp Sci, Antigonish, NS, Canada
[4] Huazhong Univ Sci & Technol, Sch Comp Sci & Technol, Wuhan, Peoples R China
[5] Southwest Jiaotong Univ, Sch Math, Chengdu, Peoples R China
基金
中国国家自然科学基金;
关键词
Key-value data; Local differential privacy; Mean estimation; Frequency estimation; RANGE QUERIES;
D O I
10.1016/j.ins.2023.119058
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The collection and statistical analysis of simple data types (e.g., categorical, numerical and multi-dimensional data) under local differential privacy has been widely studied. Recently, researchers have focused on the collection of the key-value data, which is one of the main types of NoSQL data model. In the collection and statistical analysis of key-value data under local differential privacy, the frequency and mean of each key must be estimated simultaneously. However, achieving a good utility-privacy tradeoff is difficult, because key-value data has inherent correlation, and some users may have different numbers of key-value pairs. In this paper, we propose an efficient sampling based scheme for collecting and analyzing key-value data. Note that the more valid data collected, the higher the accuracy of statistical data under the same disturbance level and disturbance algorithm. Therefore, we make full use of probability sampling and the inherent correlation of key-value data to improve the probability of users submitting valid key-value data. Moreover, we optimize the budget allocation on key-value data, so that the overall variance of frequency and mean estimation is close to optimal. Detailed theoretical analysis and experimental results show that the proposed scheme is superior to existing schemes in accuracy.
引用
收藏
页数:18
相关论文
共 50 条
  • [21] Trajectory Data Collection with Local Differential Privacy
    Zhang, Yuemin
    Ye, Qingqing
    Chen, Rui
    Hu, Haibo
    Han, Qilong
    arXiv, 2023,
  • [22] Trajectory Data Collection with Local Differential Privacy
    Zhang, Yuemin
    Ye, Qingqing
    Chen, Rui
    Hu, Haibo
    Han, Qilong
    PROCEEDINGS OF THE VLDB ENDOWMENT, 2023, 16 (10): : 2591 - 2604
  • [23] Protecting privacy in key-value search systems
    Xie, Yinglian
    Reiter, Michael K.
    O'Hallaron, David
    22ND ANNUAL COMPUTER SECURITY APPLICATIONS CONFERENCE, PROCEEDINGS, 2006, : 493 - +
  • [24] Collection scheme of location data based on local differential privacy
    Gao Z.
    Cui X.
    Du B.
    Zhou S.
    Yuan C.
    Li A.
    Qinghua Daxue Xuebao/Journal of Tsinghua University, 2019, 59 (01): : 23 - 27
  • [25] Application of Local Differential Privacy to Collection of Indoor Positioning Data
    Kim, Jong Wook
    Kim, Dae-Ho
    Jang, Beakcheol
    IEEE ACCESS, 2018, 6 : 4276 - 4286
  • [26] PCKV: Locally Differentially Private Correlated Key-Value Data Collection with Optimized Utility
    Gu, Xiaolan
    Li, Ming
    Cheng, Yueqiang
    Xiong, Li
    Cao, Yang
    PROCEEDINGS OF THE 29TH USENIX SECURITY SYMPOSIUM, 2020, : 967 - 984
  • [27] Adaptive personalized privacy-preserving data collection scheme with local differential privacy
    Song, Haina
    Shen, Hua
    Zhao, Nan
    He, Zhangqing
    Xiong, Wei
    Wu, Minghu
    Zhang, Mingwu
    JOURNAL OF KING SAUD UNIVERSITY-COMPUTER AND INFORMATION SCIENCES, 2024, 36 (04)
  • [28] Robust Data Sharing with Key-Value Stores
    Basescu, Cristina
    Cachin, Christian
    Eyal, Ittay
    Haas, Robert
    Sorniotti, Alessandro
    Vukolic, Marko
    Zachevsky, Ido
    2012 42ND ANNUAL IEEE/IFIP INTERNATIONAL CONFERENCE ON DEPENDABLE SYSTEMS AND NETWORKS (DSN), 2012,
  • [29] Longitudinal attacks against iterative data collection with local differential privacy
    Gursoy, Mehmet Emre
    TURKISH JOURNAL OF ELECTRICAL ENGINEERING AND COMPUTER SCIENCES, 2024, 32 (01) : 198 - 218
  • [30] Transaction Data Collection for Itemset Mining Under Local Differential Privacy
    Ouyang J.
    Yin J.
    Xiao Z.-H.
    Zhao H.-M.
    Liu S.-P.
    Liang P.
    Xiao Y.-Y.
    Ruan Jian Xue Bao/Journal of Software, 2021, 32 (11): : 3541 - 3562