Key-value data collection and statistical analysis with local differential privacy

被引:1
|
作者
Zhu, Hui [1 ]
Tang, Xiaohu [1 ]
Yang, Laurence Tianruo [2 ,3 ,4 ]
Fu, Chao [5 ]
Peng, Shuangrong [1 ]
机构
[1] Southwest Jiaotong Univ, Sch Informat Sci & Technol, Chengdu, Peoples R China
[2] Hainan Univ, Sch Comp Sci & Technol, Haikou, Peoples R China
[3] St Francis Xavier Univ, Dept Comp Sci, Antigonish, NS, Canada
[4] Huazhong Univ Sci & Technol, Sch Comp Sci & Technol, Wuhan, Peoples R China
[5] Southwest Jiaotong Univ, Sch Math, Chengdu, Peoples R China
基金
中国国家自然科学基金;
关键词
Key-value data; Local differential privacy; Mean estimation; Frequency estimation; RANGE QUERIES;
D O I
10.1016/j.ins.2023.119058
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The collection and statistical analysis of simple data types (e.g., categorical, numerical and multi-dimensional data) under local differential privacy has been widely studied. Recently, researchers have focused on the collection of the key-value data, which is one of the main types of NoSQL data model. In the collection and statistical analysis of key-value data under local differential privacy, the frequency and mean of each key must be estimated simultaneously. However, achieving a good utility-privacy tradeoff is difficult, because key-value data has inherent correlation, and some users may have different numbers of key-value pairs. In this paper, we propose an efficient sampling based scheme for collecting and analyzing key-value data. Note that the more valid data collected, the higher the accuracy of statistical data under the same disturbance level and disturbance algorithm. Therefore, we make full use of probability sampling and the inherent correlation of key-value data to improve the probability of users submitting valid key-value data. Moreover, we optimize the budget allocation on key-value data, so that the overall variance of frequency and mean estimation is close to optimal. Detailed theoretical analysis and experimental results show that the proposed scheme is superior to existing schemes in accuracy.
引用
收藏
页数:18
相关论文
共 50 条
  • [1] PrivKV: Key-Value Data Collection with Local Differential Privacy
    Ye, Qingqing
    Hu, Haibo
    Meng, Xiaofeng
    Zheng, Huadi
    2019 IEEE SYMPOSIUM ON SECURITY AND PRIVACY (SP 2019), 2019, : 317 - 331
  • [2] Key-Value Data Accurate Collection under Local Differential Privacy
    Zhang X.-J.
    Fu N.
    Meng X.-F.
    Jisuanji Xuebao/Chinese Journal of Computers, 2020, 43 (08): : 1479 - 1492
  • [3] KSKV: Key-Strategy for Key-Value Data Collection with Local Differential Privacy
    Zhao, Dan
    You, Yang
    Luo, Chuanwen
    Chen, Ting
    Liu, Yang
    CMES-COMPUTER MODELING IN ENGINEERING & SCIENCES, 2024, 139 (03): : 3063 - 3083
  • [4] Key-Value Data Collection with Distribution Estimation under Local Differential Privacy
    Li, Xiaoguang
    Yan, Haonan
    Zheng, Gewei
    Li, Hui
    Li, Fenghua
    Security and Communication Networks, 2022, 2022
  • [5] LHKV: A Key-Value Data Collection Mechanism Under Local Differential Privacy
    Xue, Weihao
    Sang, Yingpeng
    Tian, Hui
    DATABASE AND EXPERT SYSTEMS APPLICATIONS, DEXA 2023, PT I, 2023, 14146 : 228 - 242
  • [6] Key-Value Data Collection with Distribution Estimation under Local Differential Privacy
    Li, Xiaoguang
    Yan, Haonan
    Zheng, Gewei
    Li, Hui
    Li, Fenghua
    SECURITY AND COMMUNICATION NETWORKS, 2022, 2022
  • [7] Utility-Improved Key-Value Data Collection with Local Differential Privacy for Mobile Devices
    TONG Ze
    DENG Bowen
    ZHENG Lele
    ZHANG Tao
    ZTECommunications, 2022, 20 (04) : 15 - 21
  • [8] Poisoning Attacks to Local Differential Privacy Protocols for Key-Value Data
    Wu, Yongji
    Cao, Xiaoyu
    Jia, Jinyuan
    Gong, Neil Zhenqiang
    PROCEEDINGS OF THE 31ST USENIX SECURITY SYMPOSIUM, 2022, : 519 - 536
  • [9] PrivKVM*: Revisiting Key-Value Statistics Estimation With Local Differential Privacy
    Ye, Qingqing
    Hu, Haibo
    Meng, Xiaofeng
    Zheng, Huadi
    Huang, Kai
    Fang, Chengfang
    Shi, Jie
    IEEE TRANSACTIONS ON DEPENDABLE AND SECURE COMPUTING, 2023, 20 (01) : 17 - 35
  • [10] Local Differential Privacy Protocol for Making Key-Value Data Robust Against Poisoning Attacks
    Horigome, Hikaru
    Kikuchi, Hiroaki
    Yu, Chia-Mu
    MODELING DECISIONS FOR ARTIFICIAL INTELLIGENCE, MDAI 2023, 2023, 13890 : 241 - 252