Collecting and Analyzing Multidimensional Data with Local Differential Privacy

被引:231
|
作者
Wang, Ning [1 ]
Xiao, Xiaokui [2 ]
Yang, Yin [3 ]
Zhao, Jun [4 ]
Hui, Siu Cheung [4 ]
Shin, Hyejin [5 ]
Shin, Junbum [5 ]
Yu, Ge [6 ]
机构
[1] Ocean Univ China, Sch Informat Sci & Engn, Qingdao, Shandong, Peoples R China
[2] Natl Univ Singapore, Sch Comp, Singapore, Singapore
[3] Hamad Bin Khalifa Univ, Div Informat & Comp Techol, Coll Sci & Engn, Doha, Qatar
[4] Nanyang Technol Univ, Sch Comp Sci & Engn, Singapore, Singapore
[5] Samsung Elect, Samsung Res, Seoul, South Korea
[6] Northeastern Univ, Sch Comp Sci & Engn, Shenyang, Liaoning, Peoples R China
基金
新加坡国家研究基金会; 中国国家自然科学基金;
关键词
Local differential privacy; multidimensional data; stochastic gradient descent; NOISE;
D O I
10.1109/ICDE.2019.00063
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Local differential privacy (LDP) is a recently proposed privacy standard for collecting and analyzing data, which has been used, e.g., in the Chrome browser, iOS and macOS. In LDP, each user perturbs her information locally, and only sends the randomized version to an aggregator who performs analyses, which protects both the users and the aggregator against private information leaks. Although LDP has attracted much research attention in recent years, the majority of existing work focuses on applying LDP to complex data and/or analysis tasks. In this paper, we point out that the fundamental problem of collecting multidimensional data under LDP has not been addressed sufficiently, and there remains much room for improvement even for basic tasks such as computing the mean value over a single numeric attribute under LDP. Motivated by this, we first propose novel LDP mechanisms for collecting a numeric attribute, whose accuracy is at least no worse (and usually better) than existing solutions in terms of worst-case noise variance. Then, we extend these mechanisms to multidimensional data that can contain both numeric and categorical attributes, where our mechanisms always outperform existing solutions regarding worst-case noise variance. As a case study, we apply our solutions to build an LDP-compliant stochastic gradient descent algorithm (SGD), which powers many important machine learning tasks. Experiments using real datasets confirm the effectiveness of our methods, and their advantages over existing solutions.
引用
收藏
页码:638 / 649
页数:12
相关论文
共 50 条
  • [1] On the Risks of Collecting Multidimensional Data Under Local Differential Privacy
    Arcolezi, Heber H.
    Gambs, Sebastien
    Couchot, Jean-Francois
    Palamidessi, Catuscia
    PROCEEDINGS OF THE VLDB ENDOWMENT, 2023, 16 (05): : 1126 - 1139
  • [2] Collecting Multidimensional Numerical Data and Estimating Mean with Personalized Local Differential Privacy
    Zeng, Yufeng
    Long, Shigong
    Liu, Hai
    Ding, Hongfa
    SECURITY AND COMMUNICATION NETWORKS, 2022, 2022
  • [3] Collecting Partial Ordered Data With Local Differential Privacy
    Huang, Yaxuan
    Xue, Kaiping
    Zhu, Bin
    Zhao, Jingcheng
    Li, Ruidong
    IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, 2024, 19 : 7646 - 7658
  • [4] Collecting Spatial Data Under Local Differential Privacy
    Ye, Yutong
    Zhang, Min
    Feng, Dengguo
    2021 17TH INTERNATIONAL CONFERENCE ON MOBILITY, SENSING AND NETWORKING (MSN 2021), 2021, : 120 - 127
  • [5] Collecting Geospatial Data with Local Differential Privacy for Personalized Services
    Hong, Daeyoung
    Jung, Woohwan
    Shim, Kyuseok
    2021 IEEE 37TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE 2021), 2021, : 2237 - 2242
  • [6] PLDP: Personalized Local Differential Privacy for Multidimensional Data Aggregation
    Shen, Zixuan
    Xia, Zhihua
    Yu, Peipeng
    SECURITY AND COMMUNICATION NETWORKS, 2021, 2021
  • [7] Collecting and analyzing key-value data under shuffled differential privacy
    Wang, Ning
    Zheng, Wei
    Wang, Zhigang
    Wei, Zhiqiang
    Gu, Yu
    Tang, Peng
    Yu, Ge
    FRONTIERS OF COMPUTER SCIENCE, 2023, 17 (02)
  • [8] Collecting and analyzing key-value data under shuffled differential privacy
    Ning Wang
    Wei Zheng
    Zhigang Wang
    Zhiqiang Wei
    Yu Gu
    Peng Tang
    Ge Yu
    Frontiers of Computer Science, 2023, 17
  • [9] Collecting and analyzing key-value data under shuffled differential privacy
    Ning WANG
    Wei ZHENG
    Zhigang WANG
    Zhiqiang WEI
    Yu GU
    Peng TANG
    Ge YU
    Frontiers of Computer Science, 2023, 17 (02) : 237 - 239
  • [10] PrivAG: Analyzing Attributed Graph Data with Local Differential Privacy
    Liu, Zichun
    Huang, Liusheng
    Xu, Hongli
    Yang, Wei
    Wang, Shaowei
    2020 IEEE 26TH INTERNATIONAL CONFERENCE ON PARALLEL AND DISTRIBUTED SYSTEMS (ICPADS), 2020, : 422 - 429