Collecting and Analyzing Multidimensional Data with Local Differential Privacy

被引:231
|
作者
Wang, Ning [1 ]
Xiao, Xiaokui [2 ]
Yang, Yin [3 ]
Zhao, Jun [4 ]
Hui, Siu Cheung [4 ]
Shin, Hyejin [5 ]
Shin, Junbum [5 ]
Yu, Ge [6 ]
机构
[1] Ocean Univ China, Sch Informat Sci & Engn, Qingdao, Shandong, Peoples R China
[2] Natl Univ Singapore, Sch Comp, Singapore, Singapore
[3] Hamad Bin Khalifa Univ, Div Informat & Comp Techol, Coll Sci & Engn, Doha, Qatar
[4] Nanyang Technol Univ, Sch Comp Sci & Engn, Singapore, Singapore
[5] Samsung Elect, Samsung Res, Seoul, South Korea
[6] Northeastern Univ, Sch Comp Sci & Engn, Shenyang, Liaoning, Peoples R China
基金
新加坡国家研究基金会; 中国国家自然科学基金;
关键词
Local differential privacy; multidimensional data; stochastic gradient descent; NOISE;
D O I
10.1109/ICDE.2019.00063
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Local differential privacy (LDP) is a recently proposed privacy standard for collecting and analyzing data, which has been used, e.g., in the Chrome browser, iOS and macOS. In LDP, each user perturbs her information locally, and only sends the randomized version to an aggregator who performs analyses, which protects both the users and the aggregator against private information leaks. Although LDP has attracted much research attention in recent years, the majority of existing work focuses on applying LDP to complex data and/or analysis tasks. In this paper, we point out that the fundamental problem of collecting multidimensional data under LDP has not been addressed sufficiently, and there remains much room for improvement even for basic tasks such as computing the mean value over a single numeric attribute under LDP. Motivated by this, we first propose novel LDP mechanisms for collecting a numeric attribute, whose accuracy is at least no worse (and usually better) than existing solutions in terms of worst-case noise variance. Then, we extend these mechanisms to multidimensional data that can contain both numeric and categorical attributes, where our mechanisms always outperform existing solutions regarding worst-case noise variance. As a case study, we apply our solutions to build an LDP-compliant stochastic gradient descent algorithm (SGD), which powers many important machine learning tasks. Experiments using real datasets confirm the effectiveness of our methods, and their advantages over existing solutions.
引用
收藏
页码:638 / 649
页数:12
相关论文
共 50 条
  • [21] Local Differential Privacy for Evolving Data
    Joseph, Matthew
    Roth, Aaron
    Ullman, Jonathan
    Waggoner, Bo
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 31 (NIPS 2018), 2018, 31
  • [22] HyperMean: Effective Multidimensional Mean Estimation with Local Differential Privacy
    Zhang, Tao
    Deng, Bowen
    Zheng, Lele
    Tong, Ze
    Li, Qi
    2022 IEEE INTERNATIONAL CONFERENCE ON TRUST, SECURITY AND PRIVACY IN COMPUTING AND COMMUNICATIONS, TRUSTCOM, 2022, : 615 - 622
  • [23] Multidimensional grid-based clustering with local differential privacy
    Fu, Nan
    Ni, Weiwei
    Hu, Haibo
    Zhang, Sen
    INFORMATION SCIENCES, 2023, 623 : 402 - 420
  • [24] Statistics Collecting Algorithms of Distributed Graph via Local Differential Privacy
    Fu, Peiwang
    Ding, Hongfa
    Liu, Hai
    Jiang, Heling
    Tang, Mingli
    Yu, Yingying
    Jisuanji Yanjiu yu Fazhan/Computer Research and Development, 2024, 61 (07): : 1643 - 1669
  • [25] AsgLDP: Collecting and Generating Decentralized Attributed Graphs With Local Differential Privacy
    Wei, Chengkun
    Ji, Shouling
    Liu, Changchang
    Chen, Wenzhi
    Wang, Ting
    IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, 2020, 15 : 3239 - 3254
  • [26] Collecting and analyzing data in multidimensional scaling experiments: A guide for psychologists using SPSS
    Giguere, Gyslain
    TUTORIALS IN QUANTITATIVE METHODS FOR PSYCHOLOGY, 2006, 2 (01) : 26 - 37
  • [27] Privacy preserving classification on local differential privacy in data centers
    Fan, Weibei
    He, Jing
    Guo, Mengjiao
    Li, Peng
    Han, Zhijie
    Wang, Ruchuan
    JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 2020, 135 (135) : 70 - 82
  • [28] Trajectory Data Collection with Local Differential Privacy
    Zhang, Yuemin
    Ye, Qingqing
    Chen, Rui
    Hu, Haibo
    Han, Qilong
    arXiv, 2023,
  • [29] Local Differential Privacy for data collection and analysis
    Wang, Teng
    Zhao, Jun
    Hu, Zhi
    Yang, Xinyu
    Ren, Xuebin
    Lam, Kwok-Yan
    Neurocomputing, 2021, 426 : 114 - 133
  • [30] Local Differential Privacy for data collection and analysis
    Wang, Teng
    Zhao, Jun
    Hu, Zhi
    Yang, Xinyu
    Ren, Xuebin
    Lam, Kwok-Yan
    NEUROCOMPUTING, 2021, 426 : 114 - 133