Collecting and Analyzing Multidimensional Data with Local Differential Privacy

被引:231
|
作者
Wang, Ning [1 ]
Xiao, Xiaokui [2 ]
Yang, Yin [3 ]
Zhao, Jun [4 ]
Hui, Siu Cheung [4 ]
Shin, Hyejin [5 ]
Shin, Junbum [5 ]
Yu, Ge [6 ]
机构
[1] Ocean Univ China, Sch Informat Sci & Engn, Qingdao, Shandong, Peoples R China
[2] Natl Univ Singapore, Sch Comp, Singapore, Singapore
[3] Hamad Bin Khalifa Univ, Div Informat & Comp Techol, Coll Sci & Engn, Doha, Qatar
[4] Nanyang Technol Univ, Sch Comp Sci & Engn, Singapore, Singapore
[5] Samsung Elect, Samsung Res, Seoul, South Korea
[6] Northeastern Univ, Sch Comp Sci & Engn, Shenyang, Liaoning, Peoples R China
基金
新加坡国家研究基金会; 中国国家自然科学基金;
关键词
Local differential privacy; multidimensional data; stochastic gradient descent; NOISE;
D O I
10.1109/ICDE.2019.00063
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Local differential privacy (LDP) is a recently proposed privacy standard for collecting and analyzing data, which has been used, e.g., in the Chrome browser, iOS and macOS. In LDP, each user perturbs her information locally, and only sends the randomized version to an aggregator who performs analyses, which protects both the users and the aggregator against private information leaks. Although LDP has attracted much research attention in recent years, the majority of existing work focuses on applying LDP to complex data and/or analysis tasks. In this paper, we point out that the fundamental problem of collecting multidimensional data under LDP has not been addressed sufficiently, and there remains much room for improvement even for basic tasks such as computing the mean value over a single numeric attribute under LDP. Motivated by this, we first propose novel LDP mechanisms for collecting a numeric attribute, whose accuracy is at least no worse (and usually better) than existing solutions in terms of worst-case noise variance. Then, we extend these mechanisms to multidimensional data that can contain both numeric and categorical attributes, where our mechanisms always outperform existing solutions regarding worst-case noise variance. As a case study, we apply our solutions to build an LDP-compliant stochastic gradient descent algorithm (SGD), which powers many important machine learning tasks. Experiments using real datasets confirm the effectiveness of our methods, and their advantages over existing solutions.
引用
收藏
页码:638 / 649
页数:12
相关论文
共 50 条
  • [31] Trajectory Data Collection with Local Differential Privacy
    Zhang, Yuemin
    Ye, Qingqing
    Chen, Rui
    Hu, Haibo
    Han, Qilong
    PROCEEDINGS OF THE VLDB ENDOWMENT, 2023, 16 (10): : 2591 - 2604
  • [32] Multidimensional categorical data collection under shuffled differential privacy
    Wang, Ning
    Zhuang, Jian
    Wang, Zhigang
    Wei, Zhiqiang
    Gu, Yu
    Tang, Peng
    Yu, Ge
    COMPUTERS & SECURITY, 2025, 151
  • [33] Data: Planning, Collecting, and Analyzing
    Evans, Kevin D.
    Xu, Menglin
    JOURNAL OF DIAGNOSTIC MEDICAL SONOGRAPHY, 2022,
  • [34] Local differential privacy protection for wearable device data
    Li, Zhangbing
    Wang, Baichuan
    Li, Jinsheng
    Hua, Yi
    Zhang, Shaobo
    PLOS ONE, 2022, 17 (08):
  • [35] Mobile Data Collection and Analysis with Local Differential Privacy
    Li, Ninghui
    Ye, Qingqing
    2019 20TH INTERNATIONAL CONFERENCE ON MOBILE DATA MANAGEMENT (MDM 2019), 2019, : 4 - 7
  • [36] Data Poisoning Attacks to Local Differential Privacy Protocols
    Cao, Xiaoyu
    Jia, Jinyuan
    Gong, Neil Zhenqiang
    PROCEEDINGS OF THE 30TH USENIX SECURITY SYMPOSIUM, 2021, : 947 - 964
  • [37] A Data Synthesis Approach Based on Local Differential Privacy
    Wang, Zhihui
    Liu, Yishan
    Ni, Yuliang
    WEB AND BIG DATA, APWEB-WAIM 2024, PT IV, 2024, 14964 : 137 - 151
  • [38] IOT-DETECTIVE: Analyzing IoT Data Under Differential Privacy
    Ghayyur, Sameera
    Chen, Yan
    Yus, Roberto
    Machanavajjhala, Ashwin
    Hay, Michael
    Miklau, Gerome
    Mehrotra, Sharad
    SIGMOD'18: PROCEEDINGS OF THE 2018 INTERNATIONAL CONFERENCE ON MANAGEMENT OF DATA, 2018, : 1725 - 1728
  • [39] Privacy-preserving mechanism for mixed data clustering with local differential privacy
    Yuan, Liujie
    Zhang, Shaobo
    Zhu, Gengming
    Alinani, Karim
    CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2023, 35 (19):
  • [40] Analyzing Subgraph Statistics from Extended Local Views with Decentralized Differential Privacy
    Sun, Haipei
    Xiao, Xiaokui
    Khalil, Issa
    Yang, Yin
    Qin, Zhan
    Wang, Hui
    Yu, Ting
    PROCEEDINGS OF THE 2019 ACM SIGSAC CONFERENCE ON COMPUTER AND COMMUNICATIONS SECURITY (CCS'19), 2019, : 703 - 717