CGM: An Enhanced Mechanism for Streaming Data Collection with Local Differential Privacy

被引:18
|
作者
Bao, Ergute [1 ]
Yang, Yin [2 ]
Xiao, Xiaokui [1 ]
Ding, Bolin [3 ]
机构
[1] Natl Univ Singapore, Singapore, Singapore
[2] Hamad Bin Khalifa Univ, Ar Rayyan, Qatar
[3] Alibaba Grp, Hangzhou, Peoples R China
来源
PROCEEDINGS OF THE VLDB ENDOWMENT | 2021年 / 14卷 / 11期
基金
新加坡国家研究基金会;
关键词
D O I
10.14778/3476249.3476277
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Local differential privacy (LDP) is a well-established privacy protection scheme for collecting sensitive data, which has been integrated into major platforms such as iOS, Chrome, and Windows. The main idea is that each individual randomly perturbs her data on her local device, and only uploads the noisy version to an untrusted data aggregator. This paper focuses on the collection of streaming data consisting of regular updates, e.g., daily app usage. Such streams, when aggregated over a large population, often exhibit strong autocorrelations, e.g., the average usage of an app usually does not change dramatically from one day to the next. To our knowledge, this property has been largely neglected in existing LDP mechanisms. Consequently, data collected with current LDP methods often exhibit unrealistically violent fluctuations due to the added noise, drowning the overall trend, as shown in our experiments. This paper proposes a novel correlated Gaussian mechanism (CGM) for enforcing (epsilon,delta)-LDP on streaming data collection, which reduces noise by exploiting public-known autocorrelation patterns of the aggregated data. This is done through non-trivial modifications to the core of the underlying Gaussian Mechanism; in particular, CGM injects temporally correlated noise, computed through an optimization program that takes into account the given autocorrelation pattern, data value range, and utility metric. CGM comes with formal proof of correctness, and consumes negligible computational resources. Extensive experiments using real datasets from different application domains demonstrate that CGM achieves consistent and significant utility gains compared to the baseline method of repeatedly running the underlying one-shot LDP mechanism.
引用
收藏
页码:2258 / 2270
页数:13
相关论文
共 50 条
  • [21] Secure and Utility-Aware Data Collection with Condensed Local Differential Privacy
    Gursoy, Mehmet Emre
    Tamersoy, Acar
    Truex, Stacey
    Wei, Wenqi
    Liu, Ling
    IEEE TRANSACTIONS ON DEPENDABLE AND SECURE COMPUTING, 2021, 18 (05) : 2365 - 2378
  • [22] Multiple Privacy Regimes Mechanism for Local Differential Privacy
    Ye, Yutong
    Zhang, Min
    Feng, Dengguo
    Li, Hao
    Chi, Jialin
    DATABASE SYSTEMS FOR ADVANCED APPLICATIONS (DASFAA 2019), PT II, 2019, 11447 : 247 - 263
  • [23] Local differential privacy federated learning based on heterogeneous data multi-privacy mechanism
    Wang, Jie
    Zhang, Zhiju
    Tian, Jing
    Li, Hongtao
    COMPUTER NETWORKS, 2024, 254
  • [24] Workload-Aware Indoor Positioning Data Collection via Local Differential Privacy
    Kim, Jong Wook
    Jang, Beakcheol
    IEEE COMMUNICATIONS LETTERS, 2019, 23 (08) : 1352 - 1356
  • [25] Key-Value Data Collection with Distribution Estimation under Local Differential Privacy
    Li, Xiaoguang
    Yan, Haonan
    Zheng, Gewei
    Li, Hui
    Li, Fenghua
    Security and Communication Networks, 2022, 2022
  • [26] Set-valued data collection with local differential privacy based on category hierarchy
    Ouyang, Jia
    Xiao, Yinyin
    Liu, Shaopeng
    Xiao, Zhenghong
    Liao, Xiuxiu
    MATHEMATICAL BIOSCIENCES AND ENGINEERING, 2021, 18 (03) : 2733 - 2763
  • [27] Secure Medical Data Collection in the Internet of Medical Things Based on Local Differential Privacy
    Wang, Jinpeng
    Li, Xiaohui
    ELECTRONICS, 2023, 12 (02)
  • [28] Key-Value Data Collection with Distribution Estimation under Local Differential Privacy
    Li, Xiaoguang
    Yan, Haonan
    Zheng, Gewei
    Li, Hui
    Li, Fenghua
    SECURITY AND COMMUNICATION NETWORKS, 2022, 2022
  • [29] Local Differential Privacy for Evolving Data
    Joseph, Matthew
    Roth, Aaron
    Ullman, Jonathan
    Waggoner, Bo
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 31 (NIPS 2018), 2018, 31
  • [30] Privacy Enhanced Matrix Factorization for Recommendation with Local Differential Privacy
    Shin, Hyejin
    Kim, Sungwook
    Shin, Junbum
    Xiao, Xiaokui
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2018, 30 (09) : 1770 - 1782