DPI: Ensuring Strict Differential Privacy for Infinite Data Streaming

被引:0
|
作者
Feng, Shuya [1 ]
Mohammadyt, Meisam [2 ]
Wang, Han [3 ]
Li, Xiaochen [4 ]
Qin, Zhan [4 ]
Hong, Yuan [1 ]
机构
[1] Univ Connecticut, Storrs, CT 06269 USA
[2] Iowa State Univ, Ames, IA 50011 USA
[3] Univ Kansas, Lawrence, KS 66045 USA
[4] Zhejiang Univ, Hangzhou, Peoples R China
基金
中国国家自然科学基金; 美国国家科学基金会;
关键词
QUERIES;
D O I
10.1109/SP54263.2024.00124
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Streaming data, crucial for applications like crowd-sourcing analytics, behavior studies, and real-time monitoring, faces significant privacy risks due to the large and diverse data linked to individuals. In particular, recent efforts to release data streams, using the rigorous privacy notion of differential privacy (DP), have encountered issues with unbounded privacy leakage. This challenge limits their applicability to only a finite number of time slots ("finite data stream") or relaxation to protecting the events ("event or w-event DP") rather than all the records of users. A persistent challenge is managing the sensitivity of outputs to inputs in situations where users contribute many activities and data distributions evolve over time. In this paper, we present a novel technique for Differentially Private data streaming over Infinite disclosure (DPI) that effectively bounds the total privacy leakage of each user in infinite data streams while enabling accurate data collection and analysis. Furthermore, we also maximize the accuracy of DPI via a novel boosting mechanism. Finally, extensive experiments across various streaming applications and real datasets (e.g., COVID-19, Network Traffic, and USDA Production), show that DPI maintains high utility for infinite data streams in diverse settings. Code for DPI is available at https://github.com/ShuyaFeng/DPI.
引用
收藏
页码:1009 / 1027
页数:19
相关论文
共 50 条
  • [31] Big Data Privacy Based On Differential Privacy a Hope for Big Data
    Shrivastva, Krishna Mohan Pd
    Rizvi, M. A.
    Singh, Shailendra
    2014 6TH INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND COMMUNICATION NETWORKS, 2014, : 776 - 781
  • [32] Temporally Discounted Differential Privacy for Evolving Datasets on an Infinite Horizon
    Farokhi, Farhad
    2020 ACM/IEEE 11TH INTERNATIONAL CONFERENCE ON CYBER-PHYSICAL SYSTEMS (ICCPS 2020), 2020, : 1 - 8
  • [33] Influence of data errors on differential privacy
    Tao Wang
    Zhengquan Xu
    Dong Wang
    Hao Wang
    Cluster Computing, 2019, 22 : 2739 - 2746
  • [34] Local Differential Privacy for Evolving Data
    Joseph, Matthew
    Roth, Aaron
    Ullman, Jonathan
    Waggoner, Bo
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 31 (NIPS 2018), 2018, 31
  • [35] Distributed Data Mining with Differential Privacy
    Zhang, Ning
    Li, Ming
    Lou, Wenjing
    2011 IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS (ICC), 2011,
  • [36] PrivateClean: Data Cleaning and Differential Privacy
    Krishnan, Sanjay
    Wang, Jiannan
    Franklin, Michael J.
    Goldberg, Ken
    Kraska, Tim
    SIGMOD'16: PROCEEDINGS OF THE 2016 INTERNATIONAL CONFERENCE ON MANAGEMENT OF DATA, 2016, : 937 - 951
  • [37] Differential Privacy in Networked Data Collection
    Javidbakht, Omid
    Venkitasubramaniam, Mary
    2016 ANNUAL CONFERENCE ON INFORMATION SCIENCE AND SYSTEMS (CISS), 2016,
  • [38] The Use of Differential Privacy for Energy Data
    Scaglione, Anna
    CPSS'22: PROCEEDINGS OF THE 8TH ACM CYBER-PHYSICAL SYSTEM SECURITY WORKSHOP, 2022, : 1 - 1
  • [39] Differential Privacy and Federal Data Releases
    Reiter, Jerome P.
    ANNUAL REVIEW OF STATISTICS AND ITS APPLICATION, VOL 6, 2019, 6 : 85 - 101
  • [40] Differential Privacy for Functions and Functional Data
    Hall, Rob
    Rinaldo, Alessandro
    Wasserman, Larry
    JOURNAL OF MACHINE LEARNING RESEARCH, 2013, 14 : 703 - 727