PCOR: Private Contextual Outlier Release via Differentially Private Search

Citations: 1
Authors
Shafieinejad, Masoumeh [1]
Kerschbaum, Florian [1]
Ilyas, Ihab F. [1]
Affiliations
[1] Univ Waterloo, Waterloo, ON, Canada
Source
SIGMOD '21: Proceedings of the 2021 International Conference on Management of Data | 2021
Funding
Natural Sciences and Engineering Research Council of Canada (NSERC)
Keywords
differential privacy; contextual outlier detection; graph search; private sampling
DOI
10.1145/3448016.3452812
Chinese Library Classification
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract
Outlier detection plays a significant role in real-world applications such as intrusion, malfunction, and fraud detection. Traditionally, outlier detection techniques are applied to find outliers in the context of the whole dataset. However, this practice neglects contextual outliers: data points that are not outliers in the whole dataset but are outliers in some specific neighborhoods. Contextual outliers are particularly important in data exploration and in targeted anomaly explanation and diagnosis. In these scenarios, the data owner computes the following information: (i) the attributes that contribute to the abnormality of an outlier (metric), (ii) a contextual description of the outlier's neighborhoods (context), and (iii) the utility score of the context, e.g., its strength in showing the outlier's significance or its relation to a particular explanation for the outlier. However, revealing the outlier's context also leaks information about other individuals in the population, violating their privacy. We address the issue of population privacy violations in this paper. There are two main challenges in defining and applying privacy in contextual outlier release. In this setting, the data owner is required to release a valid context for the queried record, i.e., a context in which the record is an outlier. Hence, the first major challenge is that the privacy technique must preserve the validity of the context for each record. To satisfy this requirement, we propose techniques that protect the privacy of individuals through a relaxed notion of differential privacy. The second major challenge is applying the proposed techniques efficiently, as they impose intensive computation on the base algorithm. To overcome this challenge, we propose a graph structure onto which the contexts are mapped, and introduce differentially private graph search algorithms as efficient solutions to the computation problem caused by differential privacy techniques.
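As an illustration of the terms used in the abstract, the following minimal Python sketch shows a record that is not an outlier in the whole dataset but is an outlier in narrower contexts, and then picks one valid context at random with the standard exponential mechanism. It is not the paper's algorithm: the toy data, the z-score detector, the threshold of 2.0, the use of context population size as the utility score, and all function names are assumptions made for this example, and the sketch reproduces neither the paper's relaxed privacy definition nor its graph search.

```python
import math
import random
from statistics import mean, pstdev

# Hypothetical toy data: (department, level, salary in thousands).
RECORDS = [
    ("eng", "junior", 60), ("eng", "junior", 62), ("eng", "junior", 61),
    ("eng", "junior", 59), ("eng", "junior", 63), ("eng", "junior", 95),
    ("eng", "senior", 90), ("eng", "senior", 100), ("eng", "senior", 110),
    ("ops", "junior", 70), ("ops", "junior", 85),
    ("ops", "senior", 120), ("ops", "senior", 105),
]
TARGET = ("eng", "junior", 95)  # the queried record
Z_THRESHOLD = 2.0               # assumed outlier threshold


def population(context):
    """Records matching a context; None acts as a wildcard for an attribute."""
    dept, level = context
    return [r for r in RECORDS
            if dept in (None, r[0]) and level in (None, r[1])]


def is_outlier_in(context, target=TARGET):
    """Simple z-score test of the target's salary within the context."""
    salaries = [r[2] for r in population(context)]
    sd = pstdev(salaries)
    return sd > 0 and abs(target[2] - mean(salaries)) / sd > Z_THRESHOLD


def candidate_contexts(target=TARGET):
    """All contexts containing the target: each attribute fixed or wildcard."""
    return [(d, l) for d in (None, target[0]) for l in (None, target[1])]


def valid_contexts(target=TARGET):
    """Contexts in which the target is an outlier."""
    return [c for c in candidate_contexts(target) if is_outlier_in(c, target)]


def sample_context(epsilon, rng, target=TARGET):
    """Exponential mechanism over valid contexts.

    Utility is the context's population size (an assumption), which changes
    by at most 1 when one record is added or removed, so sensitivity is 1.
    """
    contexts = valid_contexts(target)
    if not contexts:
        raise ValueError("target is not an outlier in any candidate context")
    weights = [math.exp(epsilon * len(population(c)) / 2.0) for c in contexts]
    return rng.choices(contexts, weights=weights, k=1)[0]


if __name__ == "__main__":
    print("outlier in whole dataset:", is_outlier_in((None, None)))
    print("valid contexts:", valid_contexts())
    print("released context:", sample_context(1.0, random.Random(0)))
```

Enumerating every candidate context, as done here, grows exponentially with the number of attributes, which is the kind of computational cost the abstract's second challenge refers to.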
Pages: 1571-1583
Page count: 13