PCOR: Private Contextual Outlier Release via Differentially Private Search

被引:1
|
作者
Shafieinejad, Masoumeh [1 ]
Kerschbaum, Florian [1 ]
Ilyas, Ihab F. [1 ]
机构
[1] Univ Waterloo, Waterloo, ON, Canada
来源
SIGMOD '21: PROCEEDINGS OF THE 2021 INTERNATIONAL CONFERENCE ON MANAGEMENT OF DATA | 2021年
基金
加拿大自然科学与工程研究理事会;
关键词
differential privacy; contextual outlier detection; graph search; private sampling;
D O I
10.1145/3448016.3452812
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Outlier detection plays a significant role in various real world applications such as intrusion, malfunction, and fraud detection. Traditionally, outlier detection techniques are applied to find outliers in the context of the whole dataset. However, this practice neglects the data points, namely contextual outliers, that are not outliers in the whole dataset but in some specific neighborhoods. Contextual outliers are particularly important in data exploration and targeted anomaly explanation and diagnosis. In these scenarios, the data owner computes the following information: i) The attributes that contribute to the abnormality of an outlier (metric), ii) Contextual description of the outlier's neighborhoods (context), and iii) The utility score of the context, e.g. its strength in showing the outlier's significance, or in relation to a particular explanation for the outlier. However, revealing the outlier's context leaks information about the other individuals in the population as well, violating their privacy. We address the issue of population privacy violations in this paper. There are two main challenges in defining and applying privacy in contextual outlier release. In this setting, the data owner is required to release a valid context for the queried record, i.e. a context in which the record is an outlier. Hence, the first major challenge is that the privacy technique must preserve the validity of the context for each record. We propose techniques to protect the privacy of individuals through a relaxed notion of differential privacy to satisfy this requirement. The second major challenge is applying the proposed techniques efficiently, as they impose intensive computation to the base algorithm. To overcome this challenge, we propose a graph structure to map the contexts to, and introduce differentially private graph search algorithms as efficient solutions for the computation problem caused by differential privacy techniques.
引用
收藏
页码:1571 / 1583
页数:13
相关论文
共 50 条
  • [41] DIFFERENTIALLY PRIVATE VARIABLE SELECTION VIA THE KNOCKOFF FILTER
    Pournaderi, Mehrdad
    Xiang, Yu
    2021 IEEE 31ST INTERNATIONAL WORKSHOP ON MACHINE LEARNING FOR SIGNAL PROCESSING (MLSP), 2021,
  • [42] Differentially Private Release of Heterogeneous Network for Managing Healthcare Data
    Khokhar, Rashid Hussain
    Fung, Benjamin C. M.
    Iqbal, Farkhund
    Al-Hussaeni, Khalil
    Hussain, Mohammed
    ACM TRANSACTIONS ON KNOWLEDGE DISCOVERY FROM DATA, 2023, 17 (06)
  • [43] PriView: Practical Differentially Private Release of Marginal Contingency Tables
    Qardaji, Wahbeh
    Yang, Weinin
    Li, Ninghui
    SIGMOD'14: PROCEEDINGS OF THE 2014 ACM SIGMOD INTERNATIONAL CONFERENCE ON MANAGEMENT OF DATA, 2014, : 1435 - 1446
  • [44] An optimal differentially private data release mechanism with constrained error
    Hao Wang
    Zhengquan Xu
    Xiaoshan Zhang
    Xiao Peng
    Kaiju Li
    Frontiers of Computer Science, 2022, 16
  • [45] An optimal differentially private data release mechanism with constrained error
    Hao WANG
    Zhengquan XU
    Xiaoshan ZHANG
    Xiao PENG
    Kaiju LI
    Frontiers of Computer Science, 2022, 16 (01) : 179 - 181
  • [46] Differentially Private Small Dataset Release Using Random Projections
    Gondara, Lovedeep
    Wang, Ke
    CONFERENCE ON UNCERTAINTY IN ARTIFICIAL INTELLIGENCE (UAI 2020), 2020, 124 : 639 - 648
  • [47] Differentially Private Real-Time Release of Sequential Data
    Zhang, Xueru
    Khalili, Mohammad Mahdi
    Liu, Mingyan
    ACM TRANSACTIONS ON PRIVACY AND SECURITY, 2023, 26 (01)
  • [48] DPCube: Differentially Private Histogram Release through Multidimensional Partitioning
    Xiao, Yonghui
    Xiong, Li
    Fan, Liyue
    Goryczka, Slawomir
    Li, Haoran
    TRANSACTIONS ON DATA PRIVACY, 2014, 7 (03) : 195 - 222
  • [49] An optimal differentially private data release mechanism with constrained error
    Wang, Hao
    Xu, Zhengquan
    Zhang, Xiaoshan
    Peng, Xiao
    Li, Kaiju
    FRONTIERS OF COMPUTER SCIENCE, 2022, 16 (01)
  • [50] Incremental release of differentially-private check-in data
    Riboni, Daniele
    Bettini, Claudio
    PERVASIVE AND MOBILE COMPUTING, 2015, 16 : 220 - 238