EPiC: efficient privacy-preserving counting for MapReduce

被引:5
|
作者
Triet Dang Vo-Huu [1 ]
Blass, Erik-Oliver [2 ]
Noubir, Guevara [1 ]
机构
[1] Northeastern Univ, Boston, MA 02115 USA
[2] Airbus Grp Innovat, D-81663 Munich, Germany
基金
美国国家科学基金会;
关键词
Privacy-preserving; MapReduce; Somewhat homomorphic encryption; FULLY HOMOMORPHIC ENCRYPTION;
D O I
10.1007/s00607-018-0634-5
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
In the face of an untrusted cloud infrastructure, outsourced data needs to be protected. We present EPiC, a practical protocol for the privacy-preserving evaluation of a fundamental operation on data sets: frequency counting. In an encrypted outsourced data set, a cloud user can specify a pattern, and the cloud will count the number of occurrences of this pattern in an oblivious manner. A pattern is expressed as a Boolean formula on the fields of data records and can specify values counting, value comparison, range counting, and conjunctions/disjunctions of field values. We show how a general pattern, defined by a Boolean formula, is arithmetized into a multivariate polynomial and used in EPiC. To increase the performance of the system, we introduce a new privacy-preserving encoding with "somewhat homomorphic" properties. The encoding is highly efficient in our particular counting scenario. Besides a formal analysis where we prove EPiC 's privacy, we also present implementation and evaluation results. We specifically target Google's prominent MapReduce paradigm as offered by major cloud providers. Our evaluation performed both locally and in Amazon's public cloud with up to 1 TByte data sets shows only a modest overhead of compared to non-private counting, attesting to EPiC 's efficiency.
引用
收藏
页码:1265 / 1286
页数:22
相关论文
共 50 条
  • [31] Efficient Privacy-Preserving Approaches for Trajectory Datasets
    Hassan, Md Yeakub
    Saha, Ullash
    Mohammed, Noman
    Durocher, Stephane
    Miller, Avery
    2020 IEEE INTL CONF ON DEPENDABLE, AUTONOMIC AND SECURE COMPUTING, INTL CONF ON PERVASIVE INTELLIGENCE AND COMPUTING, INTL CONF ON CLOUD AND BIG DATA COMPUTING, INTL CONF ON CYBER SCIENCE AND TECHNOLOGY CONGRESS (DASC/PICOM/CBDCOM/CYBERSCITECH), 2020, : 612 - 619
  • [33] Accurate and efficient privacy-preserving string matching
    Vaiwsri, Sirintra
    Ranbaduge, Thilina
    Christen, Peter
    INTERNATIONAL JOURNAL OF DATA SCIENCE AND ANALYTICS, 2022, 14 (02) : 191 - 215
  • [34] Efficient Privacy-Preserving Stochastic Nonconvex Optimization
    Wang, Lingxiao
    Jayaraman, Bargav
    Evans, David
    Gu, Quanquan
    UNCERTAINTY IN ARTIFICIAL INTELLIGENCE, 2023, 216 : 2203 - 2213
  • [35] Efficient and privacy-preserving biometric identification in cloud
    Hahn, Changhee
    Hur, Junbeom
    ICT EXPRESS, 2016, 2 (03): : 135 - 139
  • [36] Efficient privacy-preserving similar document detection
    Mummoorthy Murugesan
    Wei Jiang
    Chris Clifton
    Luo Si
    Jaideep Vaidya
    The VLDB Journal, 2010, 19 : 457 - 475
  • [37] PPDC: A Privacy-Preserving Distinct Counting Scheme for Mobile Sensing
    Yang, Xiaochen
    Xu, Ming
    Fu, Shaojing
    Luo, Yuchuan
    APPLIED SCIENCES-BASEL, 2019, 9 (18):
  • [38] PDCS: A Privacy-Preserving Distinct Counting Scheme for Mobile Sensing
    Yang, Xiaochen
    Xu, Ming
    Fu, Shaojing
    Luo, Yuchuan
    DATABASE SYSTEMS FOR ADVANCED APPLICATIONS (DASFAA 2019), PT I, 2019, 11446 : 227 - 243
  • [39] Privacy preserving scheme for MapReduce
    Shetty, Madhvaraj M.
    Manjaiah, D. H.
    PROCEEDINGS OF 2017 IEEE INTERNATIONAL CONFERENCE ON CIRCUIT ,POWER AND COMPUTING TECHNOLOGIES (ICCPCT), 2017,
  • [40] Efficient Deep Learning Models for Privacy-Preserving People Counting on Low-Resolution Infrared Arrays
    Xie, Chen
    Daghero, Francesco
    Chen, Yukai
    Castellano, Marco
    Gandolfi, Luca
    Calimera, Andrea
    Macii, Enrico
    Poncino, Massimo
    Pagliari, Daniele Jahier
    IEEE INTERNET OF THINGS JOURNAL, 2023, 10 (15) : 13895 - 13907