Private Frequent Itemset Mining in the Local Setting

Cited by: 2
Authors
Fu, Hang [1]
Yang, Wei [1]
Huang, Liusheng [1]
Affiliations
[1] Univ Sci & Technol China, Sch Comp Sci & Technol, Hefei, Peoples R China
Keywords
Local differential privacy; Frequent itemset mining; Crowdsensing; Randomized response
DOI
10.1007/978-3-030-86130-8_27
Chinese Library Classification
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Set-valued data, which is useful for representing user-generated data, has become ubiquitous in numerous online services. Service providers profit by learning patterns and associations from users' set-valued data. However, collecting these data directly from users raises privacy concerns. This work studies frequent itemset mining from user-generated set-valued data while locally preserving personal data privacy. Under local d-privacy constraints, which capture the intrinsic dissimilarity between set-valued data in the framework of differential privacy, we propose a novel privacy-preserving frequent itemset mining mechanism called PrivFIM. It provides rigorous data privacy protection on the user side and allows effective statistical analyses on the server side. Specifically, each user perturbs their set-valued data locally to guarantee that the server cannot infer the user's original itemset with high confidence. The server can reconstruct an unbiased estimate of itemset frequency from these randomized data and then combine it with an Apriori-based pruning technique to identify frequent itemsets efficiently and accurately. Extensive experiments on real-world and synthetic datasets demonstrate that PrivFIM surpasses existing methods and maintains high utility while providing strong privacy guarantees.
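The pipeline described in the abstract (local perturbation, unbiased frequency estimation on the server, Apriori-style pruning) can be illustrated with a minimal sketch. This is not PrivFIM itself: the record does not give the paper's d-privacy mechanism, so the sketch swaps in plain per-item randomized response, where each presence bit is reported truthfully with an assumed probability `p > 0.5`. The function names `perturb`, `estimate_frequency`, and `mine_frequent_itemsets`, and the parameters `p`, `threshold`, and `max_size`, are hypothetical and chosen only for illustration.

```python
import random
from itertools import combinations


def perturb(itemset, domain, p, rng):
    """User side (assumed mechanism): keep each item's presence bit with
    probability p, flip it otherwise. Not the paper's d-privacy mechanism."""
    report = set()
    for item in domain:
        bit = item in itemset
        if rng.random() >= p:  # happens with probability 1 - p
            bit = not bit
        if bit:
            report.add(item)
    return report


def estimate_frequency(reports, candidate, p):
    """Server side: unbiased estimate of the fraction of users whose true
    itemset contains every item in `candidate`. Requires p > 0.5."""
    q = 1.0 - p
    total = 0.0
    for report in reports:
        prod = 1.0
        for item in candidate:
            y = 1.0 if item in report else 0.0
            # (y - q) / (p - q) has expectation equal to the true bit; bits
            # are perturbed independently, so the product stays unbiased.
            prod *= (y - q) / (p - q)
        total += prod
    return total / len(reports)


def mine_frequent_itemsets(reports, domain, p, threshold, max_size=3):
    """Level-wise search with Apriori pruning on the estimated frequencies."""
    frequent = {}
    level = [frozenset([item]) for item in sorted(domain)]
    size = 1
    while level and size <= max_size:
        kept = []
        for cand in level:
            freq = estimate_frequency(reports, cand, p)
            if freq >= threshold:
                frequent[cand] = freq
                kept.append(cand)
        size += 1
        kept_set = set(kept)
        candidates = set()
        for a, b in combinations(kept, 2):
            union = a | b
            # Keep a candidate only if all of its (size - 1)-subsets survived.
            if len(union) == size and all(
                frozenset(sub) in kept_set
                for sub in combinations(union, size - 1)
            ):
                candidates.add(union)
        level = sorted(candidates, key=sorted)
    return frequent


if __name__ == "__main__":
    rng = random.Random(0)
    domain = {"a", "b", "c"}
    users = [{"a", "b"}] * 600 + [{"c"}] * 400
    reports = [perturb(u, domain, 0.9, rng) for u in users]
    for itemset, freq in mine_frequent_itemsets(reports, domain, 0.9, 0.4).items():
        print(sorted(itemset), round(freq, 3))
```

The product estimator grows noisier as itemsets get larger, because each extra item multiplies in another factor of 1 / (2p - 1). That is one reason pruning candidates early, as the abstract describes, matters for accuracy as well as speed.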
Pages: 338-350
Page count: 13