Frequent Itemset Mining with Local Differential Privacy

Cited by: 11
Authors
Li, Junhui [1]
Gan, Wensheng [1, 3]
Gui, Yijie [1]
Wu, Yongdong [1]
Yu, Philip S. [2]
Affiliations
[1] Jinan Univ, Guangzhou, Peoples R China
[2] Univ Illinois, Chicago, IL USA
[3] Pazhou Lab, Guangzhou 510330, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
differential privacy; frequent itemset; transaction database; local differential privacy
DOI
10.1145/3511808.3557327
Chinese Library Classification number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
With the growth of the Internet, large volumes of user transaction data (e.g., shopping records and web browsing histories) are being generated. By collecting this data and learning patterns and association rules from it, service providers can offer better services. However, growing privacy awareness and the introduction of data protection laws mean that collecting data directly from users raises privacy concerns. Local differential privacy (LDP) provides strict privacy protection on the user side while still allowing effective statistical analysis on the server side, so statistics on sensitive data can be computed without exposing individual users. This paper proposes FIML, an improved algorithm for finding frequent itemsets in transaction data under LDP. FIML combines a padding-and-sampling-based frequency oracle (PSFO) with an interactive query-response method that satisfies LDP to identify frequent itemsets efficiently and accurately. The data collector generates candidate frequent itemsets from the results of the previous stage and uses them as queries, and users randomize their responses over a reduced domain to achieve LDP. Extensive experiments on real-world and synthetic datasets show that FIML finds frequent itemsets more efficiently at the same level of privacy protection and computational cost.
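The padding-and-sampling step and the randomized response over a reduced domain can be illustrated with a minimal sketch. This is not the FIML algorithm itself, whose details the abstract does not give. It assumes a generic padding-and-sampling frequency oracle for single items, built on generalized randomized response, and all names (pad_and_sample, grr_perturb, estimate_item_frequencies, pad_len) are hypothetical.

```python
import math
import random
from collections import Counter


def pad_and_sample(items, pad_len, rng):
    """Pad an itemset with dummy items up to pad_len, then sample one element."""
    padded = sorted(items)
    # Assumption: dummies are tagged tuples that cannot collide with real items.
    padded += [("dummy", i) for i in range(pad_len - len(padded))]
    return rng.choice(padded)


def grr_perturb(value, domain, epsilon, rng):
    """Generalized randomized response: keep the value with probability p,
    otherwise report a different domain value chosen uniformly."""
    d = len(domain)
    p = math.exp(epsilon) / (math.exp(epsilon) + d - 1)
    if rng.random() < p:
        return value
    return rng.choice([v for v in domain if v != value])


def estimate_item_frequencies(transactions, candidates, pad_len, epsilon, seed=0):
    """Simulate one query round and return unbiased frequency estimates.

    Each simulated user restricts the transaction to the candidate set (the
    reduced domain), pads and samples one item, and perturbs it locally.
    """
    rng = random.Random(seed)
    domain = list(candidates) + [("dummy", i) for i in range(pad_len)]
    d = len(domain)
    p = math.exp(epsilon) / (math.exp(epsilon) + d - 1)
    q = 1.0 / (math.exp(epsilon) + d - 1)

    reports = Counter()
    for transaction in transactions:
        kept = [item for item in transaction if item in candidates]
        sampled = pad_and_sample(kept, pad_len, rng)
        reports[grr_perturb(sampled, domain, epsilon, rng)] += 1

    n = len(transactions)
    # Debias the observed share, then scale by pad_len to undo the sampling.
    return {c: pad_len * (reports[c] / n - q) / (p - q) for c in candidates}
```

Calling estimate_item_frequencies(transactions, candidates, pad_len=3, epsilon=2.0) returns a dictionary mapping each candidate to its estimated fraction of users. In this sketch, a smaller candidate set shrinks the response domain and so lowers the variance of the estimates, which is the motivation for querying over a reduced domain.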
Pages: 1146 - 1155
Number of pages: 10
Related papers
50 in total
  • [41] A Survey Paper on Frequent Itemset Mining
    Sastry, J. S. V. R. S.
    Suresh, V
    INTERNATIONAL CONFERENCE ON COMPUTER VISION AND MACHINE LEARNING, 2019, 1228
  • [42] Frequent Itemset Mining in Multirelational Databases
    Jimenez, Aida
    Berzal, Fernando
    Cubero, Juan-Carlos
    FOUNDATIONS OF INTELLIGENT SYSTEMS, PROCEEDINGS, 2009, 5722 : 15 - 24
  • [43] Verified Programs for Frequent Itemset Mining
    Loulergue, Frederic
    Whitney, Christopher D.
    2018 IEEE SMARTWORLD, UBIQUITOUS INTELLIGENCE & COMPUTING, ADVANCED & TRUSTED COMPUTING, SCALABLE COMPUTING & COMMUNICATIONS, CLOUD & BIG DATA COMPUTING, INTERNET OF PEOPLE AND SMART CITY INNOVATION (SMARTWORLD/SCALCOM/UIC/ATC/CBDCOM/IOP/SCI), 2018, : 1516 - 1523
  • [44] A primer to frequent itemset mining for bioinformatics
    Naulaerts, Stefan
    Meysman, Pieter
    Bittremieux, Wout
    Trung Nghia Vu
    Vanden Berghe, Wim
    Goethals, Bart
    Laukens, Kris
    BRIEFINGS IN BIOINFORMATICS, 2015, 16 (02) : 216 - 231
  • [45] Oracle and Vertica for Frequent Itemset Mining
    Kyurkchiev, Hristo
    Kaloyanova, Kalinka
    DATA MINING AND BIG DATA, DMBD 2016, 2016, 9714 : 77 - 85
  • [46] An efficient frequent itemset mining algorithm
    Luo, Ke
    Zhang, Xue-Mao
    PROCEEDINGS OF 2007 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-7, 2007, : 756 - 761
  • [47] Privacy preserving frequent itemset mining: Maximizing data utility based on database reconstruction
    Li, Shaoxin
    Mu, Nankun
    Le, Junqing
    Liao, Xiaofeng
    COMPUTERS & SECURITY, 2019, 84 : 17 - 34
  • [48] A parallel algorithm for frequent itemset mining
    Li, L
    Zhai, DH
    Fan, J
    PARALLEL AND DISTRIBUTED COMPUTING, APPLICATIONS AND TECHNOLOGIES, PDCAT'2003, PROCEEDINGS, 2003, : 868 - 871
  • [49] Frequent itemset mining with parallel RDBMS
    Shang, XQ
    Sattler, KU
    ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PROCEEDINGS, 2005, 3518 : 539 - 544
  • [50] Verifiable Privacy-Preserving Outsourced Frequent Itemset Mining on Vertically Partitioned Databases
    Zhao, Zhen
    Lan, Lei
    Wang, Baocang
    Lai, Jianchang
    ELECTRONICS, 2023, 12 (08)