Frequent Itemset Mining with Local Differential Privacy

被引:11
|
作者
Li, Junhui [1 ]
Gan, Wensheng [1 ,3 ]
Gui, Yijie [1 ]
Wu, Yongdong [1 ]
Yu, Philip S. [2 ]
机构
[1] Jinan Univ, Guangzhou, Peoples R China
[2] Univ Illinois, Chicago, IL USA
[3] Pazhou Lab, Guangzhou 510330, Peoples R China
基金
中国国家自然科学基金;
关键词
differential privacy; frequent itemset; transaction database; local differential privacy;
D O I
10.1145/3511808.3557327
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
With the development of the Internet, a large amount of transaction data (e.g., shopping records, web browsing history), which represents user data, has been generated. By collecting user transaction data and learning specific patterns and association rules from it, service providers can provide better services. However, because of the increasing privacy awareness and the formulation of laws on data protection, collecting data directly from users will raise privacy concerns. The concept of local differential privacy (LDP), which provides strict data privacy protection on the user side and allows effective statistical analysis on the server side, is able to protect user privacy and perform statistics on sensitive issues at the same time. This paper adopts padding-and-sampling-based frequent oracle (PSFO), combined with an interactive query-response method satisfying local differential privacy, to identify frequent itemsets in an efficient and accurate way. Therefore, this paper proposes FIML, an improved algorithm for finding frequent itemsets in the LDP setting of transaction data. The data collector generates frequent candidate sets based on the results of the previous stage and uses them for querying, and users randomize their responses in a reduced domain to achieve local differential privacy. Extensive experiments on real-world and synthetic datasets show that the FIML algorithm can find frequent itemsets more efficiently with the same privacy protection and computational cost.
引用
收藏
页码:1146 / 1155
页数:10
相关论文
共 50 条
  • [21] Local differential privacy-based frequent sequence mining
    Wang, Teng
    Hu, Zhi
    JOURNAL OF KING SAUD UNIVERSITY-COMPUTER AND INFORMATION SCIENCES, 2022, 34 (06) : 3591 - 3601
  • [22] Approximate inverse frequent itemset mining: Privacy, complexity, and approximation
    Wang, YG
    Wu, XT
    FIFTH IEEE INTERNATIONAL CONFERENCE ON DATA MINING, PROCEEDINGS, 2005, : 482 - 489
  • [23] An effective scheme for top-k frequent itemset mining under differential privacy conditions
    Wenjuan LIANG
    Hong CHEN
    Jing ZHANG
    Dan ZHAO
    Cuiping LI
    Science China(Information Sciences), 2020, 63 (05) : 200 - 202
  • [24] An effective scheme for top-k frequent itemset mining under differential privacy conditions
    Liang, Wenjuan
    Chen, Hong
    Zhang, Jing
    Zhao, Dan
    Li, Cuiping
    SCIENCE CHINA-INFORMATION SCIENCES, 2020, 63 (05)
  • [25] An effective scheme for top-k frequent itemset mining under differential privacy conditions
    Wenjuan Liang
    Hong Chen
    Jing Zhang
    Dan Zhao
    Cuiping Li
    Science China Information Sciences, 2020, 63
  • [26] Frequent Itemsets Mining with a Guaranteed Local Differential Privacy in Small Datasets
    Afrose, Sharmin
    Hashem, Tanzima
    Ali, Mohammed Eunus
    33RD INTERNATIONAL CONFERENCE ON SCIENTIFIC AND STATISTICAL DATABASE MANAGEMENT (SSDBM 2021), 2020, : 232 - 236
  • [27] Privacy-Preserving Frequent Itemset Mining in Outsourced Transaction Databases
    Chandrasekharan, Iyer
    Baruah, P. K.
    Mukkamala, Ravi
    2015 INTERNATIONAL CONFERENCE ON ADVANCES IN COMPUTING, COMMUNICATIONS AND INFORMATICS (ICACCI), 2015, : 787 - 793
  • [28] Efficient Apriori Based Algorithms for Privacy Preserving Frequent Itemset Mining
    Csiszarik, Adrian
    Lestyan, Szilvia
    Lukacs, Andras
    2014 5TH IEEE CONFERENCE ON COGNITIVE INFOCOMMUNICATIONS (COGINFOCOM), 2014, : 431 - 435
  • [29] Practical Privacy-Preserving Frequent Itemset Mining on Supermarket Transactions
    Ma, Chenyang
    Wang, Baocang
    Jooste, Kyle
    Zhang, Zhili
    Ping, Yuan
    IEEE SYSTEMS JOURNAL, 2020, 14 (02): : 1992 - 2002
  • [30] Privacy-Preserving Frequent Itemset Mining for Sparse and Dense Data
    Laud, Peeter
    Pankova, Alisa
    SECURE IT SYSTEMS, NORDSEC 2017, 2017, 10674 : 139 - 155