Frequent Itemset Mining with Local Differential Privacy

被引:11
|
作者
Li, Junhui [1 ]
Gan, Wensheng [1 ,3 ]
Gui, Yijie [1 ]
Wu, Yongdong [1 ]
Yu, Philip S. [2 ]
机构
[1] Jinan Univ, Guangzhou, Peoples R China
[2] Univ Illinois, Chicago, IL USA
[3] Pazhou Lab, Guangzhou 510330, Peoples R China
基金
中国国家自然科学基金;
关键词
differential privacy; frequent itemset; transaction database; local differential privacy;
D O I
10.1145/3511808.3557327
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
With the development of the Internet, a large amount of transaction data (e.g., shopping records, web browsing history), which represents user data, has been generated. By collecting user transaction data and learning specific patterns and association rules from it, service providers can provide better services. However, because of the increasing privacy awareness and the formulation of laws on data protection, collecting data directly from users will raise privacy concerns. The concept of local differential privacy (LDP), which provides strict data privacy protection on the user side and allows effective statistical analysis on the server side, is able to protect user privacy and perform statistics on sensitive issues at the same time. This paper adopts padding-and-sampling-based frequent oracle (PSFO), combined with an interactive query-response method satisfying local differential privacy, to identify frequent itemsets in an efficient and accurate way. Therefore, this paper proposes FIML, an improved algorithm for finding frequent itemsets in the LDP setting of transaction data. The data collector generates frequent candidate sets based on the results of the previous stage and uses them for querying, and users randomize their responses in a reduced domain to achieve local differential privacy. Extensive experiments on real-world and synthetic datasets show that the FIML algorithm can find frequent itemsets more efficiently with the same privacy protection and computational cost.
引用
收藏
页码:1146 / 1155
页数:10
相关论文
共 50 条
  • [1] A Frequent Itemset Mining Method Based on Local Differential Privacy
    Wu, Ning
    Zou, Yunfeng
    Shan, Chao
    WEB INFORMATION SYSTEMS AND APPLICATIONS (WISA 2021), 2021, 12999 : 225 - 236
  • [2] Frequent Itemset Mining with Hadamard Response Under Local Differential Privacy
    Liu, Haijiang
    Bai, Xiangyu
    Ma, Xuebin
    Cui, Lianwei
    PROCEEDINGS OF 2020 IEEE 10TH INTERNATIONAL CONFERENCE ON ELECTRONICS INFORMATION AND EMERGENCY COMMUNICATION (ICEIEC 2020), 2020, : 49 - 52
  • [3] PrivBasis: Frequent Itemset Mining with Differential Privacy
    Li, Ninghui
    Qardaji, Wahbeh
    Su, Dong
    Cao, Jianneng
    PROCEEDINGS OF THE VLDB ENDOWMENT, 2012, 5 (11): : 1340 - 1351
  • [4] Hadamard Encoding Based Frequent Itemset Mining under Local Differential Privacy
    Zhao, Dan
    Zhao, Su-Yun
    Chen, Hong
    Liu, Rui-Xuan
    Li, Cui-Ping
    Zhang, Xiao-Ying
    JOURNAL OF COMPUTER SCIENCE AND TECHNOLOGY, 2023, 38 (06) : 1403 - 1422
  • [5] Hadamard Encoding Based Frequent Itemset Mining under Local Differential Privacy
    Dan Zhao
    Su-Yun Zhao
    Hong Chen
    Rui-Xuan Liu
    Cui-Ping Li
    Xiao-Ying Zhang
    Journal of Computer Science and Technology, 2023, 38 : 1403 - 1422
  • [6] Improving the Effect of Frequent Itemset Mining with Hadamard Response under Local Differential Privacy
    Ma, Xuebin
    Liu, Haijiang
    Guan, Shengyi
    2021 IEEE 20TH INTERNATIONAL CONFERENCE ON TRUST, SECURITY AND PRIVACY IN COMPUTING AND COMMUNICATIONS (TRUSTCOM 2021), 2021, : 436 - 443
  • [7] Frequent Itemset Mining with Differential Privacy Based on Transaction Truncation
    Xia, Ying
    Huang, Yu
    Zhang, Xu
    Bae, HaeYoung
    INFORMATION AND COMMUNICATIONS SECURITY, ICICS 2017, 2018, 10631 : 438 - 445
  • [8] PrivMiner: a similar-first approach to frequent itemset mining under local differential privacy
    Li, Yanhui
    Huang, Chen
    Cheng, Mengyuan
    Lv, Tianci
    Zhao, Yuxin
    Sun, Yongjiao
    Yuan, Ye
    WORLD WIDE WEB-INTERNET AND WEB INFORMATION SYSTEMS, 2025, 28 (02):
  • [9] Frequent Itemset Mining of User's Multi-Attribute under Local Differential Privacy
    Liu, Haijiang
    Cui, Lianwei
    Ma, Xuebin
    Wu, Celimuge
    CMC-COMPUTERS MATERIALS & CONTINUA, 2020, 65 (01): : 369 - 385
  • [10] Frequent Itemset Mining Algorithm Based on Differential Privacy in Vertical Structure
    Long, Shigong
    Lu, Hongqin
    Chen, Tingting
    Zhou, Nannan
    Liu, Hai
    International Journal of Network Security, 2022, 24 (01) : 75 - 82