Hadamard Encoding Based Frequent Itemset Mining under Local Differential Privacy

被引:2
|
作者
Zhao, Dan [1 ,2 ]
Zhao, Su-Yun [2 ]
Chen, Hong [2 ]
Liu, Rui-Xuan [2 ]
Li, Cui-Ping [2 ]
Zhang, Xiao-Ying [2 ]
机构
[1] Inst Sci & Tech Informat China, Beijing 100038, Peoples R China
[2] Renmin Univ China, Sch Informat, Key Lab Data Engn & Knowledge Engn, Minist Educ, Beijing 100872, Peoples R China
基金
中国国家自然科学基金; 北京市自然科学基金;
关键词
local differential privacy; frequent itemset mining; frequency oracle;
D O I
10.1007/s11390-023-1346-7
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Local differential privacy (LDP) approaches to collecting sensitive information for frequent itemset mining (FIM) can reliably guarantee privacy. Most current approaches to FIM under LDP add "padding and sampling" steps to obtain frequent itemsets and their frequencies because each user transaction represents a set of items. The current state-of-the-art approach, namely set-value itemset mining (SVSM), must balance variance and bias to achieve accurate results. Thus, an unbiased FIM approach with lower variance is highly promising. To narrow this gap, we propose an Item-Level LDP frequency oracle approach, named the Integrated-with-Hadamard-Transform-Based Frequency Oracle (IHFO). For the first time, Hadamard encoding is introduced to a set of values to encode all items into a fixed vector, and perturbation can be subsequently applied to the vector. An FIM approach, called optimized united itemset mining (O-UISM), is proposed to combine the padding-and-sampling-based frequency oracle (PSFO) and the IHFO into a framework for acquiring accurate frequent itemsets with their frequencies. Finally, we theoretically and experimentally demonstrate that O-UISM significantly outperforms the extant approaches in finding frequent itemsets and estimating their frequencies under the same privacy guarantee.
引用
收藏
页码:1403 / 1422
页数:20
相关论文
共 50 条
  • [1] Hadamard Encoding Based Frequent Itemset Mining under Local Differential Privacy
    Dan Zhao
    Su-Yun Zhao
    Hong Chen
    Rui-Xuan Liu
    Cui-Ping Li
    Xiao-Ying Zhang
    Journal of Computer Science and Technology, 2023, 38 : 1403 - 1422
  • [2] Frequent Itemset Mining with Hadamard Response Under Local Differential Privacy
    Liu, Haijiang
    Bai, Xiangyu
    Ma, Xuebin
    Cui, Lianwei
    PROCEEDINGS OF 2020 IEEE 10TH INTERNATIONAL CONFERENCE ON ELECTRONICS INFORMATION AND EMERGENCY COMMUNICATION (ICEIEC 2020), 2020, : 49 - 52
  • [3] Improving the Effect of Frequent Itemset Mining with Hadamard Response under Local Differential Privacy
    Ma, Xuebin
    Liu, Haijiang
    Guan, Shengyi
    2021 IEEE 20TH INTERNATIONAL CONFERENCE ON TRUST, SECURITY AND PRIVACY IN COMPUTING AND COMMUNICATIONS (TRUSTCOM 2021), 2021, : 436 - 443
  • [4] Frequent Itemset Mining with Local Differential Privacy
    Li, Junhui
    Gan, Wensheng
    Gui, Yijie
    Wu, Yongdong
    Yu, Philip S.
    PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, CIKM 2022, 2022, : 1146 - 1155
  • [5] A Frequent Itemset Mining Method Based on Local Differential Privacy
    Wu, Ning
    Zou, Yunfeng
    Shan, Chao
    WEB INFORMATION SYSTEMS AND APPLICATIONS (WISA 2021), 2021, 12999 : 225 - 236
  • [6] PrivBasis: Frequent Itemset Mining with Differential Privacy
    Li, Ninghui
    Qardaji, Wahbeh
    Su, Dong
    Cao, Jianneng
    PROCEEDINGS OF THE VLDB ENDOWMENT, 2012, 5 (11): : 1340 - 1351
  • [7] PrivMiner: a similar-first approach to frequent itemset mining under local differential privacy
    Li, Yanhui
    Huang, Chen
    Cheng, Mengyuan
    Lv, Tianci
    Zhao, Yuxin
    Sun, Yongjiao
    Yuan, Ye
    WORLD WIDE WEB-INTERNET AND WEB INFORMATION SYSTEMS, 2025, 28 (02):
  • [8] Frequent Itemset Mining with Differential Privacy Based on Transaction Truncation
    Xia, Ying
    Huang, Yu
    Zhang, Xu
    Bae, HaeYoung
    INFORMATION AND COMMUNICATIONS SECURITY, ICICS 2017, 2018, 10631 : 438 - 445
  • [9] Frequent Itemset Mining of User's Multi-Attribute under Local Differential Privacy
    Liu, Haijiang
    Cui, Lianwei
    Ma, Xuebin
    Wu, Celimuge
    CMC-COMPUTERS MATERIALS & CONTINUA, 2020, 65 (01): : 369 - 385
  • [10] Frequent Itemset Mining Algorithm Based on Differential Privacy in Vertical Structure
    Long, Shigong
    Lu, Hongqin
    Chen, Tingting
    Zhou, Nannan
    Liu, Hai
    International Journal of Network Security, 2022, 24 (01) : 75 - 82