Hadamard Encoding Based Frequent Itemset Mining under Local Differential Privacy

被引:2
|
作者
Zhao, Dan [1 ,2 ]
Zhao, Su-Yun [2 ]
Chen, Hong [2 ]
Liu, Rui-Xuan [2 ]
Li, Cui-Ping [2 ]
Zhang, Xiao-Ying [2 ]
机构
[1] Inst Sci & Tech Informat China, Beijing 100038, Peoples R China
[2] Renmin Univ China, Sch Informat, Key Lab Data Engn & Knowledge Engn, Minist Educ, Beijing 100872, Peoples R China
基金
中国国家自然科学基金; 北京市自然科学基金;
关键词
local differential privacy; frequent itemset mining; frequency oracle;
D O I
10.1007/s11390-023-1346-7
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Local differential privacy (LDP) approaches to collecting sensitive information for frequent itemset mining (FIM) can reliably guarantee privacy. Most current approaches to FIM under LDP add "padding and sampling" steps to obtain frequent itemsets and their frequencies because each user transaction represents a set of items. The current state-of-the-art approach, namely set-value itemset mining (SVSM), must balance variance and bias to achieve accurate results. Thus, an unbiased FIM approach with lower variance is highly promising. To narrow this gap, we propose an Item-Level LDP frequency oracle approach, named the Integrated-with-Hadamard-Transform-Based Frequency Oracle (IHFO). For the first time, Hadamard encoding is introduced to a set of values to encode all items into a fixed vector, and perturbation can be subsequently applied to the vector. An FIM approach, called optimized united itemset mining (O-UISM), is proposed to combine the padding-and-sampling-based frequency oracle (PSFO) and the IHFO into a framework for acquiring accurate frequent itemsets with their frequencies. Finally, we theoretically and experimentally demonstrate that O-UISM significantly outperforms the extant approaches in finding frequent itemsets and estimating their frequencies under the same privacy guarantee.
引用
收藏
页码:1403 / 1422
页数:20
相关论文
共 50 条
  • [41] Frequent Itemset Mining Algorithm Based on Linear Table
    Lu, Jun
    Xu, Wenhe
    Zhou, Kailong
    Guo, Zhicong
    JOURNAL OF DATABASE MANAGEMENT, 2023, 34 (01)
  • [42] A Distributed Frequent Itemset Mining Algorithm Based on Spark
    Gui, Feng
    Ma, Yunlong
    Zhang, Feng
    Liu, Min
    Li, Fei
    Shen, Weiming
    Bai, Hua
    PROCEEDINGS OF THE 2015 IEEE 19TH INTERNATIONAL CONFERENCE ON COMPUTER SUPPORTED COOPERATIVE WORK IN DESIGN (CSCWD), 2015, : 271 - 275
  • [43] Model-based probabilistic frequent itemset mining
    Bernecker, Thomas
    Cheng, Reynold
    Cheung, David W.
    Kriegel, Hans-Peter
    Lee, Sau Dan
    Renz, Matthias
    Verhein, Florian
    Wang, Liang
    Zuefle, Andreas
    KNOWLEDGE AND INFORMATION SYSTEMS, 2013, 37 (01) : 181 - 217
  • [44] Model-based probabilistic frequent itemset mining
    Thomas Bernecker
    Reynold Cheng
    David W. Cheung
    Hans-Peter Kriegel
    Sau Dan Lee
    Matthias Renz
    Florian Verhein
    Liang Wang
    Andreas Zuefle
    Knowledge and Information Systems, 2013, 37 : 181 - 217
  • [45] Mining Frequent Graph Patterns with Differential Privacy
    Shen, Entong
    Yu, Ting
    19TH ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING (KDD'13), 2013, : 545 - 553
  • [46] Frequent Sequence Pattern Mining with Differential Privacy
    Zhou, Fengli
    Lin, Xiaoli
    INTELLIGENT COMPUTING THEORIES AND APPLICATION, PT I, 2018, 10954 : 454 - 466
  • [47] Survey of differential privacy in frequent pattern mining
    Ding, Li-Ping
    Lu, Guo-Qing
    Tongxin Xuebao/Journal on Communications, 2014, 35 (10): : 200 - 209
  • [48] PrivTrie: Effective Frequent Term Discovery under Local Differential Privacy
    Wang, Ning
    Xiao, Xiaokui
    Yang, Yin
    Ta Duy Hoang
    Shin, Hyejin
    Shin, Junbum
    Yu, Ge
    2018 IEEE 34TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE), 2018, : 821 - 832
  • [49] Travel Trajectory Frequent Pattern Mining Based on Differential Privacy Protection
    Wang, Weiya
    Yang, Geng
    Bao, Lin
    Ma, Ke
    Zhou, Hao
    Bai, Yunlu
    WIRELESS COMMUNICATIONS & MOBILE COMPUTING, 2021, 2021
  • [50] Verifiable Privacy-Preserving Outsourced Frequent Itemset Mining on Vertically Partitioned Databases
    Zhao, Zhen
    Lan, Lei
    Wang, Baocang
    Lai, Jianchang
    ELECTRONICS, 2023, 12 (08)