Local differential privacy protection for frequent sequence mining

被引:0
|
作者
Yang G. [1 ]
Gong C. [1 ]
Fang X. [1 ]
Ge B. [1 ]
Su S. [1 ]
机构
[1] School of Computer Science and Engineering, Anhui University of Science and Technology, Huainan
关键词
Association rule; Data utility; Frequent sequences; Local differential privacy; Local sensitivity; Privacy preserving; Proprietary privacy budget; Random response;
D O I
10.11990/jheu.201812051
中图分类号
学科分类号
摘要
To enhance the privacy protection of frequent sequences, improve its mining utility, and reduce the effect of data dimensionality, we propose a frequent sequence mining model that satisfies local differential privacy and design an algorithm to achieve it. The algorithm obtains frequent sequences on the basis of the idea of pruning. First, we analyzed interference in the data set using the randomized response method based on local sensitivity and utilized the sequence support degree and proprietary privacy budget to improve its applicability, and on the basis of the FP-growth prefix and suffix principle, we mined frequent sequences of level 3 and above using frequent sequences of level 2 and above. Second, we selected reasonable local sensitivity to traverse the data set before and after interference to determine the runtime of frequent sequence mining. Finally, on the basis of the combination nature of local differential privacy, we proved theoretically that the algorithm satisfies local differential privacy and verified experimentally its effectiveness. The experimental results indicate that the algorithm can implement local differential privacy protection of frequent sequences safely and efficiently, ensuring the accuracy of frequent sequences. © 2019, Editorial Department of Journal of HEU. All right reserved.
引用
收藏
页码:1903 / 1910
页数:7
相关论文
共 27 条
  • [21] Yoon Y.U., Park D.H., Kim J.G., Et al., Most frequent mode for intra-mode coding in video coding, Electronics Letters, 55, 4, pp. 188-190, (2019)
  • [22] Flood V.H., Johnsen J.M., Kochelek C., Et al., Common VWF sequence variants associated with higher VWF and FVIII are less frequent in subjects diagnosed with type 1 VWD, Research & Practice in Thrombosis & Haemostasis, 2, 2, pp. 390-398, (2018)
  • [23] Takeuchi Y., Shinozaki T., Matsuyama Y., A comparison of estimators from self-controlled case series, case-crossover design, and sequence symmetry analysis for pharmacoepidemiological studies, Bmc Medical Research Methodology, 18, 1, (2018)
  • [24] Zaki M.J., SPADE: An efficient algorithm for mining frequent sequences, Machine Learning, 42, 1, pp. 31-60, (2001)
  • [25] Mahanipour A., Nezamabadi-Pour H., GSP: an automatic programming technique with gravitational search algorithm, Applied Intelligence, 49, 4, pp. 1502-1516, (2019)
  • [26] Wang K.E., Sadredini E., Skadron K., Sequential pattern mining with the Micron automata processor, Proceedings of the ACM International Conference on Computing Frontiers, pp. 135-144, (2016)
  • [27] Pimus I., Peleg M., Schertz M., Sequence Mining of Comorbid neurodevelopmental disorders using the SPADE algorithm, Methods of Information in Medicine, 55, 3, pp. 223-233, (2016)