A scalable privacy-preserving recommendation scheme via bisecting k-means clustering

被引：45

作者：

Bilge, Alper ^{[1
]}

Polat, Huseyin ^{[1
]}

机构：

[1] Anadolu Univ, Dept Comp Engn, TR-26555 Eskisehir, Turkey

来源：

INFORMATION PROCESSING & MANAGEMENT | 2013年 / 49卷 / 04期

关键词：

Accuracy; Binary decision diagrams; Clustering methods; Data preprocessing; Data privacy; Recommender systems; SYSTEMS;

D O I：

10.1016/j.ipm.2013.02.004

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Privacy-preserving collaborative filtering is an emerging web-adaptation tool to cope with information overload problem without jeopardizing individuals' privacy. However, collaborative filtering with privacy schemes commonly suffer from scalability and sparseness as the content in the domain proliferates. Moreover, applying privacy measures causes a distortion in collected data, which in turn defects accuracy of such systems. In this work, we propose a novel privacy-preserving collaborative filtering scheme based on bisecting k-means clustering in which we apply two preprocessing methods. The first preprocessing scheme deals with scalability problem by constructing a binary decision tree through a bisecting k-means clustering approach while the second produces clones of users by inserting pseudo-self-predictions into original user profiles to boost accuracy of scalability-enhanced structure. Sparse nature of collections are handled by transforming ratings into item features-based profiles. After analyzing our scheme with respect to privacy and supplementary costs, we perform experiments on benchmark data sets to evaluate it in terms of accuracy and online performance. Our empirical outcomes verify that combined effects of the proposed preprocessing schemes relieve scalability and augment accuracy significantly. (C) 2013 Elsevier Ltd. All rights reserved.

引用

页码：912 / 927

页数：16

共 50 条

[1] Outsourced and Privacy-Preserving K-means Clustering Scheme for Smart Grid
Shen, Xielin
Yuan, Bo
Peng, Weiwen
Qian, Yuanquan
Wu, Yonghua
2022 IEEE 10TH INTERNATIONAL CONFERENCE ON INFORMATION, COMMUNICATION AND NETWORKS (ICICN 2022), 2022, : 307 - 313
[2] Privacy-Preserving K-Means Clustering Upon Negative Databases
Hu, Xiaoyi
Lu, Liping
Zhao, Dongdong
Xiang, Jianwen
Liu, Xing
Zhou, Haiying
Xiong, Shengwu
Tian, Jing
NEURAL INFORMATION PROCESSING (ICONIP 2018), PT IV, 2018, 11304 : 191 - 204
[3] Importance of Data Standardization in Privacy-Preserving K-Means Clustering
Su, Chunhua
Zhan, Justin
Sakurai, Kouichi
DATABASE SYSTEMS FOR ADVANCED APPLICATIONS, 2009, 5667 : 276 - +
[4] Privacy-Preserving Hybrid K-Means
Gao, Zhiqiang
Sun, Yixiao
Cui, Xiaolong
Wang, Yutao
Duan, Yanyu
Wang, Xu An
INTERNATIONAL JOURNAL OF DATA WAREHOUSING AND MINING, 2018, 14 (02) : 1 - 17
[5] K-Means Clustering With Local dχ-Privacy for Privacy-Preserving Data Analysis
Yang, Mengmeng
Tjuawinata, Ivan
Lam, Kwok-Yan
IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, 2022, 17 : 2524 - 2537
[6] Privacy-preserving kernel k-means clustering outsourcing with random transformation
Keng-Pei Lin
Knowledge and Information Systems, 2016, 49 : 885 - 908
[7] Efficient privacy-preserving outsourced k-means clustering on distributed data
Qiu, Guowei
Zhao, Yingliang
Gui, Xiaolin
INFORMATION SCIENCES, 2024, 674
[8] Efficient and Privacy-Preserving k-means clustering For Big Data Mining
Gheid, Zakaria
Challal, Yacine
2016 IEEE TRUSTCOM/BIGDATASE/ISPA, 2016, : 791 - 798
[9] Privacy-preserving kernel k-means clustering outsourcing with random transformation
Lin, Keng-Pei
KNOWLEDGE AND INFORMATION SYSTEMS, 2016, 49 (03) : 885 - 908
[10] A reversible privacy-preserving clustering technique based on k-means algorithm
Lin, Chen-Yi
APPLIED SOFT COMPUTING, 2020, 87

← 1 2 3 4 5 →