A scalable privacy-preserving recommendation scheme via bisecting k-means clustering

被引:45
|
作者
Bilge, Alper [1 ]
Polat, Huseyin [1 ]
机构
[1] Anadolu Univ, Dept Comp Engn, TR-26555 Eskisehir, Turkey
关键词
Accuracy; Binary decision diagrams; Clustering methods; Data preprocessing; Data privacy; Recommender systems; SYSTEMS;
D O I
10.1016/j.ipm.2013.02.004
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Privacy-preserving collaborative filtering is an emerging web-adaptation tool to cope with information overload problem without jeopardizing individuals' privacy. However, collaborative filtering with privacy schemes commonly suffer from scalability and sparseness as the content in the domain proliferates. Moreover, applying privacy measures causes a distortion in collected data, which in turn defects accuracy of such systems. In this work, we propose a novel privacy-preserving collaborative filtering scheme based on bisecting k-means clustering in which we apply two preprocessing methods. The first preprocessing scheme deals with scalability problem by constructing a binary decision tree through a bisecting k-means clustering approach while the second produces clones of users by inserting pseudo-self-predictions into original user profiles to boost accuracy of scalability-enhanced structure. Sparse nature of collections are handled by transforming ratings into item features-based profiles. After analyzing our scheme with respect to privacy and supplementary costs, we perform experiments on benchmark data sets to evaluate it in terms of accuracy and online performance. Our empirical outcomes verify that combined effects of the proposed preprocessing schemes relieve scalability and augment accuracy significantly. (C) 2013 Elsevier Ltd. All rights reserved.
引用
收藏
页码:912 / 927
页数:16
相关论文
共 50 条
  • [1] Outsourced and Privacy-Preserving K-means Clustering Scheme for Smart Grid
    Shen, Xielin
    Yuan, Bo
    Peng, Weiwen
    Qian, Yuanquan
    Wu, Yonghua
    2022 IEEE 10TH INTERNATIONAL CONFERENCE ON INFORMATION, COMMUNICATION AND NETWORKS (ICICN 2022), 2022, : 307 - 313
  • [2] Privacy-Preserving K-Means Clustering Upon Negative Databases
    Hu, Xiaoyi
    Lu, Liping
    Zhao, Dongdong
    Xiang, Jianwen
    Liu, Xing
    Zhou, Haiying
    Xiong, Shengwu
    Tian, Jing
    NEURAL INFORMATION PROCESSING (ICONIP 2018), PT IV, 2018, 11304 : 191 - 204
  • [3] Importance of Data Standardization in Privacy-Preserving K-Means Clustering
    Su, Chunhua
    Zhan, Justin
    Sakurai, Kouichi
    DATABASE SYSTEMS FOR ADVANCED APPLICATIONS, 2009, 5667 : 276 - +
  • [4] Privacy-Preserving Hybrid K-Means
    Gao, Zhiqiang
    Sun, Yixiao
    Cui, Xiaolong
    Wang, Yutao
    Duan, Yanyu
    Wang, Xu An
    INTERNATIONAL JOURNAL OF DATA WAREHOUSING AND MINING, 2018, 14 (02) : 1 - 17
  • [5] K-Means Clustering With Local dχ-Privacy for Privacy-Preserving Data Analysis
    Yang, Mengmeng
    Tjuawinata, Ivan
    Lam, Kwok-Yan
    IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, 2022, 17 : 2524 - 2537
  • [6] Privacy-preserving kernel k-means clustering outsourcing with random transformation
    Keng-Pei Lin
    Knowledge and Information Systems, 2016, 49 : 885 - 908
  • [7] Efficient privacy-preserving outsourced k-means clustering on distributed data
    Qiu, Guowei
    Zhao, Yingliang
    Gui, Xiaolin
    INFORMATION SCIENCES, 2024, 674
  • [8] Efficient and Privacy-Preserving k-means clustering For Big Data Mining
    Gheid, Zakaria
    Challal, Yacine
    2016 IEEE TRUSTCOM/BIGDATASE/ISPA, 2016, : 791 - 798
  • [9] Privacy-preserving kernel k-means clustering outsourcing with random transformation
    Lin, Keng-Pei
    KNOWLEDGE AND INFORMATION SYSTEMS, 2016, 49 (03) : 885 - 908
  • [10] A reversible privacy-preserving clustering technique based on k-means algorithm
    Lin, Chen-Yi
    APPLIED SOFT COMPUTING, 2020, 87