A scalable privacy-preserving recommendation scheme via bisecting k-means clustering

被引:45
|
作者
Bilge, Alper [1 ]
Polat, Huseyin [1 ]
机构
[1] Anadolu Univ, Dept Comp Engn, TR-26555 Eskisehir, Turkey
关键词
Accuracy; Binary decision diagrams; Clustering methods; Data preprocessing; Data privacy; Recommender systems; SYSTEMS;
D O I
10.1016/j.ipm.2013.02.004
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Privacy-preserving collaborative filtering is an emerging web-adaptation tool to cope with information overload problem without jeopardizing individuals' privacy. However, collaborative filtering with privacy schemes commonly suffer from scalability and sparseness as the content in the domain proliferates. Moreover, applying privacy measures causes a distortion in collected data, which in turn defects accuracy of such systems. In this work, we propose a novel privacy-preserving collaborative filtering scheme based on bisecting k-means clustering in which we apply two preprocessing methods. The first preprocessing scheme deals with scalability problem by constructing a binary decision tree through a bisecting k-means clustering approach while the second produces clones of users by inserting pseudo-self-predictions into original user profiles to boost accuracy of scalability-enhanced structure. Sparse nature of collections are handled by transforming ratings into item features-based profiles. After analyzing our scheme with respect to privacy and supplementary costs, we perform experiments on benchmark data sets to evaluate it in terms of accuracy and online performance. Our empirical outcomes verify that combined effects of the proposed preprocessing schemes relieve scalability and augment accuracy significantly. (C) 2013 Elsevier Ltd. All rights reserved.
引用
收藏
页码:912 / 927
页数:16
相关论文
共 50 条
  • [41] Privacy Preserving Distributed K-Means Clustering in Malicious Model Using Verifiable Secret Sharing Scheme
    Patel, Sankita
    Sonar, Mitali
    Jinwala, Devesh C.
    INTERNATIONAL JOURNAL OF DISTRIBUTED SYSTEMS AND TECHNOLOGIES, 2014, 5 (02) : 44 - 70
  • [42] Empirical Evaluation of K-Means, Bisecting K-Means, Fuzzy C-Means and Genetic K-Means Clustering Algorithms
    Banerjee, Shreya
    Choudhary, Ankit
    Pal, Somnath
    2015 IEEE INTERNATIONAL WIE CONFERENCE ON ELECTRICAL AND COMPUTER ENGINEERING (WIECON-ECE), 2015, : 172 - 176
  • [43] Federated fuzzy k-means for privacy-preserving behavior analysis in smart grids
    Wang, Yi
    Ma, Jiahao
    Gao, Ning
    Wen, Qingsong
    Sun, Liang
    Guo, Hongye
    APPLIED ENERGY, 2023, 331
  • [44] Drug Audit Based on Bisecting K-means Clustering Algorithm
    Tao, Yingjuan
    Deng, Jinsheng
    Song, Xingshen
    2019 INTERNATIONAL CONFERENCE ON CYBER-ENABLED DISTRIBUTED COMPUTING AND KNOWLEDGE DISCOVERY (CYBERC), 2019, : 265 - 270
  • [45] Enhanced bisecting k-means clustering using intermediate cooperation
    Kashef, R.
    Kamel, M. S.
    PATTERN RECOGNITION, 2009, 42 (11) : 2557 - 2569
  • [46] A Distributed Anonymization Scheme for Privacy-preserving Recommendation Systems
    Luo, Zhifeng
    Chen, Shuhong
    Li, Yutian
    PROCEEDINGS OF 2013 IEEE 4TH INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING AND SERVICE SCIENCE (ICSESS), 2012, : 491 - 494
  • [47] Privacy-preserving hierarchical-k-means clustering on horizontally partitioned data
    Xue, Anrong
    Jiang, Dongjie
    Ju, Shiguang
    Chen, Weihe
    Ma, Handa
    INTERNATIONAL SYMPOSIUM ON ADVANCES IN COMPUTER AND SENSOR NETWORKS AND SYSTEMS, PROCEEDINGS: IN CELEBRATION OF 60TH BIRTHDAY OF PROF. S. SITHARAMA IYENGAR FOR HIS CONTRIBUTIONS TO THE SCIENCE OF COMPUTING, 2008, : 453 - 459
  • [48] An Efficient Approach for Privacy Preserving Distributed K-Means Clustering in Unsecured Environment
    Shewale, Amit
    Keshavamurthy, B. N.
    Modi, Chirag N.
    RECENT FINDINGS IN INTELLIGENT COMPUTING TECHNIQUES, VOL 1, 2019, 707 : 425 - 431
  • [49] Decentralized and Scalable Privacy-Preserving Authentication Scheme in VANETs
    Tangade, Shrikant
    Manvi, Sunilkumar S.
    Lorenz, Pascal
    IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2018, 67 (09) : 8647 - 8655
  • [50] Privacy Preserving Distributed Cell-based K-means Clustering Algorithm
    Su, Fang
    Zu, Yun-xiao
    Li, Wei-hai
    INTERNATIONAL CONFERENCE ON MATHEMATICS, MODELLING AND SIMULATION TECHNOLOGIES AND APPLICATIONS (MMSTA 2017), 2017, 215 : 377 - 383