Privacy preserving k-means clustering in multi-party environment

被引:21
|
作者
Samet, Saeed [1 ]
Miri, Ali [1 ]
Orozco-Barbosa, Luis [2 ]
机构
[1] Univ Ottawa, Sch Informat Technol & Engn, Ottawa, ON K1N 6N5, Canada
[2] Univ Castilla La Mancha, Inst Invest Informat, Albacete 02071, Spain
关键词
data mining; clustering; classification; and association rules; mining methods and algorithms; security and privacy protection; distributed data structures;
D O I
10.5220/0002121703810385
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Extracting meaningful and valuable knowledge from databases is often done by various data mining algorithms. Nowadays, databases are distributed among two or more parties because of different reasons such as physical and geographical restrictions and the most important issue is privacy. Related data is normally maintained by more than one organization, each of which wants to keep its individual information private. Thus, privacy-preserving techniques and protocols are designed to perform data mining on distributed environments when privacy is highly concerned. Cluster analysis is a technique in data mining, by which data can be divided into some meaningful clusters, and it has an important role in different fields such as bio-informatics, marketing, machine learning, climate and medicine. k-means Clustering is a prominent algorithm in this category which creates a one-level clustering of data. In this paper we introduce privacy-preserving protocols for this algorithm, along with a protocol for Secure comparison, known as the Millionaires' Problem, as a sub-protocol, to handle the clustering of horizontally or vertically partitioned data among two or more parties.
引用
收藏
页码:381 / +
页数:2
相关论文
共 50 条
  • [21] Importance of Data Standardization in Privacy-Preserving K-Means Clustering
    Su, Chunhua
    Zhan, Justin
    Sakurai, Kouichi
    DATABASE SYSTEMS FOR ADVANCED APPLICATIONS, 2009, 5667 : 276 - +
  • [22] K-Means Clustering With Local dχ-Privacy for Privacy-Preserving Data Analysis
    Yang, Mengmeng
    Tjuawinata, Ivan
    Lam, Kwok-Yan
    IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, 2022, 17 : 2524 - 2537
  • [23] Data privacy protection in multi-party clustering
    Yang, Weijia
    Huang, Shangteng
    DATA & KNOWLEDGE ENGINEERING, 2008, 67 (01) : 185 - 199
  • [24] General-purpose multi-user privacy-preserving outsourced k-means clustering
    Ye, Jun
    Hu, Zhaowang
    Zhang, Zhengqi
    JOURNAL OF INFORMATION SECURITY AND APPLICATIONS, 2025, 89
  • [25] Privacy preserving multi-party decision tree induction
    Zhan, JZ
    Chang, LW
    Matwin, S
    RESEARCH DIRECTIONS IN DATA AND APPLICATIONS SECURITY XVIII, 2004, 144 : 341 - 355
  • [26] Efficient privacy-preserving outsourced k-means clustering on distributed data
    Qiu, Guowei
    Zhao, Yingliang
    Gui, Xiaolin
    INFORMATION SCIENCES, 2024, 674
  • [27] Privacy-preserving kernel k-means clustering outsourcing with random transformation
    Keng-Pei Lin
    Knowledge and Information Systems, 2016, 49 : 885 - 908
  • [28] Efficient and Privacy-Preserving k-means clustering For Big Data Mining
    Gheid, Zakaria
    Challal, Yacine
    2016 IEEE TRUSTCOM/BIGDATASE/ISPA, 2016, : 791 - 798
  • [29] Privacy-preserving kernel k-means clustering outsourcing with random transformation
    Lin, Keng-Pei
    KNOWLEDGE AND INFORMATION SYSTEMS, 2016, 49 (03) : 885 - 908
  • [30] Privacy Preserving Distributed Cell-based K-means Clustering Algorithm
    Su, Fang
    Zu, Yun-xiao
    Li, Wei-hai
    INTERNATIONAL CONFERENCE ON MATHEMATICS, MODELLING AND SIMULATION TECHNOLOGIES AND APPLICATIONS (MMSTA 2017), 2017, 215 : 377 - 383