Privacy preserving k-means clustering in multi-party environment

被引：21

作者：

Samet, Saeed ^{[1
]}

Miri, Ali ^{[1
]}

Orozco-Barbosa, Luis ^{[2
]}

机构：

[1] Univ Ottawa, Sch Informat Technol & Engn, Ottawa, ON K1N 6N5, Canada

[2] Univ Castilla La Mancha, Inst Invest Informat, Albacete 02071, Spain

来源：

SECRYPT 2007: PROCEEDINGS OF THE SECOND INTERNATIONAL CONFERENCE ON SECURITY AND CRYPTOGRAPHY | 2007年

关键词：

data mining; clustering; classification; and association rules; mining methods and algorithms; security and privacy protection; distributed data structures;

D O I：

10.5220/0002121703810385

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Extracting meaningful and valuable knowledge from databases is often done by various data mining algorithms. Nowadays, databases are distributed among two or more parties because of different reasons such as physical and geographical restrictions and the most important issue is privacy. Related data is normally maintained by more than one organization, each of which wants to keep its individual information private. Thus, privacy-preserving techniques and protocols are designed to perform data mining on distributed environments when privacy is highly concerned. Cluster analysis is a technique in data mining, by which data can be divided into some meaningful clusters, and it has an important role in different fields such as bio-informatics, marketing, machine learning, climate and medicine. k-means Clustering is a prominent algorithm in this category which creates a one-level clustering of data. In this paper we introduce privacy-preserving protocols for this algorithm, along with a protocol for Secure comparison, known as the Millionaires' Problem, as a sub-protocol, to handle the clustering of horizontally or vertically partitioned data among two or more parties.

引用

页码：381 / +

页数：2

共 50 条

[21] Importance of Data Standardization in Privacy-Preserving K-Means Clustering
Su, Chunhua
Zhan, Justin
Sakurai, Kouichi
DATABASE SYSTEMS FOR ADVANCED APPLICATIONS, 2009, 5667 : 276 - +
[22] K-Means Clustering With Local dχ-Privacy for Privacy-Preserving Data Analysis
Yang, Mengmeng
Tjuawinata, Ivan
Lam, Kwok-Yan
IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, 2022, 17 : 2524 - 2537
[23] Data privacy protection in multi-party clustering
Yang, Weijia
Huang, Shangteng
DATA & KNOWLEDGE ENGINEERING, 2008, 67 (01) : 185 - 199
[24] General-purpose multi-user privacy-preserving outsourced k-means clustering
Ye, Jun
Hu, Zhaowang
Zhang, Zhengqi
JOURNAL OF INFORMATION SECURITY AND APPLICATIONS, 2025, 89
[25] Privacy preserving multi-party decision tree induction
Zhan, JZ
Chang, LW
Matwin, S
RESEARCH DIRECTIONS IN DATA AND APPLICATIONS SECURITY XVIII, 2004, 144 : 341 - 355
[26] Efficient privacy-preserving outsourced k-means clustering on distributed data
Qiu, Guowei
Zhao, Yingliang
Gui, Xiaolin
INFORMATION SCIENCES, 2024, 674
[27] Privacy-preserving kernel k-means clustering outsourcing with random transformation
Keng-Pei Lin
Knowledge and Information Systems, 2016, 49 : 885 - 908
[28] Efficient and Privacy-Preserving k-means clustering For Big Data Mining
Gheid, Zakaria
Challal, Yacine
2016 IEEE TRUSTCOM/BIGDATASE/ISPA, 2016, : 791 - 798
[29] Privacy-preserving kernel k-means clustering outsourcing with random transformation
Lin, Keng-Pei
KNOWLEDGE AND INFORMATION SYSTEMS, 2016, 49 (03) : 885 - 908
[30] Privacy Preserving Distributed Cell-based K-means Clustering Algorithm
Su, Fang
Zu, Yun-xiao
Li, Wei-hai
INTERNATIONAL CONFERENCE ON MATHEMATICS, MODELLING AND SIMULATION TECHNOLOGIES AND APPLICATIONS (MMSTA 2017), 2017, 215 : 377 - 383

← 1 2 3 4 5 →