Privacy preserving k-means clustering in multi-party environment

Cited by: 21
Authors
Samet, Saeed [1 ]
Miri, Ali [1 ]
Orozco-Barbosa, Luis [2 ]
Affiliations
[1] Univ Ottawa, Sch Informat Technol & Engn, Ottawa, ON K1N 6N5, Canada
[2] Univ Castilla La Mancha, Inst Invest Informat, Albacete 02071, Spain
Keywords
data mining; clustering, classification, and association rules; mining methods and algorithms; security and privacy protection; distributed data structures
DOI
10.5220/0002121703810385
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification code
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Various data mining algorithms are used to extract meaningful and valuable knowledge from databases. Databases are now often distributed among two or more parties, for reasons such as physical and geographical restrictions and, most importantly, privacy. Related data is normally maintained by more than one organization, each of which wants to keep its individual information private. Privacy-preserving techniques and protocols are therefore designed to perform data mining in distributed environments where privacy is a major concern. Cluster analysis is a data mining technique that divides data into meaningful clusters, and it plays an important role in fields such as bioinformatics, marketing, machine learning, climate, and medicine. k-means clustering is a prominent algorithm in this category; it creates a one-level clustering of the data. In this paper we introduce privacy-preserving protocols for this algorithm, along with a protocol for secure comparison, known as the Millionaires' Problem, as a sub-protocol, to handle the clustering of horizontally or vertically partitioned data among two or more parties.
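For orientation, here is a minimal Python sketch of one k-means update over horizontally partitioned data, where each party holds its own rows. It is not the protocol from the paper: it swaps in a generic additive secret-sharing sum so that only aggregated per-cluster sums and counts are revealed, and it does not implement the secure comparison (Millionaires' Problem) sub-protocol. All names (`share`, `secure_sum`, `local_stats`, `kmeans_round`) and the constants `PRIME` and `SCALE` are illustrative assumptions, and the whole exchange is simulated in a single process.

```python
import random

PRIME = 2**61 - 1  # assumed modulus for additive shares
SCALE = 1000       # assumed fixed-point scale for real-valued coordinates


def share(value, n_parties, rng):
    """Split an integer into n_parties additive shares modulo PRIME."""
    shares = [rng.randrange(PRIME) for _ in range(n_parties - 1)]
    shares.append((value - sum(shares)) % PRIME)
    return shares


def secure_sum(values, rng):
    """Sum one private integer per party; only the total is reconstructed."""
    n = len(values)
    # Row i: shares dealt by party i. Column j: shares received by party j.
    dealt = [share(v % PRIME, n, rng) for v in values]
    partials = [sum(dealt[i][j] for i in range(n)) % PRIME for j in range(n)]
    total = sum(partials) % PRIME
    # Map the residue back to a signed integer.
    return total - PRIME if total > PRIME // 2 else total


def nearest(point, centroids):
    """Index of the centroid closest to point (squared Euclidean distance)."""
    return min(
        range(len(centroids)),
        key=lambda c: sum((p - q) ** 2 for p, q in zip(point, centroids[c])),
    )


def local_stats(points, centroids):
    """Per-cluster fixed-point coordinate sums and counts for one party's rows."""
    k, d = len(centroids), len(centroids[0])
    sums = [[0] * d for _ in range(k)]
    counts = [0] * k
    for pt in points:
        c = nearest(pt, centroids)
        counts[c] += 1
        for j in range(d):
            sums[c][j] += round(pt[j] * SCALE)
    return sums, counts


def kmeans_round(parties, centroids, rng):
    """One centroid update; parties contribute only secret-shared aggregates."""
    stats = [local_stats(points, centroids) for points in parties]
    dims = len(centroids[0])
    updated = []
    for c in range(len(centroids)):
        count = secure_sum([s[1][c] for s in stats], rng)
        if count == 0:
            updated.append(list(centroids[c]))  # keep an empty cluster's centroid
            continue
        updated.append([
            secure_sum([s[0][c][j] for s in stats], rng) / (SCALE * count)
            for j in range(dims)
        ])
    return updated


# Three hypothetical parties, each holding its own rows of the same schema.
parties = [[(0.0, 0.0), (0.0, 2.0)], [(10.0, 10.0)], [(10.0, 12.0), (2.0, 0.0)]]
updated = kmeans_round(parties, [(1.0, 1.0), (9.0, 9.0)], random.Random(0))
```

The sketch still reveals the aggregated cluster sums and counts in every round, and `random.Random` is not a cryptographic generator; a real deployment would need a secure randomness source and a protocol with a stated threat model.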
Pages: 381 / +
Number of pages: 2
Related papers
50 in total
  • [31] A reversible privacy-preserving clustering technique based on k-means algorithm
    Lin, Chen-Yi
    APPLIED SOFT COMPUTING, 2020, 87
  • [32] Security and Correctness Analysis on Privacy-Preserving k-Means Clustering Schemes
    Su, Chunhua
    Bao, Feng
    Zhou, Jianying
    Takagi, Tsuyoshi
    Sakurai, Kouichi
    IEICE TRANSACTIONS ON FUNDAMENTALS OF ELECTRONICS COMMUNICATIONS AND COMPUTER SCIENCES, 2009, E92A (04) : 1246 - 1250
  • [33] Privacy-Preserving k-means Clustering: an Application to Driving Style Recognition
    El Omri, Othmane
    Boudguiga, Aymen
    Izabachene, Malika
    Klaudel, Witold
    NETWORK AND SYSTEM SECURITY, NSS 2019, 2019, 11928 : 685 - 696
  • [34] Outsourced and Privacy-Preserving K-means Clustering Scheme for Smart Grid
    Shen, Xielin
    Yuan, Bo
    Peng, Weiwen
    Qian, Yuanquan
    Wu, Yonghua
    2022 IEEE 10TH INTERNATIONAL CONFERENCE ON INFORMATION, COMMUNICATION AND NETWORKS (ICICN 2022), 2022, : 307 - 313
  • [35] Privacy-Preserving Hybrid K-Means
    Gao, Zhiqiang
    Sun, Yixiao
    Cui, Xiaolong
    Wang, Yutao
    Duan, Yanyu
    Wang, Xu An
    INTERNATIONAL JOURNAL OF DATA WAREHOUSING AND MINING, 2018, 14 (02) : 1 - 17
  • [36] Efficient and Scalable Multi-party Privacy-Preserving k-NN Classification
    Li, Xinglei
    Qian, Haifeng
    SECURITY AND PRIVACY IN COMMUNICATION NETWORKS, PT II, SECURECOMM 2023, 2025, 568 : 266 - 286
  • [37] Multi-layer Topology Preserving Mapping for K-Means Clustering
    Wu, Ying
    Doyle, Thomas K.
    Fyfe, Colin
    INTELLIGENT DATA ENGINEERING AND AUTOMATED LEARNING - IDEAL 2011, 2011, 6936 : 84 - +
  • [38] Clustering-Based Scalable Indexing for Multi-party Privacy-Preserving Record Linkage
    Ranbaduge, Thilina
    Vatsalan, Dinusha
    Christen, Peter
    ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PART II, 2015, 9078 : 549 - 561
  • [39] K-Means Clustering with Local Distance Privacy
    Yang, Mengmeng
    Huang, Longxia
    Tang, Chenghua
    BIG DATA MINING AND ANALYTICS, 2023, 6 (04) : 433 - 442
  • [40] Locality Preserving Based K-Means Clustering
    Yang, Xiaohuan
    Wang, Xiaoming
    Tian, Yong
    Du, Yajun
    INTELLIGENCE SCIENCE AND BIG DATA ENGINEERING: BIG DATA AND MACHINE LEARNING TECHNIQUES, ISCIDE 2015, PT II, 2015, 9243 : 86 - 95