A Reliable Application of MPC for Securing the Tri-Training Algorithm

被引:0
|
作者
Kurniawan, Hendra [1 ]
Mambo, Masahiro [2 ]
机构
[1] Kanazawa Univ, Grad Sch Nat Sci & Technol, Kanazawa 9201192, Japan
[2] Kanazawa Univ, Inst Sci & Engn, Kanazawa 9201192, Japan
关键词
Data models; Distributed databases; Data privacy; Classification algorithms; Computational modeling; Semisupervised learning; Data mining; Distributed data mining; multi-party computation; privacy-preserving; semi-supervised learning; tri-training; MULTIPARTY COMPUTATION; CLASSIFICATION; CARE;
D O I
10.1109/ACCESS.2023.3264903
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Due to the widespread use of distributed data mining techniques in a variety of areas, the issue of protecting the privacy of sensitive data has received increasing attention in recent years. Privacy-preserving distributed data mining (PPDDM) focuses on decentralized data analysis without the disclosure of sensitive information from data owner. However, the previous PPDDM mostly works on a limited amount of labeled data. In contrast to the real world, unlabeled data is abundance and labeled data is scarce. The objectives of this paper are to study and to analyze privacy-preserving properties of semi-supervised learning (SSL) algorithm with the combination of labeled and unlabeled data, where data is distributed among multiple data owners. In this paper we propose a Privacy-preserving Distributed Data Mining (PPDDM) method by designing a reliable application of secure MPC to semi-supervised tri-training algorithms. We simulate the original tri-training algorithm and tri-training algorithm with secure MPC using a different types of classifiers and datasets. The simulation results show that tri-training in secure MPC has almost same accuracy compared to original tri-training algorithm. We also compare execution time in addition to performance evaluation of tri-training in secure and the original tri-training algorithms.
引用
收藏
页码:34718 / 34735
页数:18
相关论文
共 50 条
  • [41] Tri-training: Exploiting unlabeled data using three classifiers
    Zhou, ZH
    Li, M
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2005, 17 (11) : 1529 - 1541
  • [42] Tri-training based learning from positive and unlabeled data
    Zhang, Bangzuo
    Zuo, Wanli
    2008 INTERNATIONAL SYMPOSIUM ON INFORMATION PROCESSING AND 2008 INTERNATIONAL PACIFIC WORKSHOP ON WEB MINING AND WEB-BASED APPLICATION, 2008, : 640 - 644
  • [43] 基于Tri-training的评价单元识别
    蒋润
    顾春华
    阮彤
    计算机应用, 2014, 34 (04) : 1099 - 1104
  • [44] 基于Tri-training的入侵检测算法
    邬书跃
    余杰
    樊晓平
    计算机工程, 2012, 38 (06) : 158 - 160
  • [45] A Novel Semi-supervised SVM Based on Tri-training
    Li, KunLun
    Zhang, Wei
    Ma, Xiaotao
    Cao, Zheng
    Zhang, Chao
    2008 INTERNATIONAL SYMPOSIUM ON INTELLIGENT INFORMATION TECHNOLOGY APPLICATION, VOL III, PROCEEDINGS, 2008, : 47 - +
  • [46] 基于Tri-Training半监督分类算法的研究
    张雁
    吕丹桔
    吴保国
    计算机技术与发展, 2013, 23 (07) : 77 - 79+83
  • [47] 基于密度峰值聚类的Tri-training算法
    罗宇航
    吴润秀
    崔志华
    张翼英
    何业慎
    赵嘉
    系统仿真学报 , 2024, (05) : 1189 - 1198
  • [48] 基于Tri-training的多特征融合图像检索
    陈秀新
    郑雅
    于重重
    贾克斌
    计算机应用研究, 2014, 31 (11) : 3506 - 3509
  • [49] Mixup Asymmetric Tri-Training for Heartbeat Classification under Domain Shift
    Li, Jiawei
    Wang, Guijin
    Chen, Ming
    Ding, Zijian
    Yang, Huazhong
    IEEE Signal Processing Letters, 2021, 28 : 718 - 722
  • [50] Trust Prediction Based on Extreme Learning Machine and Asymmetric Tri-Training
    Wang, Yan
    Tong, Xiangrong
    IEEE ACCESS, 2021, 9 : 64358 - 64367