A Reliable Application of MPC for Securing the Tri-Training Algorithm

被引:0
|
作者
Kurniawan, Hendra [1 ]
Mambo, Masahiro [2 ]
机构
[1] Kanazawa Univ, Grad Sch Nat Sci & Technol, Kanazawa 9201192, Japan
[2] Kanazawa Univ, Inst Sci & Engn, Kanazawa 9201192, Japan
关键词
Data models; Distributed databases; Data privacy; Classification algorithms; Computational modeling; Semisupervised learning; Data mining; Distributed data mining; multi-party computation; privacy-preserving; semi-supervised learning; tri-training; MULTIPARTY COMPUTATION; CLASSIFICATION; CARE;
D O I
10.1109/ACCESS.2023.3264903
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Due to the widespread use of distributed data mining techniques in a variety of areas, the issue of protecting the privacy of sensitive data has received increasing attention in recent years. Privacy-preserving distributed data mining (PPDDM) focuses on decentralized data analysis without the disclosure of sensitive information from data owner. However, the previous PPDDM mostly works on a limited amount of labeled data. In contrast to the real world, unlabeled data is abundance and labeled data is scarce. The objectives of this paper are to study and to analyze privacy-preserving properties of semi-supervised learning (SSL) algorithm with the combination of labeled and unlabeled data, where data is distributed among multiple data owners. In this paper we propose a Privacy-preserving Distributed Data Mining (PPDDM) method by designing a reliable application of secure MPC to semi-supervised tri-training algorithms. We simulate the original tri-training algorithm and tri-training algorithm with secure MPC using a different types of classifiers and datasets. The simulation results show that tri-training in secure MPC has almost same accuracy compared to original tri-training algorithm. We also compare execution time in addition to performance evaluation of tri-training in secure and the original tri-training algorithms.
引用
收藏
页码:34718 / 34735
页数:18
相关论文
共 50 条
  • [21] 基于Tri-training的半监督SVM
    李昆仑
    张伟
    代运娜
    计算机工程与应用, 2009, 45 (22) : 103 - 106
  • [22] 基于特征变换的Tri-Training算法
    赵文亮
    郭华平
    范明
    计算机工程, 2014, 40 (05) : 183 - 187+191
  • [23] An Improved Social Spammer Detection Based on Tri-training
    Xu, Guangxia
    Zhao, Jingteng
    Huang, Deling
    2016 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2016, : 4040 - 4042
  • [24] Tri-Training for authorship attribution with limited training data: a comprehensive study
    Qian, Tieyun
    Liu, Bing
    Chen, Li
    Peng, Zhiyong
    Zhong, Ming
    He, Guoliang
    Li, Xuhui
    Xu, Gang
    NEUROCOMPUTING, 2016, 171 : 798 - 806
  • [25] Biomedical Named Entity Recognition with Tri-training learning
    Cai, YueHong
    Cheng, XianYi
    PROCEEDINGS OF THE 2009 2ND INTERNATIONAL CONFERENCE ON BIOMEDICAL ENGINEERING AND INFORMATICS, VOLS 1-4, 2009, : 2178 - +
  • [26] Semi-supervised patent text classification method based on improved Tri-training algorithm
    Hu Y.-Q.
    Qiu Q.-Y.
    Yu X.
    Wu J.-W.
    Zhejiang Daxue Xuebao (Gongxue Ban)/Journal of Zhejiang University (Engineering Science), 2020, 54 (02): : 331 - 339
  • [27] 基于Tri-training的主动学习算法
    张雁
    吴保国
    吕丹桔
    林英
    计算机工程, 2014, 40 (06) : 215 - 218+229
  • [28] Web Spam Detection Based on Improved Tri-training
    Li, Hailong
    PROCEEDINGS OF 2014 IEEE INTERNATIONAL CONFERENCE ON PROGRESS IN INFORMATICS AND COMPUTING (PIC), 2014, : 61 - 65
  • [29] Multi-Source Tri-Training Transfer Learning
    Cheng, Yuhu
    Wang, Xuesong
    Cao, Ge
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2014, E97D (06): : 1668 - 1672
  • [30] Semi-supervised active learning image classification method based on Tri-Training algorithm
    Zhang, Yongjun
    Yan, Siyu
    PROCEEDINGS OF 2020 IEEE INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND INFORMATION SYSTEMS (ICAIIS), 2020, : 206 - 210