A Reliable Application of MPC for Securing the Tri-Training Algorithm

被引:0
|
作者
Kurniawan, Hendra [1 ]
Mambo, Masahiro [2 ]
机构
[1] Kanazawa Univ, Grad Sch Nat Sci & Technol, Kanazawa 9201192, Japan
[2] Kanazawa Univ, Inst Sci & Engn, Kanazawa 9201192, Japan
关键词
Data models; Distributed databases; Data privacy; Classification algorithms; Computational modeling; Semisupervised learning; Data mining; Distributed data mining; multi-party computation; privacy-preserving; semi-supervised learning; tri-training; MULTIPARTY COMPUTATION; CLASSIFICATION; CARE;
D O I
10.1109/ACCESS.2023.3264903
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Due to the widespread use of distributed data mining techniques in a variety of areas, the issue of protecting the privacy of sensitive data has received increasing attention in recent years. Privacy-preserving distributed data mining (PPDDM) focuses on decentralized data analysis without the disclosure of sensitive information from data owner. However, the previous PPDDM mostly works on a limited amount of labeled data. In contrast to the real world, unlabeled data is abundance and labeled data is scarce. The objectives of this paper are to study and to analyze privacy-preserving properties of semi-supervised learning (SSL) algorithm with the combination of labeled and unlabeled data, where data is distributed among multiple data owners. In this paper we propose a Privacy-preserving Distributed Data Mining (PPDDM) method by designing a reliable application of secure MPC to semi-supervised tri-training algorithms. We simulate the original tri-training algorithm and tri-training algorithm with secure MPC using a different types of classifiers and datasets. The simulation results show that tri-training in secure MPC has almost same accuracy compared to original tri-training algorithm. We also compare execution time in addition to performance evaluation of tri-training in secure and the original tri-training algorithms.
引用
收藏
页码:34718 / 34735
页数:18
相关论文
共 50 条
  • [31] 基于Tri-Training算法的数据编辑技术
    张雁
    林英
    吕丹桔
    计算机与数字工程, 2013, 41 (10) : 1583 - 1585
  • [32] 基于Tri-Training的驾驶风格分类算法
    董昊旻
    张维轩
    王文彬
    何云廷
    康子怡
    汽车技术, 2021, (04) : 6 - 11
  • [33] Deep Tri-Training for Semi-Supervised Image Segmentation
    An, Shan
    Zhu, Haogang
    Zhang, Jiaao
    Ye, Junjie
    Wang, Siliang
    Yin, Jianqin
    Zhang, Hong
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2022, 7 (04) : 10097 - 10104
  • [34] A Robust Random Forest-based Tri-Training Algorithm for Early In-trouble Student Prediction
    Vo Thi Ngoc Chau
    Nguyen Hua Phung
    2017 4TH NAFOSTED CONFERENCE ON INFORMATION AND COMPUTER SCIENCE (NICS), 2017, : 84 - 89
  • [35] Weighted fuzzy rough sets-based tri-training and its application to medical diagnosis
    Xing, Jinming
    Gao, Can
    Zhou, Jie
    APPLIED SOFT COMPUTING, 2022, 124
  • [36] Boosted Web Named Entity Recognition via Tri-Training
    Chou, Chien-Lung
    Chang, Chia-Hui
    Huang, Ya-Yun
    ACM TRANSACTIONS ON ASIAN AND LOW-RESOURCE LANGUAGE INFORMATION PROCESSING, 2016, 16 (02)
  • [37] Tri-training algorithm based on cross entropy and K-nearest neighbors for network intrusion detection
    Zhao, Jia
    Li, Song
    Wu, Runxiu
    Zhang, Yiying
    Zhang, Bo
    Han, Longzhe
    KSII TRANSACTIONS ON INTERNET AND INFORMATION SYSTEMS, 2022, 16 (12): : 3889 - 3903
  • [38] Improved tri-training method for identifying user abnormal behavior based on adaptive golden jackal algorithm
    Wang, Kun
    Gao, Jinggeng
    Kang, Xiaohua
    Li, Huan
    AIP ADVANCES, 2023, 13 (03)
  • [39] Tri-training and MapReduce-based massive data learning
    Guo, Mao-Zu
    Deng, Chao
    Liu, Yang
    Li, Ping
    INTERNATIONAL JOURNAL OF GENERAL SYSTEMS, 2011, 40 (04) : 355 - 380
  • [40] 基于交叉熵的安全Tri-training算法
    张永
    陈蓉蓉
    张晶
    计算机研究与发展, 2021, (01) : 60 - 69