FDPBoost: Federated differential privacy gradient boosting decision trees

Cited by: 3
Authors
Li, Yingjie [1]
Feng, Yan [1]
Qian, Quan [1, 2, 3, 4, 5]
Affiliations
[1] Shanghai Univ, Sch Comp Engn & Sci, Shanghai 200444, Peoples R China
[2] Shanghai Univ, Mat Genome Inst, Ctr Mat Informat & Data Sci, Shanghai 200444, Peoples R China
[3] Shanghai Univ, Key Lab Silicate Cultural Rel Conservat, Minist Educ, Shanghai, Peoples R China
[4] Zhejiang Lab, Hangzhou 311100, Zhejiang, Peoples R China
[5] Shanghai Frontier Sci Ctr Mechanoinformat, Shanghai 200444, Peoples R China
Keywords
Federated learning; Differential privacy; Gradient boosting decision tree; Distributed two-level boosting framework
DOI
10.1016/j.jisa.2023.103468
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
The big data era has led to an exponential increase in data usage, resulting in significant advancements in data-driven domains and data mining. However, due to privacy and regulatory requirements, sharing data among various institutions is not always possible. Federated learning can help address this problem, but existing studies that combine differential privacy with tree models have shown significant accuracy loss. In this study, we propose a Federated Differential Privacy Gradient Boosting Decision Tree (FDPBoost) that protects the private datasets of different owners while improving model accuracy. We select sensitive features according to the secure feature set indicator, use an exponential mechanism to protect sensitive features, and assign significant weight to the Laplace mechanism to protect leaf node values. Additionally, a distributed two-level boosting framework is designed to allocate the privacy budget between intra-iteration and inter-iteration decision trees while protecting model communication. FDPBoost is tested on five datasets sourced from the materials and medical domains. Our experiments reveal that FDPBoost achieves accuracy competitive with traditional federated gradient boosting decision trees while also exhibiting a significant reduction in error rate compared to PPGBDT (Zhao et al.) and FV-tree (Gao et al.). Notably, FDPBoost's error rate on the tumor-diagnosis dataset is 30% lower than that of FV-tree.
Pages: 11