FDPBoost: Federated differential privacy gradient boosting decision trees

被引:3
|
作者
Li, Yingjie [1 ]
Feng, Yan [1 ]
Qian, Quan [1 ,2 ,3 ,4 ,5 ]
机构
[1] Shanghai Univ, Sch Comp Engn & Sci, Shanghai 200444, Peoples R China
[2] Shanghai Univ, Mat Genome Inst, Ctr Mat Informat & Data Sci, Shanghai 200444, Peoples R China
[3] Shanghai Univ, Key Lab Silicate Cultural Rel Conservat, Minist Educ, Shanghai, Peoples R China
[4] Zhejiang Lab, Hangzhou 311100, Zhejiang, Peoples R China
[5] Shanghai Frontier Sci Ctr Mechanoinformat, Shanghai 200444, Peoples R China
关键词
Federated learning; Differential privacy; Gradient boosting decision tree; Distributed two-level boosting framework;
D O I
10.1016/j.jisa.2023.103468
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The big data era has led to an exponential increase in data usage, resulting in significantly advancements in data-driven domains and data mining. However, due to privacy and regulatory requirements, sharing data among various institutions is not always possible. Federated learning can help address this problem, but existing studies that combine differential privacy with tree models have shown significant accuracy loss. In this study, we propose a Federated Differential Privacy Gradient Boosting Decision Tree (FDPBoost) that protects the private datasets of different owners while improving model accuracy. We select sensitive features according to the secure feature set indicator, and use an exponential mechanism to protect sensitive features and assign significant weight to the Laplace mechanism to protect leaf node values. Additionally, a distributed two -level boosting framework is designed to allocate the privacy budget between intra-iteration and inter-iteration decision trees while protecting model communication. The FDPBoost is tested on five datasets sourced from the materials and medical domains. Our experiments reveal that FDPBoost achieves competitive accuracy with traditional federated gradient boosting decision trees while also exhibiting a significant reduction in error rate as compared to PPGBDT (Zhao et al.) and FV-tree (Gao et al.). Notably, FDPBoost's error rate on the tumor-diagnosis dataset is 30% lower than that of FV-tree.
引用
收藏
页数:11
相关论文
共 50 条
  • [1] Practical Federated Gradient Boosting Decision Trees
    Li, Qinbin
    Wen, Zeyi
    He, Bingsheng
    THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 4642 - 4649
  • [2] Secure and Efficient Federated Gradient Boosting Decision Trees
    Zhao, Xue
    Li, Xiaohui
    Sun, Shuang
    Jia, Xu
    APPLIED SCIENCES-BASEL, 2023, 13 (07):
  • [3] Privacy-Preserving Gradient Boosting Decision Trees
    Li, Qinbin
    Wu, Zhaomin
    Wen, Zeyi
    He, Bingsheng
    THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 784 - 791
  • [4] eFL-Boost: Efficient Federated Learning for Gradient Boosting Decision Trees
    Yamamoto, Fuki
    Ozawa, Seiichi
    Wang, Lihua
    IEEE ACCESS, 2022, 10 : 43954 - 43963
  • [5] Towards Fair and Decentralized Federated Learning System for Gradient Boosting Decision Trees
    Gao, Shiqi
    Li, Xianxian
    Shi, Zhenkui
    Liu, Peng
    Li, Chunpei
    SECURITY AND COMMUNICATION NETWORKS, 2022, 2022
  • [6] Machine Unlearning in Gradient Boosting Decision Trees
    Lin, Huawei
    Chung, Jun Woo
    Lao, Yingjie
    Zhao, Weijie
    PROCEEDINGS OF THE 29TH ACM SIGKDD CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, KDD 2023, 2023, : 1374 - 1383
  • [7] Label Aggregation of Gradient Boosting Decision Trees
    Xiang, X. C.
    Zhang, H. X.
    Xia, S. T.
    PROCEEDINGS OF 2020 2ND INTERNATIONAL CONFERENCE ON IMAGE PROCESSING AND MACHINE VISION AND INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION AND MACHINE LEARNING, IPMV 2020, 2020, : 140 - 145
  • [8] FPGA Accelerator for Gradient Boosting Decision Trees
    Alcolea, Adrian
    Resano, Javier
    ELECTRONICS, 2021, 10 (03) : 1 - 15
  • [9] On Incremental Learning for Gradient Boosting Decision Trees
    Zhang, Chongsheng
    Zhang, Yuan
    Shi, Xianjin
    Almpanidis, George
    Fan, Gaojuan
    Shen, Xiajiong
    NEURAL PROCESSING LETTERS, 2019, 50 (01) : 957 - 987
  • [10] Gradient Boosting Decision Trees for Echocardiogram Images
    de Melo, Vinicius Veloso
    Ushizima, Daniela Mayumi
    Baracho, Salety Ferreira
    Coelho, Regina Celia
    2018 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2018,