FDPBoost: Federated differential privacy gradient boosting decision trees

Cited by: 3
Authors
Li, Yingjie [1]
Feng, Yan [1]
Qian, Quan [1, 2, 3, 4, 5]
Affiliations
[1] Shanghai Univ, Sch Comp Engn & Sci, Shanghai 200444, Peoples R China
[2] Shanghai Univ, Mat Genome Inst, Ctr Mat Informat & Data Sci, Shanghai 200444, Peoples R China
[3] Shanghai Univ, Key Lab Silicate Cultural Rel Conservat, Minist Educ, Shanghai, Peoples R China
[4] Zhejiang Lab, Hangzhou 311100, Zhejiang, Peoples R China
[5] Shanghai Frontier Sci Ctr Mechanoinformat, Shanghai 200444, Peoples R China
Keywords
Federated learning; Differential privacy; Gradient boosting decision tree; Distributed two-level boosting framework
DOI
10.1016/j.jisa.2023.103468
Chinese Library Classification
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
The big data era has led to an exponential increase in data usage, resulting in significant advancements in data-driven domains and data mining. However, due to privacy and regulatory requirements, sharing data among institutions is not always possible. Federated learning can help address this problem, but existing studies that combine differential privacy with tree models have shown significant accuracy loss. In this study, we propose a Federated Differential Privacy Gradient Boosting Decision Tree (FDPBoost) that protects the private datasets of different owners while improving model accuracy. We select sensitive features according to the secure feature set indicator, use an exponential mechanism to protect sensitive features, and assign significant weight to the Laplace mechanism to protect leaf node values. Additionally, a distributed two-level boosting framework is designed to allocate the privacy budget between intra-iteration and inter-iteration decision trees while protecting model communication. FDPBoost is tested on five datasets sourced from the materials and medical domains. Our experiments show that FDPBoost achieves accuracy competitive with traditional federated gradient boosting decision trees while significantly reducing the error rate compared to PPGBDT (Zhao et al.) and FV-tree (Gao et al.). Notably, FDPBoost's error rate on the tumor-diagnosis dataset is 30% lower than that of FV-tree.
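The two differential privacy primitives named in the abstract can be illustrated with a minimal Python sketch. This is not the authors' implementation: the function names, the `leaf_fraction` parameter, and the budget split are hypothetical and only show the textbook Laplace mechanism (noise added to a leaf value), the textbook exponential mechanism (a randomized choice among candidates by utility), and one simple way of giving the leaf-value step a larger share of a privacy budget.

```python
import math
import random


def laplace_noise(scale, rng):
    # The difference of two i.i.d. Exp(1) draws is Laplace(0, 1); scale it.
    return scale * (rng.expovariate(1.0) - rng.expovariate(1.0))


def noisy_leaf_value(leaf_value, sensitivity, epsilon, rng):
    # Laplace mechanism: noise scale is sensitivity / epsilon.
    return leaf_value + laplace_noise(sensitivity / epsilon, rng)


def exponential_mechanism_probs(utilities, sensitivity, epsilon):
    # Exponential mechanism: P(i) is proportional to
    # exp(epsilon * u_i / (2 * sensitivity)).
    # Subtracting the maximum utility keeps exp() numerically stable.
    top = max(utilities)
    weights = [math.exp(epsilon * (u - top) / (2.0 * sensitivity))
               for u in utilities]
    total = sum(weights)
    return [w / total for w in weights]


def choose_candidate(utilities, sensitivity, epsilon, rng):
    # Sample one candidate index according to the mechanism's probabilities.
    probs = exponential_mechanism_probs(utilities, sensitivity, epsilon)
    return rng.choices(range(len(utilities)), weights=probs, k=1)[0]


def split_budget(total_epsilon, leaf_fraction):
    # Hypothetical allocation: leaf_fraction of the budget goes to the
    # Laplace step on leaf values, the remainder to candidate selection.
    leaf_epsilon = total_epsilon * leaf_fraction
    return total_epsilon - leaf_epsilon, leaf_epsilon


if __name__ == "__main__":
    rng = random.Random(0)
    select_eps, leaf_eps = split_budget(1.0, 0.75)
    gains = [0.2, 1.5, 0.9]  # made-up utilities for three candidates
    idx = choose_candidate(gains, sensitivity=1.0, epsilon=select_eps, rng=rng)
    leaf = noisy_leaf_value(0.4, sensitivity=1.0, epsilon=leaf_eps, rng=rng)
    print(idx, leaf)
```

A larger `leaf_fraction` means less noise on leaf values and a more random candidate choice; how FDPBoost actually balances the two is described in the paper itself, not here.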
Pages: 11