FDPBoost: Federated differential privacy gradient boosting decision trees

Cited by: 3

Authors:

Li, Yingjie [1]

Feng, Yan [1]

Qian, Quan [1,2,3,4,5]

Affiliations:

[1] Shanghai Univ, Sch Comp Engn & Sci, Shanghai 200444, Peoples R China

[2] Shanghai Univ, Mat Genome Inst, Ctr Mat Informat & Data Sci, Shanghai 200444, Peoples R China

[3] Shanghai Univ, Key Lab Silicate Cultural Rel Conservat, Minist Educ, Shanghai, Peoples R China

[4] Zhejiang Lab, Hangzhou 311100, Zhejiang, Peoples R China

[5] Shanghai Frontier Sci Ctr Mechanoinformat, Shanghai 200444, Peoples R China

Source:

JOURNAL OF INFORMATION SECURITY AND APPLICATIONS | 2023 / Vol. 74

Keywords:

Federated learning; Differential privacy; Gradient boosting decision tree; Distributed two-level boosting framework;

DOI:

10.1016/j.jisa.2023.103468

Chinese Library Classification (CLC) number:

TP [Automation technology, computer technology];

Discipline classification code:

0812;

Abstract:

The big data era has led to an exponential increase in data usage, resulting in significant advancements in data-driven domains and data mining. However, due to privacy and regulatory requirements, sharing data among institutions is not always possible. Federated learning can help address this problem, but existing studies that combine differential privacy with tree models have shown significant accuracy loss. In this study, we propose a Federated Differential Privacy Gradient Boosting Decision Tree (FDPBoost) that protects the private datasets of different owners while improving model accuracy. We select sensitive features according to the secure feature set indicator, use an exponential mechanism to protect sensitive features, and assign significant weight to the Laplace mechanism to protect leaf node values. Additionally, a distributed two-level boosting framework is designed to allocate the privacy budget between intra-iteration and inter-iteration decision trees while protecting model communication. FDPBoost is tested on five datasets sourced from the materials and medical domains. Our experiments reveal that FDPBoost achieves accuracy competitive with traditional federated gradient boosting decision trees while also exhibiting a significant reduction in error rate compared to PPGBDT (Zhao et al.) and FV-tree (Gao et al.). Notably, FDPBoost's error rate on the tumor-diagnosis dataset is 30% lower than that of FV-tree.
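
The abstract names two standard differential privacy primitives: the exponential mechanism (for choosing among candidates such as features) and the Laplace mechanism (for perturbing leaf node values). The following is a minimal textbook sketch of those two primitives only, not the FDPBoost algorithm. All function names, the `sensitivity` and `epsilon` parameters, and the even budget split are illustrative assumptions; the abstract does not specify the secure feature set indicator or how the two-level framework actually divides the budget.

```python
import math
import random


def laplace_noise(scale, rng):
    """Sample Laplace(0, scale) as the difference of two exponentials with mean `scale`."""
    return rng.expovariate(1.0 / scale) - rng.expovariate(1.0 / scale)


def noisy_leaf_value(leaf_value, sensitivity, epsilon, rng):
    """Laplace mechanism: add noise with scale sensitivity / epsilon to a leaf value."""
    return leaf_value + laplace_noise(sensitivity / epsilon, rng)


def exponential_mechanism(candidates, scores, sensitivity, epsilon, rng):
    """Pick a candidate with probability proportional to exp(eps * score / (2 * sensitivity))."""
    top = max(scores)  # subtract the maximum score for numerical stability
    weights = [math.exp(epsilon * (s - top) / (2.0 * sensitivity)) for s in scores]
    return rng.choices(candidates, weights=weights, k=1)[0]


def split_budget_evenly(total_epsilon, n_trees):
    """Hypothetical even split under sequential composition (not the paper's allocation)."""
    return [total_epsilon / n_trees] * n_trees


rng = random.Random(0)
per_tree = split_budget_evenly(1.0, 4)
feature = exponential_mechanism(["f1", "f2", "f3"], [0.2, 0.9, 0.4], 1.0, per_tree[0], rng)
leaf = noisy_leaf_value(2.0, sensitivity=1.0, epsilon=per_tree[0], rng=rng)
```

A smaller `epsilon` gives stronger privacy but flattens the exponential-mechanism distribution and widens the Laplace noise, which is the accuracy trade-off the abstract refers to.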

Pages: 11
