Towards Privacy Preserving Cross Project Defect Prediction with Federated Learning

Cited by: 6

Authors:

Yamamoto, Hiroki [1]

Wang, Dong [1]

Rajbahadur, Gopi Krishnan [2]

Kondo, Masanari [1]

Kamei, Yasutaka [1]

Ubayashi, Naoyasu [1]

Affiliations:

[1] Kyushu Univ, Fukuoka, Japan

[2] Huawei Technol Canada Co Ltd, Markham, ON, Canada

Source:

2023 IEEE INTERNATIONAL CONFERENCE ON SOFTWARE ANALYSIS, EVOLUTION AND REENGINEERING, SANER | 2023

Keywords:

Defect Prediction; Cross Project; Privacy Preservation; Federated Learning;

DOI:

10.1109/SANER56733.2023.00052

Chinese Library Classification (CLC) number:

TP31 [Computer Software];

Discipline classification codes:

081202 ; 0835 ;

Abstract:

Defect prediction models can predict defects in software projects, and many researchers study defect prediction models to assist debugging efforts in software development. In recent years, there has been growing interest in Cross Project Defect Prediction (CPDP), which predicts defects in a project using a defect prediction model learned from other projects' data when the project itself has insufficient data to construct one. Since CPDP uses other projects' data, data privacy preservation is one of the most significant issues. However, prior CPDP studies still require data sharing among projects to train models, and do not fully consider protecting project confidentiality. To address this, we propose FLR, a CPDP model employing federated learning, a distributed machine learning approach that does not require data sharing. We evaluate FLR on 25 projects to investigate its effectiveness and feature interpretation. Our key results show that, first, FLR outperforms the existing privacy-preserving methods (i.e., LACE2), while its performance is relatively comparable to that of conventional methods (e.g., supervised and unsupervised learning). Second, the results of the interpretation analysis show that scale-related features have a common effect on the prediction performance of FLR. In addition, further insights demonstrate that parameters of federated learning (e.g., learning rates and the number of clients) also play a role in the performance. This study serves as a first step toward confirming the feasibility of employing federated learning in CPDP to ensure privacy preservation, and it lays the groundwork for future research on applying other machine learning models to federated learning.
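
The idea of training without data sharing can be illustrated with a minimal sketch. It is not the authors' FLR implementation: the abstract does not specify the model or the aggregation rule, so the sketch assumes a logistic regression model and size-weighted averaging of client weights (in the style of federated averaging). All function names and the synthetic data are hypothetical.

```python
import math
import random

def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))

def local_update(weights, data, lr=0.1, epochs=5):
    """Train on one client's private data; only the weights are returned."""
    w = list(weights)
    for _ in range(epochs):
        for x, y in data:
            p = sigmoid(sum(wi * xi for wi, xi in zip(w, x)))
            for i, xi in enumerate(x):
                w[i] -= lr * (p - y) * xi  # gradient step on the log loss
    return w

def federated_average(client_weights, client_sizes):
    """Average client weights, weighted by each client's sample count."""
    total = sum(client_sizes)
    dim = len(client_weights[0])
    return [
        sum(w[i] * n for w, n in zip(client_weights, client_sizes)) / total
        for i in range(dim)
    ]

def train_federated(clients, dim, rounds=20, lr=0.1):
    """Server loop: broadcast weights, collect local updates, aggregate."""
    global_w = [0.0] * dim
    for _ in range(rounds):
        updates = [local_update(global_w, data, lr) for data in clients]
        global_w = federated_average(updates, [len(d) for d in clients])
    return global_w

def make_client(rng, n):
    # Synthetic stand-in for one project's data (an assumption, not real metrics):
    # features are [bias, size-like metric]; label 1 marks a defective module.
    rows = []
    for _ in range(n):
        size = rng.uniform(-1.0, 1.0)
        rows.append(([1.0, size], 1 if size > 0 else 0))
    return rows

rng = random.Random(0)
clients = [make_client(rng, 40) for _ in range(3)]
model = train_federated(clients, dim=2)
```

In this sketch the raw rows in `clients` never leave `local_update`; the server only sees weight vectors. The learning rate and the number of clients, which the abstract reports as influencing performance, appear here as `lr` and `len(clients)`.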


Pages: 485-496

Number of pages: 12
