Towards Privacy Preserving Cross Project Defect Prediction with Federated Learning

被引:6
|
作者
Yamamoto, Hiroki [1 ]
Wang, Dong [1 ]
Rajbahadur, Gopi Krishnan [2 ]
Kondo, Masanari [1 ]
Kamei, Yasutaka [1 ]
Ubayashi, Naoyasu [1 ]
机构
[1] Kyushu Univ, Fukuoka, Japan
[2] Huawei Technol Canada Co Ltd, Markham, ON, Canada
关键词
Defect Prediction; Cross Project; Privacy Preservation; Federated Learning;
D O I
10.1109/SANER56733.2023.00052
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Defect prediction models can predict defects in software projects, and many researchers study defect prediction models to assist debugging efforts in software development. In recent years, there has been growing interest in Cross Project Defect Prediction (CPDP), which predicts defects in a project using a defect prediction model learned from other projects' data when there is insufficient data to construct a defect prediction model. Since CPDP uses other projects' data, data privacy preservation is one of the most significant issues. However, prior CPDP studies still require data sharing among projects to train models, and do not fully consider protecting project confidentiality. To address this, we propose a CPDP model FLR employing federated learning, a distributed machine learning approach that does not require data sharing. We evaluate FLR, using 25 projects, to investigate its effectiveness and feature interpretation. Our key results show that first, FLR outperforms the existing privacy-preserving methods (i.e., LACE2). Meanwhile, the performance is relatively comparable to the conventional methods (e.g., supervised and unsupervised learning). Second, the results of the interpretation analysis show that scale-related features have a common effect on the prediction performance of the FLR. In addition, further insights demonstrate that parameters of federated learning (e.g., learning rates and the number of clients) also play a role in the performance. This study is served as a first step to confirm the feasibility of the employment of federated learning in CPDP to ensure privacy preservation and lays the groundwork for future research on applying other machine learning models to federated learning.
引用
收藏
页码:485 / 496
页数:12
相关论文
共 50 条
  • [31] Fairness and privacy preserving in federated learning: A survey
    Rafi, Taki Hasan
    Noor, Faiza Anan
    Hussain, Tahmid
    Chae, Dong-Kyu
    INFORMATION FUSION, 2024, 105
  • [32] Federated learning for privacy-preserving AI
    Cheng, Yong
    Liu, Yang
    Chen, Tianjian
    Yang, Qiang
    COMMUNICATIONS OF THE ACM, 2020, 63 (12) : 33 - 36
  • [33] Privacy-Preserving and Reliable Federated Learning
    Lu, Yi
    Zhang, Lei
    Wang, Lulu
    Gao, Yuanyuan
    ALGORITHMS AND ARCHITECTURES FOR PARALLEL PROCESSING, ICA3PP 2021, PT III, 2022, 13157 : 346 - 361
  • [34] A Graph Federated Architecture with Privacy Preserving Learning
    Rizk, Elsa
    Sayed, Ali H.
    SPAWC 2021: 2021 IEEE 22ND INTERNATIONAL WORKSHOP ON SIGNAL PROCESSING ADVANCES IN WIRELESS COMMUNICATIONS (IEEE SPAWC 2021), 2020, : 131 - 135
  • [35] Privacy-preserving Techniques in Federated Learning
    Liu Y.-X.
    Chen H.
    Liu Y.-H.
    Li C.-P.
    Ruan Jian Xue Bao/Journal of Software, 2022, 33 (03): : 1057 - 1092
  • [36] Adaptive privacy-preserving federated learning
    Xiaoyuan Liu
    Hongwei Li
    Guowen Xu
    Rongxing Lu
    Miao He
    Peer-to-Peer Networking and Applications, 2020, 13 : 2356 - 2366
  • [37] Privacy-preserving Federated Learning for Industrial Defect Detection Systems via Differential Privacy and Image Obfuscation
    Lin, Chia-Yu
    Yeh, Yu-Chen
    Lu, Makena
    2024 IEEE CONFERENCE ON ARTIFICIAL INTELLIGENCE, CAI 2024, 2024, : 1136 - 1141
  • [38] FedCDR: Federated Cross-Domain Recommendation for Privacy-Preserving Rating Prediction
    Wu Meihan
    Li, Li
    Tao, Chang
    Rigall, Eric
    Wang Xiaodong
    Xu Chengzhong
    PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, CIKM 2022, 2022, : 2179 - 2188
  • [39] Towards Privacy-Preserving Federated Deep Learning infrastructure : proof-of-concept
    Zhang, C.
    Choudhury, A.
    Bermejo, I.
    Dekker, A.
    RADIOTHERAPY AND ONCOLOGY, 2022, 170 : S949 - S950
  • [40] Towards Privacy-Preserving Federated Neuromorphic Learning via Spiking Neuron Models
    Han, Bing
    Fu, Qiang
    Zhang, Xinliang
    ELECTRONICS, 2023, 12 (18)