Towards Privacy Preserving Cross Project Defect Prediction with Federated Learning

被引:6
|
作者
Yamamoto, Hiroki [1 ]
Wang, Dong [1 ]
Rajbahadur, Gopi Krishnan [2 ]
Kondo, Masanari [1 ]
Kamei, Yasutaka [1 ]
Ubayashi, Naoyasu [1 ]
机构
[1] Kyushu Univ, Fukuoka, Japan
[2] Huawei Technol Canada Co Ltd, Markham, ON, Canada
关键词
Defect Prediction; Cross Project; Privacy Preservation; Federated Learning;
D O I
10.1109/SANER56733.2023.00052
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Defect prediction models can predict defects in software projects, and many researchers study defect prediction models to assist debugging efforts in software development. In recent years, there has been growing interest in Cross Project Defect Prediction (CPDP), which predicts defects in a project using a defect prediction model learned from other projects' data when there is insufficient data to construct a defect prediction model. Since CPDP uses other projects' data, data privacy preservation is one of the most significant issues. However, prior CPDP studies still require data sharing among projects to train models, and do not fully consider protecting project confidentiality. To address this, we propose a CPDP model FLR employing federated learning, a distributed machine learning approach that does not require data sharing. We evaluate FLR, using 25 projects, to investigate its effectiveness and feature interpretation. Our key results show that first, FLR outperforms the existing privacy-preserving methods (i.e., LACE2). Meanwhile, the performance is relatively comparable to the conventional methods (e.g., supervised and unsupervised learning). Second, the results of the interpretation analysis show that scale-related features have a common effect on the prediction performance of the FLR. In addition, further insights demonstrate that parameters of federated learning (e.g., learning rates and the number of clients) also play a role in the performance. This study is served as a first step to confirm the feasibility of the employment of federated learning in CPDP to ensure privacy preservation and lays the groundwork for future research on applying other machine learning models to federated learning.
引用
收藏
页码:485 / 496
页数:12
相关论文
共 50 条
  • [41] Privacy-preserving blockchain-based federated learning for traffic flow prediction
    Qi, Yuanhang
    Hossain, M. Shamim
    Nie, Jiangtian
    Li, Xuandi
    FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2021, 117 : 328 - 337
  • [42] Privacy-Preserving Consumer Churn Prediction in Telecommunication through Federated Machine Learning
    Huh, Jaehyuk
    Lee, Woongsup
    2024 IEEE INTERNATIONAL CONFERENCE ON BIG DATA AND SMART COMPUTING, IEEE BIGCOMP 2024, 2024, : 355 - 356
  • [43] PMF: A Privacy-preserving Human Mobility Prediction Framework via Federated Learning
    Feng, Jie
    Rong, Can
    Sun, Funing
    Guo, Diansheng
    Li, Yong
    PROCEEDINGS OF THE ACM ON INTERACTIVE MOBILE WEARABLE AND UBIQUITOUS TECHNOLOGIES-IMWUT, 2020, 4 (01):
  • [44] Privacy preserving personalized blockchain reliability prediction via federated learning in IoT environments
    Xu, Jianlong
    Lin, Jian
    Liang, Wei
    Li, Kuan-Ching
    CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS, 2022, 25 (04): : 2515 - 2526
  • [45] An Efficient Content Popularity Prediction of Privacy Preserving Based on Federated Learning and Wasserstein GAN
    Wang, Kailun
    Deng, Na
    Li, Xuanheng
    IEEE INTERNET OF THINGS JOURNAL, 2023, 10 (05) : 3786 - 3798
  • [46] Privacy preserving personalized blockchain reliability prediction via federated learning in IoT environments
    Jianlong Xu
    Jian Lin
    Wei Liang
    Kuan-Ching Li
    Cluster Computing, 2022, 25 : 2515 - 2526
  • [47] Federated learning-based trajectory prediction model with privacy preserving for intelligent vehicle
    Han, Mu
    Xu, Kai
    Ma, Shidian
    Li, Aoxue
    Jiang, Haobin
    INTERNATIONAL JOURNAL OF INTELLIGENT SYSTEMS, 2022, 37 (12) : 10861 - 10879
  • [48] Privacy Preserving Machine Learning with Homomorphic Encryption and Federated Learning
    Fang, Haokun
    Qian, Quan
    FUTURE INTERNET, 2021, 13 (04):
  • [49] A efficient and robust privacy-preserving framework for cross-device federated learning
    Du, Weidong
    Li, Min
    Wu, Liqiang
    Han, Yiliang
    Zhou, Tanping
    Yang, Xiaoyuan
    COMPLEX & INTELLIGENT SYSTEMS, 2023, 9 (05) : 4923 - 4937
  • [50] Cross the Chasm: Scalable Privacy-Preserving Federated Learning against Poisoning Attack
    Li, Yiran
    Hu, Guiqiang
    Liu, Xiaoyuan
    Ying, Zuobin
    2021 18TH INTERNATIONAL CONFERENCE ON PRIVACY, SECURITY AND TRUST (PST), 2021,