Towards Privacy Preserving Cross Project Defect Prediction with Federated Learning

被引:6
|
作者
Yamamoto, Hiroki [1 ]
Wang, Dong [1 ]
Rajbahadur, Gopi Krishnan [2 ]
Kondo, Masanari [1 ]
Kamei, Yasutaka [1 ]
Ubayashi, Naoyasu [1 ]
机构
[1] Kyushu Univ, Fukuoka, Japan
[2] Huawei Technol Canada Co Ltd, Markham, ON, Canada
关键词
Defect Prediction; Cross Project; Privacy Preservation; Federated Learning;
D O I
10.1109/SANER56733.2023.00052
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Defect prediction models can predict defects in software projects, and many researchers study defect prediction models to assist debugging efforts in software development. In recent years, there has been growing interest in Cross Project Defect Prediction (CPDP), which predicts defects in a project using a defect prediction model learned from other projects' data when there is insufficient data to construct a defect prediction model. Since CPDP uses other projects' data, data privacy preservation is one of the most significant issues. However, prior CPDP studies still require data sharing among projects to train models, and do not fully consider protecting project confidentiality. To address this, we propose a CPDP model FLR employing federated learning, a distributed machine learning approach that does not require data sharing. We evaluate FLR, using 25 projects, to investigate its effectiveness and feature interpretation. Our key results show that first, FLR outperforms the existing privacy-preserving methods (i.e., LACE2). Meanwhile, the performance is relatively comparable to the conventional methods (e.g., supervised and unsupervised learning). Second, the results of the interpretation analysis show that scale-related features have a common effect on the prediction performance of the FLR. In addition, further insights demonstrate that parameters of federated learning (e.g., learning rates and the number of clients) also play a role in the performance. This study is served as a first step to confirm the feasibility of the employment of federated learning in CPDP to ensure privacy preservation and lays the groundwork for future research on applying other machine learning models to federated learning.
引用
收藏
页码:485 / 496
页数:12
相关论文
共 50 条
  • [21] Personalized Privacy-Preserving Framework for Cross-Silo Federated Learning
    Tran, Van-Tuan
    Pham, Huy-Hieu
    Wong, Kok-Seng
    IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTING, 2024, 12 (04) : 1014 - 1024
  • [22] Privacy preserving distributed machine learning with federated learning
    Chamikara, M. A. P.
    Bertok, P.
    Khalil, I.
    Liu, D.
    Camtepe, S.
    COMPUTER COMMUNICATIONS, 2021, 171 : 112 - 125
  • [23] Privacy-preserving Cross-domain Recommendation with Federated Graph Learning
    Tian, Changxin
    Xie, Yuexiang
    Chen, Xu
    Li, Yaliang
    Zhao, Xin
    ACM TRANSACTIONS ON INFORMATION SYSTEMS, 2024, 42 (05)
  • [24] Anonymous and Efficient Authentication Scheme for Privacy-Preserving Federated Cross Learning
    Li, Zeshuai
    Liang, Xiaoyan
    ADVANCED INTELLIGENT COMPUTING TECHNOLOGY AND APPLICATIONS, PT IX, ICIC 2024, 2024, 14870 : 281 - 293
  • [25] Preserving Privacy in Federated Learning with Ensemble Cross-Domain Knowledge Distillation
    Gong, Xuan
    Sharma, Abhishek
    Karanam, Srikrishna
    Wu, Ziyan
    Chen, Terrence
    Doermann, David
    Innanje, Arun
    THIRTY-SIXTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FOURTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE / TWELVETH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2022, : 11891 - 11899
  • [26] Privacy-Preserving Personalized Federated Learning
    Hu, Rui
    Guo, Yuanxiong
    Li, Hongning
    Pei, Qingqi
    Gong, Yanmin
    ICC 2020 - 2020 IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS (ICC), 2020,
  • [27] Frameworks for Privacy-Preserving Federated Learning
    Phong, Le Trieu
    Phuong, Tran Thi
    Wang, Lihua
    Ozawa, Seiichi
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2024, E107D (01) : 2 - 12
  • [28] ROFL: RObust privacy preserving Federated Learning
    Chattopadhyay, Nandish
    Singh, Arpit
    Chattopadhyay, Anupam
    2022 IEEE 42ND INTERNATIONAL CONFERENCE ON DISTRIBUTED COMPUTING SYSTEMS WORKSHOPS (ICDCSW), 2022, : 125 - 132
  • [29] Adaptive privacy-preserving federated learning
    Liu, Xiaoyuan
    Li, Hongwei
    Xu, Guowen
    Lu, Rongxing
    He, Miao
    PEER-TO-PEER NETWORKING AND APPLICATIONS, 2020, 13 (06) : 2356 - 2366
  • [30] Privacy preserving federated learning for full heterogeneity
    Chen, Kongyang
    Zhang, Xiaoxue
    Zhou, Xiuhua
    Mi, Bing
    Xiao, Yatie
    Zhou, Lei
    Wu, Zhen
    Wu, Lin
    Wang, Xiaoying
    ISA TRANSACTIONS, 2023, 141 : 73 - 83