Cross Project Defect Prediction via Balanced Distribution Adaptation Based Transfer Learning

被引:0
|
作者
Zhou Xu
Shuai Pang
Tao Zhang
Xia-Pu Luo
Jin Liu
Yu-Tian Tang
Xiao Yu
Lei Xue
机构
[1] Harbin Engineering University,College of Computer Science and Technology
[2] Wuhan University,School of Computer Science
[3] The Hong Kong Polytechnic University,Department of Computing
[4] Chinese Academy of Sciences,Key Laboratory of Network Assessment Technology, Institute of Information Engineering
[5] Guilin University of Electronic Technology,Guangxi Key Laboratory of Trusted Software
[6] City University of Hong Kong,Department of Computer Science
关键词
cross-project defect prediction; transfer learning; balancing distribution; effort-aware indicator;
D O I
暂无
中图分类号
学科分类号
摘要
Defect prediction assists the rational allocation of testing resources by detecting the potentially defective software modules before releasing products. When a project has no historical labeled defect data, cross project defect prediction (CPDP) is an alternative technique for this scenario. CPDP utilizes labeled defect data of an external project to construct a classification model to predict the module labels of the current project. Transfer learning based CPDP methods are the current mainstream. In general, such methods aim to minimize the distribution differences between the data of the two projects. However, previous methods mainly focus on the marginal distribution difference but ignore the conditional distribution difference, which will lead to unsatisfactory performance. In this work, we use a novel balanced distribution adaptation (BDA) based transfer learning method to narrow this gap. BDA simultaneously considers the two kinds of distribution differences and adaptively assigns different weights to them. To evaluate the effectiveness of BDA for CPDP performance, we conduct experiments on 18 projects from four datasets using six indicators (i.e., F-measure, g-means, Balance, AUC, EARecall, and EAF-measure). Compared with 12 baseline methods, BDA achieves average improvements of 23.8%, 12.5%, 11.5%, 4.7%, 34.2%, and 33.7% in terms of the six indicators respectively over four datasets.
引用
收藏
页码:1039 / 1062
页数:23
相关论文
共 50 条
  • [41] A cross-project defect prediction method based on multi-adaptation and nuclear norm
    Huang, Qingan
    Ma, Le
    Jiang, Siyu
    Wu, Guobin
    Song, Hengjie
    Jiang, Libiao
    Zheng, Chunyun
    IET SOFTWARE, 2022, 16 (02) : 200 - 213
  • [42] Joint feature representation learning and progressive distribution matching for cross-project defect prediction
    Zou, Quanyi
    Lu, Lu
    Yang, Zhanyu
    Gu, Xiaowei
    Qiu, Shaojian
    INFORMATION AND SOFTWARE TECHNOLOGY, 2021, 137 (137)
  • [43] Cross project project defect prediction using class distribution estimation and oversampling
    Limsettho, Nachai
    Bennin, Kwabena Ebo
    Keung, Jacky W.
    Hata, Hideaki
    Matsumoto, Kenichi
    INFORMATION AND SOFTWARE TECHNOLOGY, 2018, 100 : 87 - 102
  • [44] Cross-Project Defect Prediction via Semi-Supervised Discriminative Feature Learning
    Xing, Danlei
    Wu, Fei
    Sun, Ying
    Jing, Xiao-Yuan
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2020, E103D (10) : 2237 - 2240
  • [45] Manifold Learning for Cross-project Software Defect Prediction
    Sun, Jing
    Jing, Xiaoyuan
    Dong, Xiwei
    PROCEEDINGS OF 2018 5TH IEEE INTERNATIONAL CONFERENCE ON CLOUD COMPUTING AND INTELLIGENCE SYSTEMS (CCIS), 2018, : 567 - 571
  • [46] Balanced Adaptation Regularization Based Transfer Learning for Unsupervised Cross-Domain Fault Diagnosis
    Hu, Qin
    Si, Xiaosheng
    Qin, Aisong
    Lv, Yunrong
    Liu, Mei
    IEEE SENSORS JOURNAL, 2022, 22 (12) : 12139 - 12151
  • [47] Graph-Based Feature Learning for Cross-Project Software Defect Prediction
    Abdu, Ahmed
    Zhai, Zhengjun
    Abdo, Hakim A.
    Algabri, Redhwan
    Lee, Sungon
    CMC-COMPUTERS MATERIALS & CONTINUA, 2023, 77 (01): : 161 - 180
  • [48] Heterogeneous Defect Prediction Based on Federated Transfer Learning via Knowledge Distillation
    Wang, Aili
    Zhang, Yutong
    Yan, Yixin
    IEEE ACCESS, 2021, 9 : 29530 - 29540
  • [49] Power Equipment Fault Diagnosis Model Based on Deep Transfer Learning with Balanced Distribution Adaptation
    Wang, Kaijie
    Wu, Bin
    ADVANCED DATA MINING AND APPLICATIONS, ADMA 2018, 2018, 11323 : 178 - 188
  • [50] MHCPDP: multi-source heterogeneous cross-project defect prediction via multi-source transfer learning and autoencoder
    Jie Wu
    Yingbo Wu 
    Nan Niu
    Min Zhou
    Software Quality Journal, 2021, 29 : 405 - 430