Software Defect Prediction Based on Cost-Sensitive Dictionary Learning

被引:8
|
作者
Wan, Hongyan [1 ]
Wu, Guoqing [1 ]
Yu, Mali [2 ]
Yuan, Mengting [1 ]
机构
[1] Wuhan Univ, Sch Comp Sci, Wuhan 430072, Hubei, Peoples R China
[2] Jiujiang Univ, Sch Informat Sci & Technol, Jiujiang 332005, Peoples R China
关键词
Software defect prediction; dictionary learning; cost-sensitive; bilevel optimization; sparse coding; SPARSE REPRESENTATIONS; NEURAL-NETWORKS; QUALITY;
D O I
10.1142/S0218194019500384
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Software defect prediction technology has been widely used in improving the quality of software system. Most real software defect datasets tend to have fewer defective modules than defective-free modules. Highly class-imbalanced data typically make accurate predictions difficult. The imbalanced nature of software defect datasets makes the prediction model classifying a defective module as a defective-free one easily. As there exists the similarity during the different software modules, one module can be represented by the sparse representation coefficients over the pre-defined dictionary which consists of historical software defect datasets. In this study, we make use of dictionary learning method to predict software defect. We optimize the classifier parameters and the dictionary atoms iteratively, to ensure that the extracted features (sparse representation) are optimal for the trained classifier. We prove the optimal condition of the elastic net which is used to solve the sparse coding coefficients and the regularity of the elastic net solution. Due to the reason that the misclassification of defective modules generally incurs much higher cost risk than the misclassification of defective-free ones, we take the different misclassification costs into account, increasing the punishment on misclassification defective modules in the procedure of dictionary learning, making the classification inclining to classify a module as a defective one. Thus, we propose a cost-sensitive software defect prediction method using dictionary learning (CSDL). Experimental results on the 10 class-imbalance datasets of NASA show that our method is more effective than several typical state-of-the-art defect prediction methods.
引用
收藏
页码:1219 / 1243
页数:25
相关论文
共 50 条
  • [41] Predicting Software Defects: A Cost-Sensitive Approach
    Bezerra, Miguel E. R.
    Oliveira, Adriano L. I.
    Adeodato, Paulo J. L.
    2011 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS (SMC), 2011, : 2515 - 2522
  • [42] Breast cancer recurrence prediction with ensemble methods and cost-sensitive learning
    Yang, Pei-Tse
    Wu, Wen-Shuo
    Wu, Chia-Chun
    Shih, Yi-Nuo
    Hsieh, Chung-Ho
    Hsu, Jia-Lien
    OPEN MEDICINE, 2021, 16 (01): : 754 - 768
  • [43] Active Learning for Cost-Sensitive Classification
    Krishnamurthy, Akshay
    Agarwal, Alekh
    Huang, Tzu-Kuo
    Daume, Hal, III
    Langford, John
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 70, 2017, 70
  • [44] Cost-sensitive learning of SVM for ranking
    Xu, Jun
    Cao, Yunbo
    Li, Hang
    Huang, Yalou
    MACHINE LEARNING: ECML 2006, PROCEEDINGS, 2006, 4212 : 833 - 840
  • [45] Cost-Sensitive Learning in Answer Extraction
    Wiegand, Michael
    Leidner, Jochen L.
    Klakow, Dietrich
    SIXTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, LREC 2008, 2008, : 711 - 714
  • [46] Active Learning for Cost-Sensitive Classification
    Krishnamurthy, Akshay
    Agarwal, Alekh
    Huang, Tzu-Kuo
    Daume, Hal, III
    Langford, John
    JOURNAL OF MACHINE LEARNING RESEARCH, 2019, 20
  • [47] Adversarial Learning With Cost-Sensitive Classes
    Shen, Haojing
    Chen, Sihong
    Wang, Ran
    Wang, Xizhao
    IEEE TRANSACTIONS ON CYBERNETICS, 2023, 53 (08) : 4855 - 4866
  • [48] Cost-Sensitive Decision Tree Learning
    Vadera, Sunil
    PROCEEDINGS 2019 AMITY INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE (AICAI), 2019, : 4 - 5
  • [49] Cost-sensitive positive and unlabeled learning
    Chen, Xiuhua
    Gong, Chen
    Yang, Jian
    INFORMATION SCIENCES, 2021, 558 : 229 - 245
  • [50] Robust SVM for Cost-Sensitive Learning
    Jiangzhang Gan
    Jiaye Li
    Yangcai Xie
    Neural Processing Letters, 2022, 54 : 2737 - 2758