Prediction of software fault-prone classes using ensemble random forest with adaptive synthetic sampling algorithm

Cited by: 22
Authors
Balaram, A. [1]
Vasundra, S. [1]
Affiliations
[1] JNTUA Univ, Dept CSE, Anantapur, Andhra Pradesh, India
Keywords
Adaptive synthetic sampling; Butterfly optimization; Ensemble random forest; Imbalanced data; Software fault prediction; MACHINE;
DOI
10.1007/s10515-021-00311-z
Chinese Library Classification (CLC) number
TP31 [Computer software];
Discipline classification code
081202; 0835;
Abstract
Software Fault Prediction (SFP) is the process of predicting fault-prone modules in software. It is important for releasing software versions and relies on predefined metrics derived from historical faults in the software. Predicting faults in software units such as components, classes and modules at an early stage of the development cycle is important because it significantly reduces time and cost. It also reduces the unnecessary effort spent on eliminating faults in the modules processed at each step of the development process. However, imbalanced datasets pose a significant challenge for SFP at an early stage. Limitations such as the choice of software metrics for SFP models, the cost effectiveness of fault prediction and the prediction of fault density remain obstacles for research. The proposed Butterfly optimization performs feature selection, which helps Machine Learning techniques produce accurate results. The present research uses Ensemble Random Forest with Adaptive Synthetic Sampling (E-RF-ADASYN) for fault prediction, using the various classifiers described in the proposed method section. The proposed E-RF-ADASYN obtained an Area Under Curve (AUC) of 0.854767, compared with 0.771 for the existing Rough-KNN Noise-Filtered Easy Ensemble (RKEE) method.
Pages: 21
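The abstract names Adaptive Synthetic Sampling (ADASYN) as the remedy for imbalanced fault data but does not describe its steps. The following is a minimal sketch of the general ADASYN idea, not the authors' implementation: minority rows surrounded by more majority-class neighbours receive more synthetic rows, each created by interpolating towards a random same-class neighbour. The function name `adasyn` and the parameters `k`, `beta` and `seed` are illustrative assumptions, and the Butterfly optimization and Random Forest stages are not covered.

```python
import math
import random


def adasyn(X, y, minority=1, k=3, beta=1.0, seed=0):
    """Return synthetic minority rows (illustrative ADASYN-style sketch)."""
    rng = random.Random(seed)
    minority_rows = [x for x, label in zip(X, y) if label == minority]
    n_min = len(minority_rows)
    n_maj = len(X) - n_min
    # Number of synthetic rows needed to reach the requested balance level.
    total = int((n_maj - n_min) * beta)
    if total <= 0 or n_min < 2:
        return []

    def nearest(point, pool, count):
        # Indices of the `count` rows in `pool` closest to `point`,
        # skipping the row object `point` itself.
        ranked = sorted(
            (i for i, row in enumerate(pool) if row is not point),
            key=lambda i: math.dist(point, pool[i]),
        )
        return ranked[:count]

    # Hardness of each minority row: share of majority-class rows
    # among its k nearest neighbours in the full data set.
    hardness = []
    for x in minority_rows:
        neighbours = nearest(x, X, k)
        majority_hits = sum(1 for i in neighbours if y[i] != minority)
        hardness.append(majority_hits / len(neighbours))

    norm = sum(hardness)
    if norm == 0:
        # No minority row borders the majority class: spread evenly.
        weights = [1 / n_min] * n_min
    else:
        weights = [h / norm for h in hardness]

    synthetic = []
    for x, w in zip(minority_rows, weights):
        same_class = nearest(x, minority_rows, k)
        # Rounding means the final count may differ slightly from `total`.
        for _ in range(round(w * total)):
            z = minority_rows[rng.choice(same_class)]
            gap = rng.random()
            # New row lies on the segment between x and its neighbour z.
            synthetic.append([a + gap * (b - a) for a, b in zip(x, z)])
    return synthetic
```

In this sketch the returned rows would be appended to the training data with the minority label before a classifier is fitted. Established libraries provide tested implementations that are preferable in practice.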