Enhancing Software Defect Prediction accuracy using Modified Entropy Calculation in Random Forest Algorithm

Authors
Suryawanshi, Ranjeetsingh [1]
Kadam, Amol [1]
Affiliations
[1] Bharati Vidyapeeth Deemed Be Univ, Coll Engn, Pune, India
Keywords
Random forest; decision tree; classification; prediction; entropy; Taylor series; NETWORKS;
DOI
10.52783/jes.754
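Chinese Library Classification number
TM [Electrical engineering]; TN [Electronic technology, communication technology];
Discipline classification code
0808; 0809
Abstract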
Suppose you need to classify software defects in a large dataset. How do you choose the best algorithm for the task? Many candidates exist, including Random Forest, Support Vector Machine, Neural Networks, Naive Bayes, K-Nearest Neighbours, Decision Tree, and Logistic Regression. One of the most widely used is the Random Forest algorithm, which combines multiple Decision Trees to make predictions. However, this algorithm relies on entropy, a measure of the uncertainty in the data, and the entropy function uses the natural logarithm, which can be time-consuming to compute. Is there a better way to calculate entropy? In this research, we explore a different way to calculate the natural logarithm using its Taylor series expansion, an infinite sum of terms built from a function's derivatives at a single point that approximates the function near that point. We modified the Random Forest algorithm by replacing the natural logarithm in the entropy formula with the Taylor series expression. We tested the modified algorithm on a dataset and compared its performance with that of the original entropy formula. We found that the modification improved the accuracy of the algorithm on software defect prediction.
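The idea in the abstract can be illustrated with a minimal sketch. The abstract does not state which expansion or how many terms the authors used, so the code below assumes the standard series for ln(x) about x = 1, truncated to 10 terms. The names `ln_taylor` and `entropy` are hypothetical and are not taken from the paper.

```python
import math


def ln_taylor(x, terms=10):
    """Truncated Taylor series of ln(x) about x = 1.

    Assumed form: ln(x) = u - u^2/2 + u^3/3 - ..., with u = x - 1.
    The series converges only for 0 < x <= 2, which covers class
    probabilities in (0, 1]; convergence is slow as x approaches 0.
    """
    if not 0.0 < x <= 2.0:
        raise ValueError("series converges only for 0 < x <= 2")
    u = x - 1.0
    total = 0.0
    power = 1.0
    for k in range(1, terms + 1):
        power *= u  # power now holds u**k
        total += power / k if k % 2 == 1 else -power / k
    return total


def entropy(probs, log_fn=math.log):
    """Entropy -sum(p * ln p) over class probabilities.

    Zero probabilities are skipped, since p * ln p tends to 0 as p -> 0.
    """
    return -sum(p * log_fn(p) for p in probs if p > 0)


# Compare the exact and the approximated entropy for one class split.
split = [0.5, 0.5]
exact = entropy(split)              # uses math.log
approx = entropy(split, ln_taylor)  # uses the truncated series
print(f"exact={exact:.6f} approx={approx:.6f}")
```

Because the truncated series is least accurate for probabilities close to 0, the number of terms is a trade-off between speed and fidelity to the exact entropy. Whether such a substitution changes prediction accuracy depends on the dataset and is not something this sketch demonstrates.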
Pages: 84-91
Page count: 8