Enhancing Software Defect Prediction accuracy using Modified Entropy Calculation in Random Forest Algorithm

被引:0
|
作者
Suryawanshi, Ranjeetsingh [1 ]
Kadam, Amol [1 ]
机构
[1] Bharati Vidyapeeth Deemed Be Univ, Coll Engn, Pune, India
关键词
Random forest; decision tree; classification; prediction; entropy; Taylor series; NETWORKS;
D O I
10.52783/jes.754
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Imagine you are trying to classify software defect for a large dataset. How will you choose the best algorithm to do that? For the above problem we have various algorithms like Random Forest, Support Vector Machine, Neural Networks, Naive Bayes, K -Nearest Neighbours, Decision Tree, Logistic Regression etc. One of the most used methods is Random Forest algorithm, which uses multiple Decision Trees to make predictions. However, this algorithm relies on a complex calculation called Entropy, which measures the uncertainty in the data. Entropy function that uses natural logarithm which may be time consuming calculation. Is there a better way to calculate entropy? In this research, have explored a different way to calculate the natural logarithm using the Taylor series expression. It is a series consisting of sum of infinite terms that approximates any function by using its derivatives. We further modified the Random Forest algorithm by replacing the natural logarithm the Taylor series expression in the Entropy formula. We tested our modified algorithm on dataset and compared its performance with the original Entropy formula. We found that our modification in the algorithm has improved the accuracy of the algorithm on software defect prediction.
引用
收藏
页码:84 / 91
页数:8
相关论文
共 50 条
  • [1] Software Defect Prediction Using Random Forest Algorithm
    Soe, Yan Naung
    Santosa, Paulus Insap
    Hartanto, Rudy
    2018 12TH SOUTH EAST ASIAN TECHNICAL UNIVERSITY CONSORTIUM (SYMPOSIUM SEATUC 2018): ENGINEERING EDUCATION AND RESEARCH FOR SUSTAINABLE DEVELOPMENT, 2018,
  • [2] Software Defect Prediction using Feature Selection and Random Forest Algorithm
    Ibrahim, Dyana Rashid
    Ghnemat, Rawan
    Hudaib, Amjad
    2017 INTERNATIONAL CONFERENCE ON NEW TRENDS IN COMPUTING SCIENCES (ICTCS), 2017, : 252 - 257
  • [3] Improving Prediction Accuracy using Random Forest Algorithm
    Elsayed, Nesma
    Abd Elaleem, Sherif
    Marie, Mohamed
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2024, 15 (04) : 436 - 441
  • [4] Software maintainability prediction using an enhanced random forest algorithm
    Gupta, Shikha
    Chug, Anuradha
    JOURNAL OF DISCRETE MATHEMATICAL SCIENCES & CRYPTOGRAPHY, 2020, 23 (02): : 441 - 449
  • [5] Efficient Random Forest Algorithm for Multi-objective Optimization in Software Defect Prediction
    Kanwar, Shailza
    Awasthi, Lalit Kumar
    Shrivastava, Vivek
    IETE JOURNAL OF RESEARCH, 2024, 70 (04) : 3690 - 3702
  • [6] A Package Based Clustering for Enhancing Software Defect Prediction Accuracy
    Islam, Rayhanul
    Sakib, Kazi
    2014 17TH INTERNATIONAL CONFERENCE ON COMPUTER AND INFORMATION TECHNOLOGY (ICCIT), 2014, : 81 - 86
  • [7] A Novel method for Software Defect Prediction: Hybrid of FCM and Random Forest
    Pushphavathi, T. P.
    Suma, V
    Ramaswamy, V
    2014 INTERNATIONAL CONFERENCE ON ELECTRONICS AND COMMUNICATION SYSTEMS (ICECS), 2014,
  • [8] Predicting sulfotyrosine sites using the random forest algorithm with significantly improved prediction accuracy
    Yang, Zheng Rong
    BMC BIOINFORMATICS, 2009, 10 : 361
  • [9] Predicting sulfotyrosine sites using the random forest algorithm with significantly improved prediction accuracy
    Zheng Rong Yang
    BMC Bioinformatics, 10
  • [10] Prediction of Consumer Behaviour using Random Forest Algorithm
    Valecha, Harsh
    Varma, Aparna
    Khare, Ishita
    Sachdeva, Aakash
    Goyal, Mukta
    2018 5TH IEEE UTTAR PRADESH SECTION INTERNATIONAL CONFERENCE ON ELECTRICAL, ELECTRONICS AND COMPUTER ENGINEERING (UPCON), 2018, : 653 - 658