Improving and comparing performance of machine learning classifiers optimized by swarm intelligent algorithms for code smell detection

被引：1

作者：

Jain, Shivani ^{[1
]}

Saha, Anju ^{[1
]}

机构：

[1] GGS Indraprastha Univ, USIC&T, Sect 16 C, Delhi 110078, India

来源：

SCIENCE OF COMPUTER PROGRAMMING | 2024年 / 237卷

关键词：

Code Smell Detection; Machine Learning; Meta-heuristic Algorithms; Optimization; Support Vector Machine; PRICE;

D O I：

10.1016/j.scico.2024.103140

中图分类号：

TP31 [计算机软件];

学科分类号：

081202 ; 0835 ;

摘要：

In complex systems, the maintenance phase engenders the emergence of code smells due to incessant shifts in requirements and designs, stringent timelines, and the developer's relative inexperience. While not conventionally classified as errors, code smells inherently signify flawed design structures that lead to future bugs and errors. It increases the software budget and eventually makes the system hard to maintain or completely obsolete. To mitigate these challenges, practitioners must detect and refactor code smells. However, the theoretical interpretation of smell definitions and intelligent establishment of threshold values pose a significant conundrum. Supervised machine learning emerges as a potent strategy to address these problems and alleviate the dependence on expert intervention. The learning mechanism of these algorithms can be refined through data pre-processing and hyperparameter tuning. Selecting the best values for hyperparameters can be tedious and requires an expert. This study introduces an innovative paradigm that fuses twelve swarm-based, meta-heuristic algorithms with two machine learning classifiers, optimizing their hyperparameters, eliminating the need for an expert, and automating the entire code smell detection process. Through this synergistic approach, the highest post-optimization accuracy, precision, recall, F-measure, and ROC-AUC values are 99.09%, 99.20%, 99.09%, 98.06%, and 100%, respectively. The most remarkable upsurge is 35.9% in accuracy, 53.79% in precision, 35.90% in recall, 44.73% in F-measure, and 36.28% in ROC-AUC. Artificial Bee Colony, Grey Wolf, and Salp Swarm Optimizer are the top-performing swarm-intelligent algorithms. God and Data Class are the most readily detectable smells with optimized classifiers. Statistical tests underscore the profound impact of employing swarm-based algorithms to optimize machine learning classifiers, corroborated by statistical tests. This seamless integration enhances classifier performance, automates code smell detection, and offers a robust solution to a persistent software engineering challenge.

引用

页数：31

共 50 条

[41] Machine learning techniques for code smell detection: A systematic literature review and meta-analysis
Azeem, Muhammad Ilyas
Palomba, Fabio
Shi, Lin
Wang, Qing
INFORMATION AND SOFTWARE TECHNOLOGY, 2019, 108 : 115 - 138
[42] Machine learning algorithms for diabetes detection: a comparative evaluation of performance of algorithms
Saxena, Surabhi
Mohapatra, Debashish
Padhee, Subhransu
Sahoo, Goutam Kumar
EVOLUTIONARY INTELLIGENCE, 2023, 16 (02) : 587 - 603
[43] Comparative Anlaysis of Machine Learning Algorithms along with Classifiers for AF Detection using a Scale
Kim, Hyun-Woo
Lee, Keonsoo
Moon, Chanki
Nam, Yunyoung
2019 1ST INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE IN INFORMATION AND COMMUNICATION (ICAIIC 2019), 2019, : 427 - 429
[44] Comparing the performance of machine learning and deep learning algorithms classifying messages in Facebook learning group
Huang-Fu, Cheng-Yo
Liao, Chen-Hsuan
Wu, Jiun-Yu
IEEE 21ST INTERNATIONAL CONFERENCE ON ADVANCED LEARNING TECHNOLOGIES (ICALT 2021), 2021, : 347 - 349
[45] Improving the performance of machine learning classifiers for Breast Cancer diagnosis based on feature selection
Perez, Noel
Guevara, Miguel A.
Silva, Augusto
Ramos, Isabel
Loureiro, Joana
FEDERATED CONFERENCE ON COMPUTER SCIENCE AND INFORMATION SYSTEMS, 2014, 2014, 2 : 209 - 217
[46] Comparing the Effects of Annotation Type on Machine Learning Detection Performance
Mullen, James F., Jr.
Tanner, Franklin R.
Sallee, Phil A.
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW 2019), 2019, : 855 - 861
[47] Performance Comparison of Binary Machine Learning Classifiers in Identifying Code Comment Types: An Exploratory Study
Indika, Amila
Washington, Peter Y.
Peruma, Anthony
2023 IEEE/ACM 2ND INTERNATIONAL WORKSHOP ON NATURAL LANGUAGE-BASED SOFTWARE ENGINEERING, NLBSE, 2023, : 20 - 23
[48] Performance analysis of machine learning classifiers for non-technical loss detection
Ghori, Khawaja MoyeezUllah
Imran, Muhammad
Nawaz, Asad
Abbasi, Rabeeh Ayaz
Ullah, Ata
Szathmary, Laszlo
JOURNAL OF AMBIENT INTELLIGENCE AND HUMANIZED COMPUTING, 2020, 14 (11) : 15327 - 15342
[49] A large empirical assessment of the role of data balancing in machine-learning-based code smell detection
Pecorelli, Fabiano
Di Nucci, Dario
De Roover, Coen
De Lucia, Andrea
JOURNAL OF SYSTEMS AND SOFTWARE, 2020, 169
[50] Enhancing Smartphone Malware Detection Performance by Applying Machine Learning Hybrid Classifiers
Amamra, Abdelfattah
Talhi, Chamseddine
Robert, Jean-Marc
Hamiche, Martin
COMPUTER APPLICATIONS FOR SOFTWARE ENGINEERING, DISASTER RECOVERY, AND BUSINESS CONTINUITY, 2012, 340 : 131 - 137

← 1 2 3 4 5 →