Improving and comparing performance of machine learning classifiers optimized by swarm intelligent algorithms for code smell detection

被引:1
|
作者
Jain, Shivani [1 ]
Saha, Anju [1 ]
机构
[1] GGS Indraprastha Univ, USIC&T, Sect 16 C, Delhi 110078, India
关键词
Code Smell Detection; Machine Learning; Meta-heuristic Algorithms; Optimization; Support Vector Machine; PRICE;
D O I
10.1016/j.scico.2024.103140
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
In complex systems, the maintenance phase engenders the emergence of code smells due to incessant shifts in requirements and designs, stringent timelines, and the developer's relative inexperience. While not conventionally classified as errors, code smells inherently signify flawed design structures that lead to future bugs and errors. It increases the software budget and eventually makes the system hard to maintain or completely obsolete. To mitigate these challenges, practitioners must detect and refactor code smells. However, the theoretical interpretation of smell definitions and intelligent establishment of threshold values pose a significant conundrum. Supervised machine learning emerges as a potent strategy to address these problems and alleviate the dependence on expert intervention. The learning mechanism of these algorithms can be refined through data pre-processing and hyperparameter tuning. Selecting the best values for hyperparameters can be tedious and requires an expert. This study introduces an innovative paradigm that fuses twelve swarm-based, meta-heuristic algorithms with two machine learning classifiers, optimizing their hyperparameters, eliminating the need for an expert, and automating the entire code smell detection process. Through this synergistic approach, the highest post-optimization accuracy, precision, recall, F-measure, and ROC-AUC values are 99.09%, 99.20%, 99.09%, 98.06%, and 100%, respectively. The most remarkable upsurge is 35.9% in accuracy, 53.79% in precision, 35.90% in recall, 44.73% in F-measure, and 36.28% in ROC-AUC. Artificial Bee Colony, Grey Wolf, and Salp Swarm Optimizer are the top-performing swarm-intelligent algorithms. God and Data Class are the most readily detectable smells with optimized classifiers. Statistical tests underscore the profound impact of employing swarm-based algorithms to optimize machine learning classifiers, corroborated by statistical tests. This seamless integration enhances classifier performance, automates code smell detection, and offers a robust solution to a persistent software engineering challenge.
引用
收藏
页数:31
相关论文
共 50 条
  • [41] Machine learning techniques for code smell detection: A systematic literature review and meta-analysis
    Azeem, Muhammad Ilyas
    Palomba, Fabio
    Shi, Lin
    Wang, Qing
    INFORMATION AND SOFTWARE TECHNOLOGY, 2019, 108 : 115 - 138
  • [42] Machine learning algorithms for diabetes detection: a comparative evaluation of performance of algorithms
    Saxena, Surabhi
    Mohapatra, Debashish
    Padhee, Subhransu
    Sahoo, Goutam Kumar
    EVOLUTIONARY INTELLIGENCE, 2023, 16 (02) : 587 - 603
  • [43] Comparative Anlaysis of Machine Learning Algorithms along with Classifiers for AF Detection using a Scale
    Kim, Hyun-Woo
    Lee, Keonsoo
    Moon, Chanki
    Nam, Yunyoung
    2019 1ST INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE IN INFORMATION AND COMMUNICATION (ICAIIC 2019), 2019, : 427 - 429
  • [44] Comparing the performance of machine learning and deep learning algorithms classifying messages in Facebook learning group
    Huang-Fu, Cheng-Yo
    Liao, Chen-Hsuan
    Wu, Jiun-Yu
    IEEE 21ST INTERNATIONAL CONFERENCE ON ADVANCED LEARNING TECHNOLOGIES (ICALT 2021), 2021, : 347 - 349
  • [45] Improving the performance of machine learning classifiers for Breast Cancer diagnosis based on feature selection
    Perez, Noel
    Guevara, Miguel A.
    Silva, Augusto
    Ramos, Isabel
    Loureiro, Joana
    FEDERATED CONFERENCE ON COMPUTER SCIENCE AND INFORMATION SYSTEMS, 2014, 2014, 2 : 209 - 217
  • [46] Comparing the Effects of Annotation Type on Machine Learning Detection Performance
    Mullen, James F., Jr.
    Tanner, Franklin R.
    Sallee, Phil A.
    2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW 2019), 2019, : 855 - 861
  • [47] Performance Comparison of Binary Machine Learning Classifiers in Identifying Code Comment Types: An Exploratory Study
    Indika, Amila
    Washington, Peter Y.
    Peruma, Anthony
    2023 IEEE/ACM 2ND INTERNATIONAL WORKSHOP ON NATURAL LANGUAGE-BASED SOFTWARE ENGINEERING, NLBSE, 2023, : 20 - 23
  • [48] Performance analysis of machine learning classifiers for non-technical loss detection
    Ghori, Khawaja MoyeezUllah
    Imran, Muhammad
    Nawaz, Asad
    Abbasi, Rabeeh Ayaz
    Ullah, Ata
    Szathmary, Laszlo
    JOURNAL OF AMBIENT INTELLIGENCE AND HUMANIZED COMPUTING, 2020, 14 (11) : 15327 - 15342
  • [49] A large empirical assessment of the role of data balancing in machine-learning-based code smell detection
    Pecorelli, Fabiano
    Di Nucci, Dario
    De Roover, Coen
    De Lucia, Andrea
    JOURNAL OF SYSTEMS AND SOFTWARE, 2020, 169
  • [50] Enhancing Smartphone Malware Detection Performance by Applying Machine Learning Hybrid Classifiers
    Amamra, Abdelfattah
    Talhi, Chamseddine
    Robert, Jean-Marc
    Hamiche, Martin
    COMPUTER APPLICATIONS FOR SOFTWARE ENGINEERING, DISASTER RECOVERY, AND BUSINESS CONTINUITY, 2012, 340 : 131 - 137