Augmented electric eel foraging optimization algorithm for feature selection with high-dimensional biological and medical diagnosis

被引:2
|
作者
Al-Betar, Mohammed Azmi [1 ,2 ]
Braik, Malik Sh. [3 ]
Mohamed, Elfadil A. [1 ]
Awadallah, Mohammed A. [4 ,5 ]
Nasor, Mohamed [1 ]
机构
[1] Artificial Intelligence Research Center (AIRC), College of Engineering and Information Technology, Ajman University, Ajman, United Arab Emirates
[2] Department of Information Technology, Al-Huson University College, Al-Balqa Applied University, Irbid, Jordan
[3] Department of Computer Science, Al-Balqa Applied University, Al-Salt,19117, Jordan
[4] Department of Computer Science, Al-Aqsa University, P.O. Box 4051, Gaza, Palestine
[5] Artificial Intelligence Research Center (AIRC), Ajman University, Ajman, United Arab Emirates
关键词
Bioinformatics - Diagnosis - Feature Selection - Higher order statistics;
D O I
10.1007/s00521-024-10288-x
中图分类号
学科分类号
摘要
This paper explores the importance of the electric eel foraging optimization (EEFO) algorithm in addressing feature selection (FS) problems, with the aim of ameliorating the practical benefit of FS in real-world applications. The use of EEFO to solve FS problems props our goal of providing clean and useful datasets that provide robust effectiveness for use in classification and clustering tasks. High-dimensional feature selection problems (HFSPs) are more common nowadays yet intricate where they contain a large number of features. Hence, the vast number of features in them should be carefully selected in order to determine the optimal subset of features. As the basic EEFO algorithm experiences premature convergence, there is a need to enhance its global and local search capabilities when applied in the field of FS. In order to tackle such issues, a binary augmented EEFO (BAEEFO) algorithm was developed and proposed for HFSPs. The following strategies were integrated into the mathematical model of the original EEFO algorithm to create BAEEFO: (1) resting behavior with nonlinear coefficient; (2) weight coefficient and confidence effect in the hunting process; (3) spiral search strategy; and (4) Gaussian mutation and random perturbations when the algorithm update is stagnant. Experimental findings confirm the effectiveness of the proposed BAEEFO method on 23 HFSPs gathered from the UCI repository, recording up to a 10% accuracy increment over the basic BEEFO algorithm. In most test cases, BAEEFO outperformed its competitors in classification accuracy rates and outperformed BEEFO in 90% of the datasets used. Thereby, BAEEFO has demonstrated strong competitiveness in terms of fitness scores and classification accuracy. When compared to its competitors, BAEEFO produced superior reduction rates with the fewest number of features selected. The findings in this research underscore the critical need for FS to combat the curse of dimensionality concerns and find highly useful features in data mining applications such as classification. The use of a new meta-heuristic algorithm incorporated with efficient search strategies in solving HFSPs represents a step forward in using this algorithm to solve other practical real-world problems in a variety of domains. © The Author(s), under exclusive licence to Springer-Verlag London Ltd., part of Springer Nature 2024.
引用
收藏
页码:22171 / 22221
页数:50
相关论文
共 50 条
  • [31] Two-stage improved Grey Wolf optimization algorithm for feature selection on high-dimensional classification
    Chaonan Shen
    Kai Zhang
    Complex & Intelligent Systems, 2022, 8 : 2769 - 2789
  • [32] Copula entropy-based golden jackal optimization algorithm for high-dimensional feature selection problems
    Askr, Heba
    Abdel-Salam, Mahmoud
    Hassanien, Aboul Ella
    EXPERT SYSTEMS WITH APPLICATIONS, 2024, 238
  • [33] Evolutionary binary feature selection using adaptive ebola optimization search algorithm for high-dimensional datasets
    Oyelade, Olaide N. N.
    Agushaka, Jeffrey O. O.
    Ezugwu, Absalom E. E.
    PLOS ONE, 2023, 18 (03):
  • [34] Hybrid binary arithmetic optimization algorithm with simulated annealing for feature selection in high-dimensional biomedical data
    Elham Pashaei
    Elnaz Pashaei
    The Journal of Supercomputing, 2022, 78 : 15598 - 15637
  • [35] A novel bacterial foraging optimization algorithm for feature selection
    Chen, Yu-Peng
    Li, Ying
    Wang, Gang
    Zheng, Yue-Feng
    Xu, Qian
    Fan, Jia-Hao
    Cui, Xue-Ting
    EXPERT SYSTEMS WITH APPLICATIONS, 2017, 83 : 1 - 17
  • [36] Multitasking Feature Selection Using a Clonal Selection Algorithm for High-Dimensional Microarray Data
    Wang, Yi
    Luo, Dan
    Yao, Jian
    ELECTRONICS, 2024, 13 (23):
  • [37] High-Dimensional Feature Selection Based on Improved Binary Ant Colony Optimization Combined with Hybrid Rice Optimization Algorithm
    Ye, A. Zhiwei
    Li, B. Ruihan
    Zhou, C. Wen
    Wang, D. Mingwei
    Mei, E. Mengqing
    Shu, F. Zhe
    Shen, G. Jun
    INTERNATIONAL JOURNAL OF INTELLIGENT SYSTEMS, 2023, 2023
  • [38] Interaction-based feature selection and classification for high-dimensional biological data
    Wang, Haitian
    Lo, Shaw-Hwa
    Zheng, Tian
    Hu, Inchi
    BIOINFORMATICS, 2012, 28 (21) : 2834 - 2842
  • [39] Research of Medical High-dimensional Imbalanced Data Classification-Ensemble Feature Selection Algorithm with Random Forest
    Zhu, Min
    Su, Bo
    Ning, Gangmin
    2017 INTERNATIONAL CONFERENCE ON SMART GRID AND ELECTRICAL AUTOMATION (ICSGEA), 2017, : 273 - 277
  • [40] FEATURE SELECTION FOR HIGH-DIMENSIONAL DATA ANALYSIS
    Verleysen, Michel
    NCTA 2011: PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON NEURAL COMPUTATION THEORY AND APPLICATIONS, 2011, : IS23 - IS25