Stable Feature Selection using Improved Whale Optimization Algorithm for Microarray Datasets

被引:0
|
作者
Theng, Dipti [1 ]
Bhoyar, Kishor K. [2 ]
机构
[1] YCCE, Comp Technol Dept, Nagpur, Maharashtra, India
[2] YCCE, Comp Sci & Engn Dept, Nagpur, Maharashtra, India
关键词
feature selection; stability of feature selection; whale optimization algorithm; marine predator algorithm; grey wolf optimization; microarray datasets; high dimensional datasets;
D O I
10.14201/adcaij.31187
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
A microarray is a collection of DNA sequences that reflect an organism's whole gene set and are organized in a grid pattern for use in genetic testing. Microarray datasets are extremely high-dimensional and have a very small sample size, posing the challenges of insufficient data and high computational complexity. Identification of true biomarkers that are the most significant features (a very small subset of the complete feature set) is desired to solve these issues. This reduces over-fitting, and time complexity, and improves model generalization. Various feature selection algorithms are used for this biomarker identification. This research proposed a modification to the whale optimization algorithm (WOAm) for biomarker discovery, in which the fitness of each search agent is evaluated using the hinge loss function during the hunting for prey phase to determine the optimal search agent. Also compared the results of the proposed modified algorithm with the original whale optimization algorithm and also with contemporary algorithms like the marine predator algorithm and grey wolf optimization. All these algorithms are evaluated on six different high-dimensional microarray datasets. It has been observed that the proposed modification for the whale optimization algorithm has significantly improved the results of feature selection across all the datasets. Domain experts trust the resultant biomarker/ associated genes by the stability of the results obtained. The chosen feature set's stability was also evaluated during the research work. According to the findings, our proposed WOAm has superior stability compared to other algorithms for the CNS, colon, Leukemia, and OSCC. datasets.
引用
收藏
页数:16
相关论文
共 50 条
  • [1] AltWOA: Altruistic Whale Optimization Algorithm for feature selection on microarray datasets
    Kundu, Rohit
    Chattopadhyay, Soham
    Cuevas, Erik
    Sarkar, Ram
    COMPUTERS IN BIOLOGY AND MEDICINE, 2022, 144
  • [2] An Improved Whale Optimization Algorithm for Feature Selection
    Guo, Wenyan
    Liu, Ting
    Dai, Fang
    Xu, Peng
    CMC-COMPUTERS MATERIALS & CONTINUA, 2020, 62 (01): : 337 - 354
  • [3] A novel feature selection using binary hybrid improved whale optimization algorithm
    Uzer, Mustafa Serter
    Inan, Onur
    JOURNAL OF SUPERCOMPUTING, 2023, 79 (09): : 10020 - 10045
  • [4] A novel feature selection using binary hybrid improved whale optimization algorithm
    Mustafa Serter Uzer
    Onur Inan
    The Journal of Supercomputing, 2023, 79 : 10020 - 10045
  • [5] SVM parameters and feature selection optimization based on improved whale algorithm
    Guo H.
    Fu J.-D.
    Li Z.-D.
    Yan Y.
    Li X.
    Jilin Daxue Xuebao (Gongxueban)/Journal of Jilin University (Engineering and Technology Edition), 2023, 53 (10): : 2952 - 2963
  • [6] Improved whale optimization algorithm for feature selection in Arabic sentiment analysis
    Tubishat, Mohammad
    Abushariah, Mohammad A. M.
    Idris, Norisma
    Aljarah, Ibrahim
    APPLIED INTELLIGENCE, 2019, 49 (05) : 1688 - 1707
  • [7] Stability Investigation of Improved Whale Optimization Algorithm in the Process of Feature Selection
    Khaire, Utkarsh Mahadeo
    Dhanalakshmi, R.
    IETE TECHNICAL REVIEW, 2022, 39 (02) : 286 - 300
  • [8] Improved whale optimization algorithm for feature selection in Arabic sentiment analysis
    Mohammad Tubishat
    Mohammad A. M. Abushariah
    Norisma Idris
    Ibrahim Aljarah
    Applied Intelligence, 2019, 49 : 1688 - 1707
  • [9] Feature selection in high-dimensional microarray cancer datasets using an improved equilibrium optimization approach
    Balakrishnan, Kulanthaivel
    Dhanalakshmi, Ramasamy
    CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2022, 34 (28):
  • [10] Ant Colony Algorithm for Feature Selection on Microarray Datasets
    Fahrudin, Tresna Maulana
    Syarif, Iwan
    Barakbah, Ali Ridho
    2016 INTERNATIONAL ELECTRONICS SYMPOSIUM (IES), 2016, : 351 - 356