A comprehensive learning based swarm optimization approach for feature selection in gene expression data

被引:2
|
作者
Easwaran, Subha [1 ]
Venugopal, Jothi Prakash [2 ]
Subramanian, Arul Antran Vijay [3 ]
Sundaram, Gopikrishnan [4 ]
Naseeba, Beebi [4 ]
机构
[1] Karpagam Coll Engn, Dept Sci & Humanities, Coimbatore 641032, Tamil Nadu, India
[2] Karpagam Coll Engn, Dept Informat Technol, Coimbatore 641032, Tamil Nadu, India
[3] Karpagam Coll Engn, Dept Comp Sci & Engn, Coimbatore 641032, Tamil Nadu, India
[4] VIT AP Univ, Sch Comp Sci & Engn, Amaravathi 522241, Andhra Pradesh, India
关键词
Comprehensive learning; Feature selection; Gene expression; Gene selection; Swarm intelligence; Cancer classification; MICROARRAY; CLASSIFICATION;
D O I
10.1016/j.heliyon.2024.e37165
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Gene expression data analysis is challenging due to the high dimensionality and complexity of the data. Feature selection, which identifies relevant genes, is a common preprocessing step. We propose a Comprehensive Learning-Based Swarm Optimization (CLBSO) approach for feature selection in gene expression data. CLBSO leverages the strengths of ants and grasshoppers to efficiently explore the high-dimensional search space. Ants perform local search and leave pheromone trails to guide the swarm, while grasshoppers use their ability to jump long distances to explore new regions and avoid local optima. The proposed approach was evaluated on several publicly available gene expression datasets and compared with state-of-the-art feature selection methods. CLBSO achieved an average accuracy improvement of 15% over the original high-dimensional data and outperformed other feature selection methods by up to 10%. For instance, in the Pancreatic cancer dataset, CLBSO achieved 97.2% accuracy, significantly higher than XGBoost-MOGA's 84.0%. Convergence analysis showed CLBSO required fewer iterations to reach optimal solutions. Statistical analysis confirmed significant performance improvements, and stability analysis demonstrated consistent gene subset selection across different runs. These findings highlight the robustness and efficacy of CLBSO in handling complex gene expression datasets, making it a valuable tool for enhancing classification tasks in bioinformatics.
引用
收藏
页数:20
相关论文
共 50 条
  • [1] A Particle Swarm Optimization based Feature Selection Approach to Transfer Learning in Classification
    Nguyen, Bach Hoai
    Xue, Bing
    Andreae, Peter
    GECCO'18: PROCEEDINGS OF THE 2018 GENETIC AND EVOLUTIONARY COMPUTATION CONFERENCE, 2018, : 37 - 44
  • [2] An innovative approach for feature selection based on chicken swarm optimization
    Hafez, Ahmed Ibrahem
    Zawbaa, Hossam M.
    Emary, E.
    Mahmoud, Hamdi A.
    Hassanien, Aboul Ella
    PROCEEDINGS OF THE 2015 SEVENTH INTERNATIONAL CONFERENCE OF SOFT COMPUTING AND PATTERN RECOGNITION (SOCPAR 2015), 2015, : 19 - 24
  • [3] Feature Selection for Alzheimer's Gene Expression Data Using Modified Binary Particle Swarm Optimization
    Ramaswamy, Ramya
    Kandhasamy, Premalatha
    Palaniswamy, Swathypriyadharsini
    IETE JOURNAL OF RESEARCH, 2023, 69 (01) : 9 - 20
  • [4] Feature Selection Based on Adaptive Particle Swarm Optimization with Leadership Learning
    Ye, Zhiwei
    Xu, Yi
    He, Qiyi
    Wang, Mingwei
    Bai, Wanfang
    Xiao, Hongwei
    COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE, 2022, 2022
  • [5] Developed Modified Particle Swarm Optimization For Feature Selection On Learning Based Big Data In Cloud Computing
    Thenmozhi, L.
    Chandrakala, N.
    JOURNAL OF ALGEBRAIC STATISTICS, 2022, 13 (01) : 310 - 320
  • [6] Multiobjective Binary Biogeography Based Optimization for Feature Selection Using Gene Expression Data
    Li, Xiangtao
    Yin, Minghao
    IEEE TRANSACTIONS ON NANOBIOSCIENCE, 2013, 12 (04) : 343 - 353
  • [7] A hybrid feature selection approach for microarray gene expression data
    Tan, Feng
    Fu, Xuezheng
    Wang, Hao
    Zhang, Yanqing
    Bourgeois, Anu
    COMPUTATIONAL SCIENCE - ICCS 2006, PT 2, PROCEEDINGS, 2006, 3992 : 678 - 685
  • [8] Stable Feature Selection for Gene Expression using Enhanced Binary Particle Swarm Optimization
    Dhrif, Hassen
    Wuchty, Stefan
    ICAART: PROCEEDINGS OF THE 12TH INTERNATIONAL CONFERENCE ON AGENTS AND ARTIFICIAL INTELLIGENCE, VOL 2, 2020, : 437 - 444
  • [9] Microarray Gene Expression Dataset Feature Selection and Classification with Swarm Optimization to Diagnosis Diseases
    Krishna, Peddarapu Rama
    Rajarajeswari, Pothuraju
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2024, 15 (07) : 536 - 546
  • [10] Unsupervised Feature Selection for Microarray Gene Expression Data Based on Discriminative Structure Learning
    Ye, Xiucai
    Sakurai, Tetsuya
    JOURNAL OF UNIVERSAL COMPUTER SCIENCE, 2018, 24 (06) : 725 - 741