A comprehensive learning based swarm optimization approach for feature selection in gene expression data

被引:2
|
作者
Easwaran, Subha [1 ]
Venugopal, Jothi Prakash [2 ]
Subramanian, Arul Antran Vijay [3 ]
Sundaram, Gopikrishnan [4 ]
Naseeba, Beebi [4 ]
机构
[1] Karpagam Coll Engn, Dept Sci & Humanities, Coimbatore 641032, Tamil Nadu, India
[2] Karpagam Coll Engn, Dept Informat Technol, Coimbatore 641032, Tamil Nadu, India
[3] Karpagam Coll Engn, Dept Comp Sci & Engn, Coimbatore 641032, Tamil Nadu, India
[4] VIT AP Univ, Sch Comp Sci & Engn, Amaravathi 522241, Andhra Pradesh, India
关键词
Comprehensive learning; Feature selection; Gene expression; Gene selection; Swarm intelligence; Cancer classification; MICROARRAY; CLASSIFICATION;
D O I
10.1016/j.heliyon.2024.e37165
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Gene expression data analysis is challenging due to the high dimensionality and complexity of the data. Feature selection, which identifies relevant genes, is a common preprocessing step. We propose a Comprehensive Learning-Based Swarm Optimization (CLBSO) approach for feature selection in gene expression data. CLBSO leverages the strengths of ants and grasshoppers to efficiently explore the high-dimensional search space. Ants perform local search and leave pheromone trails to guide the swarm, while grasshoppers use their ability to jump long distances to explore new regions and avoid local optima. The proposed approach was evaluated on several publicly available gene expression datasets and compared with state-of-the-art feature selection methods. CLBSO achieved an average accuracy improvement of 15% over the original high-dimensional data and outperformed other feature selection methods by up to 10%. For instance, in the Pancreatic cancer dataset, CLBSO achieved 97.2% accuracy, significantly higher than XGBoost-MOGA's 84.0%. Convergence analysis showed CLBSO required fewer iterations to reach optimal solutions. Statistical analysis confirmed significant performance improvements, and stability analysis demonstrated consistent gene subset selection across different runs. These findings highlight the robustness and efficacy of CLBSO in handling complex gene expression datasets, making it a valuable tool for enhancing classification tasks in bioinformatics.
引用
收藏
页数:20
相关论文
共 50 条
  • [41] Set based particle swarm optimization for the feature selection problem
    Engelbrecht, Andries P.
    Grobler, Jacomine
    Langeveld, Joost
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2019, 85 : 324 - 336
  • [42] Particle Swarm Optimization Based Feature Selection for Face Recognition
    Eleyan, Alaa
    2019 SEVENTH INTERNATIONAL CONFERENCE ON DIGITAL INFORMATION PROCESSING AND COMMUNICATIONS (ICDIPC 2019), 2019, : 1 - 4
  • [43] Feature Selection Based on Swallow Swarm Optimization for Fuzzy Classification
    Hodashinsky, Ilya
    Sarin, Konstantin
    Shelupanov, Alexander
    Slezkin, Artem
    SYMMETRY-BASEL, 2019, 11 (11):
  • [44] Opposition Based Comprehensive Learning Particle Swarm Optimization
    Wu, Zhangjun
    Ni, Zhiwei
    Zhang, Chang
    Gu, Lichuan
    2008 3RD INTERNATIONAL CONFERENCE ON INTELLIGENT SYSTEM AND KNOWLEDGE ENGINEERING, VOLS 1 AND 2, 2008, : 1013 - 1019
  • [45] QBSO-FS: A Reinforcement Learning Based Bee Swarm Optimization Metaheuristic for Feature Selection
    Sadeg, Souhila
    Hamdad, Leila
    Remache, Amine Riad
    Karech, Mehdi Nedjmeddine
    Benatchba, Karima
    Habbas, Zineb
    ADVANCES IN COMPUTATIONAL INTELLIGENCE, IWANN 2019, PT II, 2019, 11507 : 785 - 796
  • [46] Research on Feature Selection based on Improved Particle Swarm Optimization
    Wang, Guo Qing
    Jia, Jun Bo
    Li, Xu Yuan
    MANUFACTURING ENGINEERING AND AUTOMATION II, PTS 1-3, 2012, 591-593 : 2651 - +
  • [47] A semiparametric approach for marker gene selection based on gene expression data
    Guan, Z
    Zhao, HY
    BIOINFORMATICS, 2005, 21 (04) : 529 - 536
  • [48] Feature Selection of Gene Expression Data Using a Modified Artificial Fish Swarm Algorithm With Population Variation
    Li, Zong-Zheng
    Wang, Fang-Ling
    Qin, Feng
    Yusoff, Yusliza Binti
    Zain, Azlan Mohd
    IEEE ACCESS, 2024, 12 : 72688 - 72706
  • [49] A Comprehensive Survey of Recent Hybrid Feature Selection Methods in Cancer Microarray Gene Expression Data
    Almazrua, Halah
    Alshamlan, Hala
    IEEE Access, 2022, 10 : 71427 - 71449
  • [50] A Comprehensive Survey of Recent Hybrid Feature Selection Methods in Cancer Microarray Gene Expression Data
    Almazrua, Halah
    Alshamlan, Hala
    IEEE ACCESS, 2022, 10 : 71427 - 71449