A comprehensive learning based swarm optimization approach for feature selection in gene expression data

被引:2
|
作者
Easwaran, Subha [1 ]
Venugopal, Jothi Prakash [2 ]
Subramanian, Arul Antran Vijay [3 ]
Sundaram, Gopikrishnan [4 ]
Naseeba, Beebi [4 ]
机构
[1] Karpagam Coll Engn, Dept Sci & Humanities, Coimbatore 641032, Tamil Nadu, India
[2] Karpagam Coll Engn, Dept Informat Technol, Coimbatore 641032, Tamil Nadu, India
[3] Karpagam Coll Engn, Dept Comp Sci & Engn, Coimbatore 641032, Tamil Nadu, India
[4] VIT AP Univ, Sch Comp Sci & Engn, Amaravathi 522241, Andhra Pradesh, India
关键词
Comprehensive learning; Feature selection; Gene expression; Gene selection; Swarm intelligence; Cancer classification; MICROARRAY; CLASSIFICATION;
D O I
10.1016/j.heliyon.2024.e37165
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Gene expression data analysis is challenging due to the high dimensionality and complexity of the data. Feature selection, which identifies relevant genes, is a common preprocessing step. We propose a Comprehensive Learning-Based Swarm Optimization (CLBSO) approach for feature selection in gene expression data. CLBSO leverages the strengths of ants and grasshoppers to efficiently explore the high-dimensional search space. Ants perform local search and leave pheromone trails to guide the swarm, while grasshoppers use their ability to jump long distances to explore new regions and avoid local optima. The proposed approach was evaluated on several publicly available gene expression datasets and compared with state-of-the-art feature selection methods. CLBSO achieved an average accuracy improvement of 15% over the original high-dimensional data and outperformed other feature selection methods by up to 10%. For instance, in the Pancreatic cancer dataset, CLBSO achieved 97.2% accuracy, significantly higher than XGBoost-MOGA's 84.0%. Convergence analysis showed CLBSO required fewer iterations to reach optimal solutions. Statistical analysis confirmed significant performance improvements, and stability analysis demonstrated consistent gene subset selection across different runs. These findings highlight the robustness and efficacy of CLBSO in handling complex gene expression datasets, making it a valuable tool for enhancing classification tasks in bioinformatics.
引用
收藏
页数:20
相关论文
共 50 条
  • [31] Improved salp swarm algorithm based on particle swarm optimization for feature selection
    Rehab Ali Ibrahim
    Ahmed A. Ewees
    Diego Oliva
    Mohamed Abd Elaziz
    Songfeng Lu
    Journal of Ambient Intelligence and Humanized Computing, 2019, 10 : 3155 - 3169
  • [32] Particle swarm optimization algorithm based on comprehensive scoring framework for high-dimensional feature selection
    Wei, Bo
    Yang, Shanshan
    Zha, Wentao
    Deng, Li
    Huang, Jiangyi
    Su, Xiaohui
    Wang, Feng
    SWARM AND EVOLUTIONARY COMPUTATION, 2025, 95
  • [33] Null space based feature selection method for gene expression data
    Alok Sharma
    Seiya Imoto
    Satoru Miyano
    Vandana Sharma
    International Journal of Machine Learning and Cybernetics, 2012, 3 : 269 - 276
  • [34] Null space based feature selection method for gene expression data
    Sharma, Alok
    Imoto, Seiya
    Miyano, Satoru
    Sharma, Vandana
    INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2012, 3 (04) : 269 - 276
  • [35] Joint classifier and feature optimization for comprehensive cancer diagnosis using gene expression data
    Krishnapuram, B
    Carin, L
    Hartemink, AJ
    JOURNAL OF COMPUTATIONAL BIOLOGY, 2004, 11 (2-3) : 227 - 242
  • [36] Correlation feature selection based improved-Binary Particle Swarm Optimization for gene selection and cancer classification
    Jain, Indu
    Jain, Vinod Kumar
    Jain, Renu
    APPLIED SOFT COMPUTING, 2018, 62 : 203 - 215
  • [37] A Method of Feature Selection based on Particle Swarm Optimization Algorithm with Trans-gene Operator
    Deng Ruifen
    Liu Binghan
    Xia Tian
    Wang Weizhi
    2008 CHINESE CONTROL AND DECISION CONFERENCE, VOLS 1-11, 2008, : 3568 - +
  • [38] Feature selection based on rough sets and particle swarm optimization
    Wang, Xiangyang
    Yang, Jie
    Teng, Xiaolong
    Xia, Weijun
    Jensen, Richard
    PATTERN RECOGNITION LETTERS, 2007, 28 (04) : 459 - 471
  • [39] Probe mechanism based particle swarm optimization for feature selection
    Zhang, Hongbo
    Qin, Xiwen
    Gao, Xueliang
    CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS, 2024, 27 (06): : 8393 - 8411
  • [40] Intrusion Feature Selection Algorithm Based on Particle Swarm Optimization
    Tong, Lihong
    Wu, Qingtao
    INTERNATIONAL JOURNAL OF COMPUTER SCIENCE AND NETWORK SECURITY, 2014, 14 (12): : 40 - 44