A comprehensive learning based swarm optimization approach for feature selection in gene expression data

Cited by: 2
Authors
Easwaran, Subha [1]
Venugopal, Jothi Prakash [2]
Subramanian, Arul Antran Vijay [3]
Sundaram, Gopikrishnan [4]
Naseeba, Beebi [4]
Affiliations
[1] Karpagam Coll Engn, Dept Sci & Humanities, Coimbatore 641032, Tamil Nadu, India
[2] Karpagam Coll Engn, Dept Informat Technol, Coimbatore 641032, Tamil Nadu, India
[3] Karpagam Coll Engn, Dept Comp Sci & Engn, Coimbatore 641032, Tamil Nadu, India
[4] VIT AP Univ, Sch Comp Sci & Engn, Amaravathi 522241, Andhra Pradesh, India
Keywords
Comprehensive learning; Feature selection; Gene expression; Gene selection; Swarm intelligence; Cancer classification; Microarray; Classification
DOI
10.1016/j.heliyon.2024.e37165
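Chinese Library Classification (CLC)
O [Mathematical sciences and chemistry]; P [Astronomy and earth sciences]; Q [Biological sciences]; N [General natural sciences]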
Discipline classification codes
07; 0710; 09
Abstract
Gene expression data analysis is challenging due to the high dimensionality and complexity of the data. Feature selection, which identifies relevant genes, is a common preprocessing step. We propose a Comprehensive Learning-Based Swarm Optimization (CLBSO) approach for feature selection in gene expression data. CLBSO leverages the strengths of ants and grasshoppers to efficiently explore the high-dimensional search space. Ants perform local search and leave pheromone trails to guide the swarm, while grasshoppers use their ability to jump long distances to explore new regions and avoid local optima. The proposed approach was evaluated on several publicly available gene expression datasets and compared with state-of-the-art feature selection methods. CLBSO achieved an average accuracy improvement of 15% over the original high-dimensional data and outperformed other feature selection methods by up to 10%. For instance, in the Pancreatic cancer dataset, CLBSO achieved 97.2% accuracy, significantly higher than XGBoost-MOGA's 84.0%. Convergence analysis showed CLBSO required fewer iterations to reach optimal solutions. Statistical analysis confirmed significant performance improvements, and stability analysis demonstrated consistent gene subset selection across different runs. These findings highlight the robustness and efficacy of CLBSO in handling complex gene expression datasets, making it a valuable tool for enhancing classification tasks in bioinformatics.
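The division of labor described in the abstract (pheromone-guided local search combined with occasional long jumps) can be illustrated with a minimal sketch. This is not the authors' CLBSO implementation: the function names `toy_fitness` and `hybrid_select`, the pheromone update rule, the jump size, and every parameter value are assumptions made for illustration, and the fitness function is a synthetic stand-in for classifier accuracy on a real gene expression dataset.

```python
import random


def toy_fitness(mask, informative, penalty=0.01):
    """Hypothetical stand-in for classifier accuracy.

    Rewards selecting the (assumed known) informative genes and
    penalizes the size of the selected subset.
    """
    hits = sum(1 for i in informative if mask[i])
    return hits / len(informative) - penalty * sum(mask)


def hybrid_select(n_features, fitness, n_iter=300, jump_prob=0.2,
                  evaporation=0.1, seed=0):
    """Toy binary feature selection mixing two kinds of moves.

    "Ant"-style move: re-decide one feature using a pheromone value.
    "Grasshopper"-style move: resample a large block of features.
    Both rules are illustrative assumptions, not the published method.
    """
    rng = random.Random(seed)
    pheromone = [0.5] * n_features  # per-feature inclusion preference
    best = [rng.random() < 0.5 for _ in range(n_features)]
    best_score = fitness(best)

    for _ in range(n_iter):
        cand = best[:]
        if rng.random() < jump_prob:
            # Long jump: randomize a quarter of the features at once.
            block = rng.sample(range(n_features), max(1, n_features // 4))
            for i in block:
                cand[i] = rng.random() < 0.5
        else:
            # Local move: re-decide a single feature from its pheromone.
            i = rng.randrange(n_features)
            cand[i] = rng.random() < pheromone[i]

        score = fitness(cand)
        if score > best_score:
            best, best_score = cand, score

        # Evaporate, then deposit on features in the current best subset.
        # Clamping keeps local moves from stalling completely.
        for i in range(n_features):
            target = 1.0 if best[i] else 0.0
            p = (1 - evaporation) * pheromone[i] + evaporation * target
            pheromone[i] = min(0.9, max(0.1, p))

    return best, best_score


if __name__ == "__main__":
    informative = {0, 3, 7}  # assumed ground truth for the toy problem
    mask, score = hybrid_select(20, lambda m: toy_fitness(m, informative))
    print([i for i, keep in enumerate(mask) if keep], round(score, 3))
```

In a realistic setting the `fitness` callable would wrap cross-validated classifier accuracy on the selected gene subset; the synthetic function here only keeps the sketch self-contained.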
Pages: 20