Hybrid binary COOT algorithm with simulated annealing for feature selection in high-dimensional microarray data

被引:23
|
作者
Pashaei, Elnaz [1 ]
Pashaei, Elham [2 ]
机构
[1] Istanbul Aydin Univ, Dept Software Engn, Istanbul, Turkey
[2] Istanbul Gelisim Univ, Dept Comp Engn, Istanbul, Turkey
来源
NEURAL COMPUTING & APPLICATIONS | 2023年 / 35卷 / 01期
关键词
Cancer classification; Feature selection; Gene selection; COOT optimization algorithm; GENE SELECTION; OPTIMIZATION ALGORITHM; SALP SWARM; CLASSIFICATION;
D O I
10.1007/s00521-022-07780-7
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Microarray analysis of gene expression can help with disease and cancer diagnosis and prognosis. Identification of gene biomarkers is one of the most difficult issues in microarray cancer classification due to the diverse complexity of different cancers and the high dimensionality of data. In this paper, a new gene selection strategy based on the binary COOT (BCOOT) optimization algorithm is proposed. The COOT algorithm is a newly proposed optimizer whose ability to solve gene selection problems has yet to be explored. Three binary variants of the COOT algorithm are suggested to search for the targeting genes to classify cancer and diseases. The proposed algorithms are BCOOT, BCOOT-C, and BCOOT-CSA. In the first method, a hyperbolic tangent transfer function is used to convert the continuous version of the COOT algorithm to binary. In the second approach, a crossover operator (C) is used to improve the global search of the BCOOT algorithm. In the third method, BCOOT-C is hybridized with simulated annealing (SA) to boost the algorithm's local exploitation capabilities in order to find robust and stable informative genes. Furthermore, minimum redundancy maximum relevance (mRMR) is used as a prefiltering technique to eliminate redundant genes. The proposed algorithms are tested on ten well-known microarray datasets and then compared to other powerful optimization algorithms, and recent state-of-the-art gene selection techniques. The experimental results demonstrate that the BCOOT-CSA approach surpasses BCOOT and BCOOT-C and outperforms other techniques in terms of prediction accuracy and the number of selected genes in most cases.
引用
收藏
页码:353 / 374
页数:22
相关论文
共 50 条
  • [31] An efficient multivariate feature ranking method for gene selection in high-dimensional microarray data
    Lee, Junghye
    Choi, In Young
    Jun, Chi-Hyuck
    EXPERT SYSTEMS WITH APPLICATIONS, 2021, 166
  • [32] A fast dual-module hybrid high-dimensional feature selection algorithm
    Yang, Geying
    He, Junjiang
    Lan, Xiaolong
    Li, Tao
    Fang, Wenbo
    INFORMATION SCIENCES, 2024, 681
  • [33] Feature Selection for high Dimensional DNA Microarray data using hybrid approaches
    Kumar, Ammu Prasanna
    Valsala, Preeja
    BIOINFORMATION, 2013, 9 (16) : 824 - 828
  • [34] FEATURE SELECTION FOR HIGH-DIMENSIONAL DATA ANALYSIS
    Verleysen, Michel
    NCTA 2011: PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON NEURAL COMPUTATION THEORY AND APPLICATIONS, 2011, : IS23 - IS25
  • [35] Feature selection for high-dimensional data in astronomy
    Zheng, Hongwen
    Zhang, Yanxia
    ADVANCES IN SPACE RESEARCH, 2008, 41 (12) : 1960 - 1964
  • [36] Feature selection for high-dimensional imbalanced data
    Yin, Liuzhi
    Ge, Yong
    Xiao, Keli
    Wang, Xuehua
    Quan, Xiaojun
    NEUROCOMPUTING, 2013, 105 : 3 - 11
  • [37] A filter feature selection for high-dimensional data
    Janane, Fatima Zahra
    Ouaderhman, Tayeb
    Chamlal, Hasna
    JOURNAL OF ALGORITHMS & COMPUTATIONAL TECHNOLOGY, 2023, 17
  • [38] Enhancing classification with hybrid feature selection: A multi-objective genetic algorithm for high-dimensional data
    Bohrer, Jonas da S.
    Dorn, Marcio
    EXPERT SYSTEMS WITH APPLICATIONS, 2024, 255
  • [39] A Hybrid Feature Selection Algorithm Applied to High-dimensional Imbalanced Small-sample Data Classification
    Feng, Fang
    Lv, Qingquan
    Wang, Mingsong
    Yang, Xuhui
    Zhou, Qingguo
    Zhou, Rui
    2019 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2019, : 41 - 46
  • [40] Feature selection for high-dimensional temporal data
    Michail Tsagris
    Vincenzo Lagani
    Ioannis Tsamardinos
    BMC Bioinformatics, 19