Optimizing the Hybrid Feature Selection in the DNA Microarray for Cancer Diagnosis Using Fuzzy Entropy and the Giza Pyramid Construction Algorithm

被引:0
|
作者
Motevalli, Masoumeh [1 ]
Khalilian, Madjid [1 ]
Bastanfard, Azam [1 ]
机构
[1] Islamic Azad Univ, Dept Comp Engn, Karaj Branch, Karaj, Iran
关键词
Cancer diagnosis; microarray data; gene representation; feature selection; metaheuristics; fuzzy entropy; GENE-EXPRESSION DATA; CLASSIFICATION; SEARCH; OPTIMIZATION;
D O I
10.1142/S1469026824500317
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Biotechnological analysis of DNA microarray genes provides valuable insights into the discovery and treatment of diseases such as cancer. It may also be crucial for the prevention and treatment of other genetic diseases. However, due to the large number of features and dimensions in a DNA microarray, the "curse of dimensions" problem is very common. Many machine learning methods require an effective subset of input genes to achieve high accuracy. Unfortunately, extracting features (genes) is an inherently NP-hard problem. Recently, the use of metaheuristics to overcome the NP-hardness of the feature extraction problem has attracted the attention of many researchers. In this paper, we use the combination of fuzzy entropy and Giza Pyramid Construction (GPC) for feature selection. First, redundant features in the microarray dataset are removed using the fuzzy entropy approach. GPC is then used to reduce the execution time. This results in the selection of a near-optimal subset of genes for cancer detection. Dimensionality reduction with GPC followed by classification with Convolutional Neural Network (CNN) creates a synergy to increase efficiency. The proposed method is tested on five well-known cancer patient datasets: leukemia, lymphoma, MLL, ovarian, and SRBCT. The performance of CNN was also measured with four well-known classifiers, including K-nearest neighbor, na & iuml;ve Bayesian, decision tree, and logistic regression. Our results show that, on average, CNN has the highest accuracy, recall, precision, and F-measure in all datasets.
引用
收藏
页数:33
相关论文
共 50 条
  • [31] An efficient feature selection and classification system for microarray cancer data using genetic algorithm and deep belief networks
    Lawrence M.O.
    Jimoh R.G.
    Yahya W.B.
    Multimedia Tools and Applications, 2025, 84 (8) : 4393 - 4434
  • [32] CANCER MICROARRAY DATA FEATURE SELECTION USING MULTI-OBJECTIVE BINARY PARTICLE SWARM OPTIMIZATION ALGORITHM
    Annavarapu, Chandra Sekhara Rao
    Dara, Suresh
    Banka, Haider
    EXCLI JOURNAL, 2016, 15 : 460 - 473
  • [33] Feature Selection in Cancer Microarray Data using Multi-Objective Genetic Algorithm combined with Correlation Coefficient
    Hasnat, Abul
    Molla, Azhar Uddin
    IEEE INTERNATIONAL CONFERENCE ON EMERGING TECHNOLOGICAL TRENDS IN COMPUTING, COMMUNICATIONS AND ELECTRICAL ENGINEERING (ICETT), 2016,
  • [34] A Filter Based Feature Selection Algorithm Using Null Space of Covariance Matrix for DNA Microarray Gene Expression Data
    Sharma, Alok
    Imoto, Seiya
    Miyano, Satoru
    CURRENT BIOINFORMATICS, 2012, 7 (03) : 289 - 294
  • [35] Optimizing cancer diagnosis: A hybrid approach of genetic operators and Sinh Cosh Optimizer for tumor identification and feature gene selection
    Emam, Marwa M.
    Houssein, Essam H.
    Samee, Nagwan Abdel
    Alkhalifa, Amal K.
    Hosney, Mosa E.
    Computers in Biology and Medicine, 2024, 180
  • [36] Hybrid Filter and Genetic Algorithm-Based Feature Selection for Improving Cancer Classification in High-Dimensional Microarray Data
    Ali, Waleed
    Saeed, Faisal
    PROCESSES, 2023, 11 (02)
  • [37] High-performance breast cancer diagnosis method using hybrid feature selection method
    Moradi, Mohammad
    Rezai, Abdalhossein
    BIOMEDICAL ENGINEERING-BIOMEDIZINISCHE TECHNIK, 2025, 70 (02): : 171 - 181
  • [38] Feature selection using genetic algorithm for breast cancer diagnosis: experiment on three different datasets
    Aalaei, Shokoufeh
    Shahraki, Hadi
    Rowhanimanesh, Alireza
    Eslami, Saeid
    IRANIAN JOURNAL OF BASIC MEDICAL SCIENCES, 2016, 19 (05) : 476 - 482
  • [39] Identification and diagnosis of cervical cancer using a hybrid feature selection approach with the bayesian optimization-based optimized catboost classification algorithm
    Dhar J.
    Roy S.
    Journal of Ambient Intelligence and Humanized Computing, 2024, 15 (09) : 3459 - 3477
  • [40] An adaptive feature reduction algorithm for cancer classification using wavelet decomposition of serum proteomic and DNA microarray data
    Rashid, Sabrina
    Maruf, Golam Morshed
    2011 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE WORKSHOPS, 2011, : 305 - 312