Optimizing the Hybrid Feature Selection in the DNA Microarray for Cancer Diagnosis Using Fuzzy Entropy and the Giza Pyramid Construction Algorithm

被引:0
|
作者
Motevalli, Masoumeh [1 ]
Khalilian, Madjid [1 ]
Bastanfard, Azam [1 ]
机构
[1] Islamic Azad Univ, Dept Comp Engn, Karaj Branch, Karaj, Iran
关键词
Cancer diagnosis; microarray data; gene representation; feature selection; metaheuristics; fuzzy entropy; GENE-EXPRESSION DATA; CLASSIFICATION; SEARCH; OPTIMIZATION;
D O I
10.1142/S1469026824500317
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Biotechnological analysis of DNA microarray genes provides valuable insights into the discovery and treatment of diseases such as cancer. It may also be crucial for the prevention and treatment of other genetic diseases. However, due to the large number of features and dimensions in a DNA microarray, the "curse of dimensions" problem is very common. Many machine learning methods require an effective subset of input genes to achieve high accuracy. Unfortunately, extracting features (genes) is an inherently NP-hard problem. Recently, the use of metaheuristics to overcome the NP-hardness of the feature extraction problem has attracted the attention of many researchers. In this paper, we use the combination of fuzzy entropy and Giza Pyramid Construction (GPC) for feature selection. First, redundant features in the microarray dataset are removed using the fuzzy entropy approach. GPC is then used to reduce the execution time. This results in the selection of a near-optimal subset of genes for cancer detection. Dimensionality reduction with GPC followed by classification with Convolutional Neural Network (CNN) creates a synergy to increase efficiency. The proposed method is tested on five well-known cancer patient datasets: leukemia, lymphoma, MLL, ovarian, and SRBCT. The performance of CNN was also measured with four well-known classifiers, including K-nearest neighbor, na & iuml;ve Bayesian, decision tree, and logistic regression. Our results show that, on average, CNN has the highest accuracy, recall, precision, and F-measure in all datasets.
引用
收藏
页数:33
相关论文
共 50 条
  • [41] Optimizing feature selection and parameter tuning for breast cancer detection using hybrid GAHBA-DNN framework
    Devi K.K.
    Sekar J.R.
    Journal of Intelligent and Fuzzy Systems, 2024, 46 (04): : 8037 - 8048
  • [42] mRMR-ABC: A Hybrid Gene Selection Algorithm for Cancer Classification Using Microarray Gene Expression Profiling
    Alshamlan, Hala
    Badr, Ghada
    Alohali, Yousef
    BIOMED RESEARCH INTERNATIONAL, 2015, 2015
  • [43] An Intelligent System for Lung Cancer Diagnosis Using a New Genetic Algorithm Based Feature Selection Method
    Chunhong Lu
    Zhaomin Zhu
    Xiaofeng Gu
    Journal of Medical Systems, 2014, 38
  • [44] An Intelligent System for Lung Cancer Diagnosis Using a New Genetic Algorithm Based Feature Selection Method
    Lu, Chunhong
    Zhu, Zhaomin
    Gu, Xiaofeng
    JOURNAL OF MEDICAL SYSTEMS, 2014, 38 (09)
  • [45] Gene Selection Using Hybrid Multi-Objective Cuckoo Search Algorithm With Evolutionary Operators for Cancer Microarray Data
    Othman, Mohd Shahizan
    Kumaran, Shamini Raja
    Yusuf, Lizawati Mi
    IEEE ACCESS, 2020, 8 : 186348 - 186361
  • [46] Feature selection strategy based on hybrid crow search optimization algorithm integrated with chaos theory and fuzzy c-means algorithm for medical diagnosis problems
    Ahmed M. Anter
    Mumtaz Ali
    Soft Computing, 2020, 24 : 1565 - 1584
  • [47] Feature selection strategy based on hybrid crow search optimization algorithm integrated with chaos theory and fuzzy c-means algorithm for medical diagnosis problems
    Anter, Ahmed M.
    Ali, Mumtaz
    SOFT COMPUTING, 2020, 24 (03) : 1565 - 1584
  • [48] Hybrid Feature Selection Using the Firefly Algorithm for Automatic Detection of Benign/Malignant Breast Cancer in Ultrasound Images
    Jesuharan, Dafni Rose
    Delsy, Thason Thaj Mary
    Kandasamy, Vijayakumar
    Kanagasabapathy, Pradeep Mohan Kumar
    TRAITEMENT DU SIGNAL, 2023, 40 (06) : 2671 - 2681
  • [49] Pap smear diagnosis using a hybrid intelligent scheme focusing on genetic algorithm based feature selection and nearest neighbor classification
    Marinakis, Yannis
    Dounias, Georgios
    Jantzen, Jan
    COMPUTERS IN BIOLOGY AND MEDICINE, 2009, 39 (01) : 69 - 78
  • [50] A hybrid feature selection model based on improved squirrel search algorithm and rank aggregation using fuzzy techniques for biomedical data classification
    Nagarajan, Gayathri
    Babu, L. D. Dhinesh
    NETWORK MODELING AND ANALYSIS IN HEALTH INFORMATICS AND BIOINFORMATICS, 2021, 10 (01):