Optimizing the Hybrid Feature Selection in the DNA Microarray for Cancer Diagnosis Using Fuzzy Entropy and the Giza Pyramid Construction Algorithm

被引:0
|
作者
Motevalli, Masoumeh [1 ]
Khalilian, Madjid [1 ]
Bastanfard, Azam [1 ]
机构
[1] Islamic Azad Univ, Dept Comp Engn, Karaj Branch, Karaj, Iran
关键词
Cancer diagnosis; microarray data; gene representation; feature selection; metaheuristics; fuzzy entropy; GENE-EXPRESSION DATA; CLASSIFICATION; SEARCH; OPTIMIZATION;
D O I
10.1142/S1469026824500317
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Biotechnological analysis of DNA microarray genes provides valuable insights into the discovery and treatment of diseases such as cancer. It may also be crucial for the prevention and treatment of other genetic diseases. However, due to the large number of features and dimensions in a DNA microarray, the "curse of dimensions" problem is very common. Many machine learning methods require an effective subset of input genes to achieve high accuracy. Unfortunately, extracting features (genes) is an inherently NP-hard problem. Recently, the use of metaheuristics to overcome the NP-hardness of the feature extraction problem has attracted the attention of many researchers. In this paper, we use the combination of fuzzy entropy and Giza Pyramid Construction (GPC) for feature selection. First, redundant features in the microarray dataset are removed using the fuzzy entropy approach. GPC is then used to reduce the execution time. This results in the selection of a near-optimal subset of genes for cancer detection. Dimensionality reduction with GPC followed by classification with Convolutional Neural Network (CNN) creates a synergy to increase efficiency. The proposed method is tested on five well-known cancer patient datasets: leukemia, lymphoma, MLL, ovarian, and SRBCT. The performance of CNN was also measured with four well-known classifiers, including K-nearest neighbor, na & iuml;ve Bayesian, decision tree, and logistic regression. Our results show that, on average, CNN has the highest accuracy, recall, precision, and F-measure in all datasets.
引用
收藏
页数:33
相关论文
共 50 条
  • [21] Feature Selection for Microarray Data Classification Using Hybrid Information Gain and a Modified Binary Krill Herd Algorithm
    Zhang, Ge
    Hou, Jincui
    Wang, Jianlin
    Yan, Chaokun
    Luo, Junwei
    INTERDISCIPLINARY SCIENCES-COMPUTATIONAL LIFE SCIENCES, 2020, 12 (03) : 288 - 301
  • [22] Feature Selection for Microarray Data Classification Using Hybrid Information Gain and a Modified Binary Krill Herd Algorithm
    Ge Zhang
    Jincui Hou
    Jianlin Wang
    Chaokun Yan
    Junwei Luo
    Interdisciplinary Sciences: Computational Life Sciences, 2020, 12 : 288 - 301
  • [23] Feature Selection and Classification of Microarray Data for Cancer Prediction Using MapReduce Implementation of Random Forest Algorithm
    Dhanalakshmi, R.
    Khaire, Utkarsh M.
    JOURNAL OF SCIENTIFIC & INDUSTRIAL RESEARCH, 2019, 78 (03): : 158 - 161
  • [24] Genetic algorithm-based feature selection with manifold learning for cancer classification using microarray data
    Wang, Zixuan
    Zhou, Yi
    Takagi, Tatsuya
    Song, Jiangning
    Tian, Yu-Shi
    Shibuya, Tetsuo
    BMC BIOINFORMATICS, 2023, 24 (01)
  • [25] Genetic algorithm-based feature selection with manifold learning for cancer classification using microarray data
    Zixuan Wang
    Yi Zhou
    Tatsuya Takagi
    Jiangning Song
    Yu-Shi Tian
    Tetsuo Shibuya
    BMC Bioinformatics, 24
  • [26] Optimizing the feature set for a Bayesian network for breast cancer diagnosis, by using genetic algorithm techniques
    Wang, XH
    Zheng, B
    Chang, YH
    Good, WF
    MEDICAL IMAGING 1999: IMAGE PROCESSING, PTS 1 AND 2, 1999, 3661 : 1574 - 1580
  • [27] A hybrid feature selection algorithm combining information gain and grouping particle swarm optimization for cancer diagnosis
    Yang, Fangyuan
    Xu, Zhaozhao
    Wang, Hong
    Sun, Lisha
    Zhai, Mengjiao
    Zhang, Juan
    PLOS ONE, 2024, 19 (03):
  • [28] Breast cancer diagnosis based on hybrid rule-based feature selection with deep learning algorithm
    Awotunde J.B.
    Panigrahi R.
    Khandelwal B.
    Garg A.
    Bhoi A.K.
    Research on Biomedical Engineering, 2023, 39 (01) : 115 - 127
  • [29] Improving classification accuracy of cancer types using parallel hybrid feature selection on microarray gene expression data
    Venkataramana, Lokeswari
    Jacob, Shomona Gracia
    Ramadoss, Rajavel
    Saisuma, Dodda
    Haritha, Dommaraju
    Manoja, Kunthipuram
    GENES & GENOMICS, 2019, 41 (11) : 1301 - 1313
  • [30] Improving classification accuracy of cancer types using parallel hybrid feature selection on microarray gene expression data
    Lokeswari Venkataramana
    Shomona Gracia Jacob
    Rajavel Ramadoss
    Dodda Saisuma
    Dommaraju Haritha
    Kunthipuram Manoja
    Genes & Genomics, 2019, 41 : 1301 - 1313