Feature Selection for Microarray Gene Expression Data Using Simulated Annealing Guided by the Multivariate Joint Entropy

被引:14
|
作者
Fernando Gonzalez-Navarro, Felix [1 ]
Belanche-Munoz, Lluis A. [2 ]
机构
[1] Univ Autonoma Baja California, Inst Ingn, Mexicali, Baja California, Mexico
[2] Univ Politecn Cataluna, Dept Llenguatges & Sistemes Informat, Barcelona, Spain
来源
COMPUTACION Y SISTEMAS | 2014年 / 18卷 / 02期
关键词
Feature selection; microarray gene expression data; multivariate joint entropy; simulated annealing;
D O I
10.13053/CyS-18-2-2014-032
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Microarray classification poses many challenges for data analysis, given that a gene expression data set may consist of dozens of observations with thousands or even tens of thousands of genes. In this context, feature subset selection techniques can be very useful to reduce the representation space to one that is manageable by classification techniques. In this work we use the discretized multivariate joint entropy as the basis for a fast evaluation of gene relevance in a Microarray Gene Expression context. The proposed algorithm combines a simulated annealing schedule specially designed for feature subset selection with the incrementally computed joint entropy, reusing previous values to compute current feature subset relevance. This combination turns out to be a powerful tool when applied to the maximization of gene subset relevance. Our method delivers highly interpretable solutions that are more accurate than competing methods. The algorithm is fast, effective and has no critical parameters. The experimental results in several public-domain microarray data sets show a notoriously high classification performance and low size subsets, formed mostly by biologically meaningful genes. The technique is general and could be used in other similar scenarios.
引用
收藏
页码:275 / 293
页数:19
相关论文
共 50 条
  • [31] Feature selection methods on gene expression microarray data for cancer classification: A systematic review
    Alhenawi, Esra'a
    Al-Sayyed, Rizik
    Hudaib, Amjad
    Mirjalili, Seyedali
    COMPUTERS IN BIOLOGY AND MEDICINE, 2022, 140
  • [32] A discrete bacterial algorithm for feature selection in classification of microarray gene expression cancer data
    Wang, Hong
    Jing, Xingjian
    Niu, Ben
    KNOWLEDGE-BASED SYSTEMS, 2017, 126 : 8 - 19
  • [33] Unsupervised Feature Selection for Microarray Gene Expression Data Based on Discriminative Structure Learning
    Ye, Xiucai
    Sakurai, Tetsuya
    JOURNAL OF UNIVERSAL COMPUTER SCIENCE, 2018, 24 (06) : 725 - 741
  • [34] Feature selection using neighborhood entropy-based uncertainty measures for gene expression data classification
    Sun, Lin
    Zhang, Xiaoyu
    Qian, Yuhua
    Xu, Jiucheng
    Zhang, Shiguang
    INFORMATION SCIENCES, 2019, 502 : 18 - 41
  • [35] Improving classification accuracy of cancer types using parallel hybrid feature selection on microarray gene expression data
    Lokeswari Venkataramana
    Shomona Gracia Jacob
    Rajavel Ramadoss
    Dodda Saisuma
    Dommaraju Haritha
    Kunthipuram Manoja
    Genes & Genomics, 2019, 41 : 1301 - 1313
  • [36] Improving classification accuracy of cancer types using parallel hybrid feature selection on microarray gene expression data
    Venkataramana, Lokeswari
    Jacob, Shomona Gracia
    Ramadoss, Rajavel
    Saisuma, Dodda
    Haritha, Dommaraju
    Manoja, Kunthipuram
    GENES & GENOMICS, 2019, 41 (11) : 1301 - 1313
  • [37] Gene selection for tumor classification using microarray gone expression data
    Yendrapalli, K.
    Basnet, R.
    Mukkamala, S.
    Sung, A. H.
    WORLD CONGRESS ON ENGINEERING 2007, VOLS 1 AND 2, 2007, : 290 - +
  • [38] Feature gene selection based on fuzzy neighborhood joint entropy
    Yan Wang
    Minjie Sun
    Linbo Long
    Jinhui Liu
    Yifan Ren
    Complex & Intelligent Systems, 2024, 10 : 129 - 144
  • [39] Feature gene selection based on fuzzy neighborhood joint entropy
    Wang, Yan
    Sun, Minjie
    Long, Linbo
    Liu, Jinhui
    Ren, Yifan
    COMPLEX & INTELLIGENT SYSTEMS, 2024, 10 (01) : 129 - 144
  • [40] A Hybrid Feature Selection Method Using Gene Expression Data
    Chuang, Li-Yeh
    Wu, Kuo-Chuan
    Yang, Cheng-Hong
    2009 9TH IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOENGINEERING, 2009, : 100 - +