Using a Genetic Algorithm and a Perceptron for Feature Selection and Supervised Class Learning in DNA Microarray Data

Cited by: 1
Authors
Michal Karzynski
Álvaro Mateos
Javier Herrero
Joaquín Dopazo
Affiliation
[1] Centro Nacional de Investigaciones Oncológicas (CNIO), Bioinformatics Unit
Keywords
clustering; dimensionality reduction; feature selection; gene expression; genetic algorithm; perceptron; SOTA; weights
DOI
Not available
Abstract
Class prediction and feature selection are key in the context of diagnostic applications of DNA microarrays. Microarray data are noisy and typically comprise a small number of samples and a large number of genes. Perceptrons can be an efficient tool for accurate classification of microarray data. Nevertheless, the large input layers required for the direct application of perceptrons and the small number of samples available for training hamper their use. Two strategies can be taken for an optimal use of a perceptron with a favourable balance between the number of training samples and the size of the input layer: (a) reducing the dimensionality of the data set from thousands to no more than one hundred highly informative average values, and using the weights of the perceptron for feature selection, or (b) using a selection of only a few genes that produce an optimal classification with the perceptron. In the latter case, feature selection is carried out first. A combined approach is also possible. In this manuscript we explore and compare both alternatives. We study the informative content of the data at different levels of compression with a very efficient clustering algorithm (Self Organizing Tree Algorithm, SOTA). We show how a simple genetic algorithm selects a subset of gene expression values with 100% accuracy in the classification of samples with maximum efficiency. Finally, the importance of dimensionality reduction is discussed in light of its capacity for reducing noise and redundancies in microarray data.
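Strategy (b) in the abstract, a genetic algorithm choosing a small gene subset that a perceptron then classifies, can be illustrated with a minimal sketch. This is not the authors' implementation: the function names, the toy data generator, the population size, the mutation rate, the per-gene penalty of 0.01, and the use of training accuracy as fitness are all assumptions made for illustration. A realistic study would likely score candidates with cross-validation rather than training accuracy.

```python
import random


def train_perceptron(X, y, epochs=50, lr=0.1):
    """Train a single-layer perceptron with the classic update rule (labels 0/1)."""
    w = [0.0] * len(X[0])
    b = 0.0
    for _ in range(epochs):
        errors = 0
        for xi, yi in zip(X, y):
            pred = 1 if sum(wj * xj for wj, xj in zip(w, xi)) + b > 0 else 0
            delta = yi - pred
            if delta:
                errors += 1
                w = [wj + lr * delta * xj for wj, xj in zip(w, xi)]
                b += lr * delta
        if errors == 0:  # every sample classified correctly, stop early
            break
    return w, b


def accuracy(w, b, X, y):
    """Fraction of samples the perceptron (w, b) classifies correctly."""
    hits = 0
    for xi, yi in zip(X, y):
        pred = 1 if sum(wj * xj for wj, xj in zip(w, xi)) + b > 0 else 0
        hits += pred == yi
    return hits / len(y)


def fitness(mask, X, y, penalty=0.01):
    """Assumed fitness: training accuracy on the selected genes minus a size penalty."""
    idx = [i for i, keep in enumerate(mask) if keep]
    if not idx:
        return 0.0
    Xs = [[row[i] for i in idx] for row in X]
    w, b = train_perceptron(Xs, y)
    return accuracy(w, b, Xs, y) - penalty * len(idx)


def select_genes(X, y, pop_size=20, generations=30, p_mut=0.05, seed=0):
    """Simple GA over boolean gene masks (needs at least 2 genes and pop_size >= 4)."""
    rng = random.Random(seed)
    n = len(X[0])
    pop = [[rng.random() < 0.2 for _ in range(n)] for _ in range(pop_size)]
    for _ in range(generations):
        ranked = sorted(pop, key=lambda m: fitness(m, X, y), reverse=True)
        parents = ranked[: pop_size // 2]  # truncation selection, parents survive
        children = []
        while len(parents) + len(children) < pop_size:
            a, c = rng.sample(parents, 2)
            cut = rng.randrange(1, n)  # one-point crossover
            child = a[:cut] + c[cut:]
            child = [(not g) if rng.random() < p_mut else g for g in child]
            children.append(child)
        pop = parents + children
    best = max(pop, key=lambda m: fitness(m, X, y))
    return [i for i, keep in enumerate(best) if keep]


def make_toy_data(n_samples=20, n_genes=30, informative=3, seed=1):
    """Hypothetical data: one gene separates the two classes, the rest are noise."""
    rng = random.Random(seed)
    X, y = [], []
    for s in range(n_samples):
        label = s % 2
        row = [rng.gauss(0, 1) for _ in range(n_genes)]
        row[informative] = (1.0 if label else -1.0) + rng.gauss(0, 0.1)
        X.append(row)
        y.append(label)
    return X, y


if __name__ == "__main__":
    X, y = make_toy_data()
    print("selected gene indices:", select_genes(X, y))
```

The size penalty is one simple way to push the search toward few genes; with many more genes than samples, almost any large subset separates the training samples, so some pressure of this kind (or held-out evaluation) is needed for the selection to be meaningful.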
Pages: 39-51
Number of pages: 12