Using a Genetic Algorithm and a Perceptron for Feature Selection and Supervised Class Learning in DNA Microarray Data

被引:1
|
作者
Michal Karzynski
Álvaro Mateos
Javier Herrero
Joaquín Dopazo
机构
[1] Centro Nacional de Investigaciones Oncológicas (CNIO),Bioinformatics Unit
来源
关键词
clustering; dimensionality reduction; feature selection; gene expression; genetic algorithm; perceptron; SOTA; weights;
D O I
暂无
中图分类号
学科分类号
摘要
Class prediction and feature selection is keyin the context of diagnostic applications ofDNA microarrays. Microarray data is noisy andtypically composed of a low number of samplesand a large number of genes. Perceptrons canconstitute an efficient tool for accurateclassification of microarray data.Nevertheless, the large input layers necessaryfor the direct application of perceptrons andthe low samples available for the trainingprocess hamper its use. Two strategies can betaken for an optimal use of a perceptron with afavourable balance between samples for trainingand the size of the input layer: (a) reducingthe dimensionality of the data set fromthousands to no more than one hundred, highlyinformative average values, and using theweights of the perceptron for feature selectionor (b) using a selection of only few genesthat produce an optimal classification with theperceptron. In this case, feature selection iscarried out first. Obviously, a combinedapproach is also possible. In this manuscriptwe explore and compare both alternatives. Westudy the informative contents of the data atdifferent levels of compression with a veryefficient clustering algorithm (Self OrganizingTree Algorithm). We show how a simple geneticalgorithm selects a subset of gene expressionvalues with 100% accuracy in theclassification of samples with maximumefficiency. Finally, the importance ofdimensionality reduction is discussed in lightof its capacity for reducing noise andredundancies in microarray data.
引用
收藏
页码:39 / 51
页数:12
相关论文
共 50 条
  • [21] Feature Selection for high Dimensional DNA Microarray data using hybrid approaches
    Kumar, Ammu Prasanna
    Valsala, Preeja
    BIOINFORMATION, 2013, 9 (16) : 824 - 828
  • [22] Multitasking Feature Selection Using a Clonal Selection Algorithm for High-Dimensional Microarray Data
    Wang, Yi
    Luo, Dan
    Yao, Jian
    ELECTRONICS, 2024, 13 (23):
  • [23] Exploring the consequences of distributed feature selection in DNA microarray data
    Bolon-Canedo, Veronica
    Sechidis, Konstantinos
    Sanchez-Marono, Noelia
    Alonso-Betanzos, Amparo
    Brown, Gavin
    2017 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2017, : 1665 - 1672
  • [24] A Filter Based Feature Selection Algorithm Using Null Space of Covariance Matrix for DNA Microarray Gene Expression Data
    Sharma, Alok
    Imoto, Seiya
    Miyano, Satoru
    CURRENT BIOINFORMATICS, 2012, 7 (03) : 289 - 294
  • [25] A Novel Feature Selection Algorithm using Particle Swarm Optimization for Cancer Microarray Data
    Sahu, Barnali
    Mishra, Debahuti
    INTERNATIONAL CONFERENCE ON MODELLING OPTIMIZATION AND COMPUTING, 2012, 38 : 27 - 31
  • [26] Evolutionary algorithm for feature subset selection in predicting tumor outcomes using microarray data
    Tan, Qihua
    Thomassen, Mads
    Jochumsen, Kirsten M.
    Zhao, Jing Hua
    Christensen, Kaare
    Kruse, Torbert A.
    BIOINFORMATICS RESEARCH AND APPLICATIONS, 2008, 4983 : 426 - +
  • [27] Feature Selection for Self-Supervised Classification With Applications to Microarray and Sequence Data
    Kung, Sun-Yuan
    Mak, Man-Wai
    IEEE JOURNAL OF SELECTED TOPICS IN SIGNAL PROCESSING, 2008, 2 (03) : 297 - 309
  • [28] Feature Selection Software Development Using Artificial Bee Colony on DNA Microarray Data
    Andaru, Wildan
    Syarif, Iwan
    Barakbah, Ali Ridho
    2017 INTERNATIONAL ELECTRONICS SYMPOSIUM ON KNOWLEDGE CREATION AND INTELLIGENT COMPUTING (IES-KCIC), 2017, : 6 - 11
  • [29] Unsupervised reduction of the dimensionality followed by supervised learning with a perceptron improves the classification of conditions in DNA microarray gene expression data.
    Conde, L
    Mateos, A
    Herrero, J
    Dopazo, J
    NEURAL NETWORKS FOR SIGNAL PROCESSING XII, PROCEEDINGS, 2002, : 77 - 86
  • [30] Hybrid feature selection based on SLI and genetic algorithm for microarray datasets
    Sedighe Abasabadi
    Hossein Nematzadeh
    Homayun Motameni
    Ebrahim Akbari
    The Journal of Supercomputing, 2022, 78 : 19725 - 19753