An epicurean learning approach to gene-expression data classification

被引:20
|
作者
Albrecht, A [1 ]
Vinterbo, SA
Ohno-Machado, L
机构
[1] Univ Hertfordshire, Dept Comp Sci, Hatfield AL10 9AB, Herts, England
[2] Harvard Univ, Sch Med, Decis Syst Grp, Boston, MA USA
[3] MIT, Div Hlth Sci & Technol, Cambridge, MA 02139 USA
关键词
perceptrons; simulated annealing; gene-expression analysis;
D O I
10.1016/S0933-3657(03)00036-8
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We investigate the use of perceptrons for classification of microarray data where we use two datasets that were published in [Nat. Med. 7 (6) (2001) 673] and [Science 286 (1999) 531]. The classification problem studied by Khan et al. is related to the diagnosis of small round blue cell tumours (SRBCT) of childhood which are difficult to classify both clinically and via routine histology. Golub et al. study acute myeloid leukemia (AML) and acute lymphoblastic leukemia (ALL). We used a simulated annealing-based method in learning a system of perceptions, each obtained by resampling of the training set. Our results are comparable to those of Khan et al. and Golub et al., indicating that there is a role for perceptrons in the classification of tumours based on gene-expression data. We also show that it is critical to perform feature selection in this type of models, i.e. we propose a method for identifying genes that might be significant for the particular tumour types. For SRBCTs, zero error on test data has been obtained for only 13 out of 2308 genes; for the ALL/AML problem, we have zero error for 9 out of 7129 genes that are used for the classification procedure. Furthermore, we provide evidence that Epicurean-style learning and simulated annealing-based search are both essential for obtaining the best classification results. (C) 2003 Elsevier Science B.V. All rights reserved.
引用
收藏
页码:75 / 87
页数:13
相关论文
共 50 条
  • [41] Deep Learning Based Tumor Type Classification Using Gene Expression Data
    Lyu, Boyu
    Haque, Anamul
    ACM-BCB'18: PROCEEDINGS OF THE 2018 ACM INTERNATIONAL CONFERENCE ON BIOINFORMATICS, COMPUTATIONAL BIOLOGY, AND HEALTH INFORMATICS, 2018, : 89 - 96
  • [42] Partial Mutual Information for Classification of Gene Expression Data by Learning Vector Quantization
    Lange, Mandy
    Nebel, David
    Villmann, Thomas
    ADVANCES IN SELF-ORGANIZING MAPS AND LEARNING VECTOR QUANTIZATION, 2014, 295 : 259 - 269
  • [43] New ensemble machine learning method for classification and prediction on gene expression data
    Wang, Ching Wei
    2006 28TH ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY, VOLS 1-15, 2006, : 60 - 63
  • [44] Cancer Classification Based on Microarray Gene Expression Data Using Deep Learning
    Guillen, Pablo
    Ebalunode, Jerry
    2016 INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE & COMPUTATIONAL INTELLIGENCE (CSCI), 2016, : 1403 - 1405
  • [45] The Omics Dashboard for interactive exploration of gene-expression data
    Paley, Suzanne
    Parker, Karen
    Spaulding, Aaron
    Tomb, Jean-Francois
    O'Maille, Paul
    Karp, Peter D.
    NUCLEIC ACIDS RESEARCH, 2017, 45 (21) : 12113 - 12124
  • [46] Key aspects of analyzing microarray gene-expression data
    Chen, James J.
    PHARMACOGENOMICS, 2007, 8 (05) : 473 - 482
  • [47] Inferring cancer metabolism from gene-expression data
    Mohanty, Vakul
    NATURE REVIEWS CANCER, 2024, 24 (04) : 230 - 230
  • [48] Minimum standards set out for gene-expression data
    Jonathan Knight
    Nature, 2002, 415 : 946 - 946
  • [49] Consensus clustering and functional interpretation of gene-expression data
    Stephen Swift
    Allan Tucker
    Veronica Vinciotti
    Nigel Martin
    Christine Orengo
    Xiaohui Liu
    Paul Kellam
    Genome Biology, 5
  • [50] Clustered alignments of gene-expression time series data
    Smith, Adam A.
    Vollrath, Aaron
    Bradfield, Christopher A.
    Craven, Mark
    BIOINFORMATICS, 2009, 25 (12) : I119 - I127