An epicurean learning approach to gene-expression data classification

Cited by: 20
Authors
Albrecht, A [1]
Vinterbo, SA
Ohno-Machado, L
Affiliations
[1] Univ Hertfordshire, Dept Comp Sci, Hatfield AL10 9AB, Herts, England
[2] Harvard Univ, Sch Med, Decis Syst Grp, Boston, MA USA
[3] MIT, Div Hlth Sci & Technol, Cambridge, MA 02139 USA
Keywords
perceptrons; simulated annealing; gene-expression analysis
DOI
10.1016/S0933-3657(03)00036-8
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
We investigate the use of perceptrons for classification of microarray data, using two datasets that were published in [Nat. Med. 7 (6) (2001) 673] and [Science 286 (1999) 531]. The classification problem studied by Khan et al. is related to the diagnosis of small round blue cell tumours (SRBCT) of childhood, which are difficult to classify both clinically and via routine histology. Golub et al. study acute myeloid leukemia (AML) and acute lymphoblastic leukemia (ALL). We used a simulated annealing-based method to learn a system of perceptrons, each obtained by resampling the training set. Our results are comparable to those of Khan et al. and Golub et al., indicating that there is a role for perceptrons in the classification of tumours based on gene-expression data. We also show that it is critical to perform feature selection in this type of model, i.e. we propose a method for identifying genes that might be significant for the particular tumour types. For SRBCTs, zero error on test data has been obtained for only 13 out of 2308 genes; for the ALL/AML problem, we have zero error for 9 out of 7129 genes that are used for the classification procedure. Furthermore, we provide evidence that Epicurean-style learning and simulated annealing-based search are both essential for obtaining the best classification results. (C) 2003 Elsevier Science B.V. All rights reserved.
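The general idea in the abstract (several perceptrons, each fitted by simulated annealing on a resampled training set, then combined) can be illustrated with a minimal sketch. This is not the authors' algorithm: the perturbation scheme, cooling schedule, ensemble size, majority vote, and all function names below are assumptions chosen for illustration, and the paper's feature-selection step is not reproduced.

```python
import math
import random

def score(w, x):
    """Linear score of a perceptron; w[0] is the bias term."""
    return w[0] + sum(wi * xi for wi, xi in zip(w[1:], x))

def errors(w, data):
    """Count misclassified samples; labels are -1 or +1."""
    return sum(1 for x, y in data if (1 if score(w, x) >= 0 else -1) != y)

def anneal_perceptron(data, n_features, rng, steps=2000, t0=1.0, cooling=0.995):
    """Search for weights by simulated annealing on the training error count.

    The Gaussian perturbation and geometric cooling are illustrative choices.
    """
    w = [0.0] * (n_features + 1)
    cur = errors(w, data)
    best_w, best = list(w), cur
    t = t0
    for _ in range(steps):
        cand = list(w)
        cand[rng.randrange(len(cand))] += rng.gauss(0.0, 0.5)
        e = errors(cand, data)
        # Metropolis rule: accept improvements, occasionally accept worse moves.
        if e <= cur or rng.random() < math.exp((cur - e) / t):
            w, cur = cand, e
            if cur < best:
                best_w, best = list(w), cur
        t = max(t * cooling, 1e-3)
    return best_w

def train_ensemble(data, n_features, n_models=7, seed=0):
    """Fit one perceptron per bootstrap resample of the training set."""
    rng = random.Random(seed)
    models = []
    for _ in range(n_models):
        sample = [rng.choice(data) for _ in data]
        models.append(anneal_perceptron(sample, n_features, rng))
    return models

def predict(models, x):
    """Combine the perceptrons by majority vote (ties go to +1)."""
    votes = sum(1 if score(w, x) >= 0 else -1 for w in models)
    return 1 if votes >= 0 else -1
```

Here each sample is a pair `(features, label)`; with real expression data, `features` would hold the values of the selected genes for one tissue sample.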
Pages: 75-87
Number of pages: 13
Related papers
50 in total
  • [21] GENE EXPRESSION DATA CLASSIFICATION AND PATTERN ANALYSIS USING DATA DRIVEN APPROACH
    Ramisa, Aiman Jabeen
    Hossain, Ananna
    Islam, S. K. Md Injamul
    Swadesh, Ponuel Mollah
    Islam, Md Toushif
    Rahman, Md Anisur
    Parvez, Mohammad Zavid
    PROCEEDINGS OF 2021 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS (ICMLC), 2021, : 82 - 90
  • [22] Gene expression data classification using topology and machine learning models
    Tamal K. Dey
    Sayan Mandal
    Soham Mukherjee
    BMC Bioinformatics, 22
  • [23] Cancer Classification of Gene Expression Data using Machine Learning Models
    De Guia, Joseph M.
    Devaraj, Madhavi
    Vea, Larry A.
    2018 IEEE 10TH INTERNATIONAL CONFERENCE ON HUMANOID, NANOTECHNOLOGY, INFORMATION TECHNOLOGY, COMMUNICATION AND CONTROL, ENVIRONMENT AND MANAGEMENT (HNICEM), 2018,
  • [24] Gene expression data classification using topology and machine learning models
    Dey, Tamal K.
    Mandal, Sayan
    Mukherjee, Soham
    BMC BIOINFORMATICS, 2022, 22 (SUPPL 10)
  • [25] Clustering gene-expression data with repeated measurements
    Yeung, KY
    Medvedovic, M
    Bumgarner, RE
    GENOME BIOLOGY, 2003, 4 (05)
  • [26] Squeezing out more gene-expression data
    Constans, A
    SCIENTIST, 2004, 18 (23): : 37 - 37
  • [27] Evolutionary algorithms for clustering gene-expression data
    Hruschka, ER
    de Castro, LN
    Campello, RJGB
    FOURTH IEEE INTERNATIONAL CONFERENCE ON DATA MINING, PROCEEDINGS, 2004, : 403 - 406
  • [28] Clustering gene-expression data with repeated measurements
    Ka Yee Yeung
    Mario Medvedovic
    Roger E Bumgarner
    Genome Biology, 4
  • [29] A fuzzy approach to clustering and selecting features for classification of gene expression data
    Chitsaz, Elham
    Taheri, Mohammad
    Katebi, Seraj D.
    WORLD CONGRESS ON ENGINEERING 2008, VOLS I-II, 2008, : 1650 - 1655
  • [30] An efficient statistical feature selection approach for classification of gene expression data
    Chandra, B.
    Gupta, Manish
    JOURNAL OF BIOMEDICAL INFORMATICS, 2011, 44 (04) : 529 - 535