Optimization Based Tumor Classification from Microarray Gene Expression Data

被引:55
|
作者
Dagliyan, Onur [1 ]
Uney-Yuksektepe, Fadime [2 ]
Kavakli, I. Halil [1 ]
Turkay, Metin [3 ]
机构
[1] Koc Univ, Dept Chem & Biol Engn, Istanbul, Turkey
[2] Istanbul Kultur Univ, Dept Ind Engn, Istanbul, Turkey
[3] Koc Univ, Dept Ind Engn, Istanbul, Turkey
来源
PLOS ONE | 2011年 / 6卷 / 02期
关键词
BAYESIAN VARIABLE SELECTION; PARTIAL LEAST-SQUARES; B-CELL LYMPHOMAS; PROSTATE-CANCER; LOGISTIC-REGRESSION; PREDICTION; LEUKEMIA; BINDING; IDENTIFICATION; ORGANIZATION;
D O I
10.1371/journal.pone.0014579
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Background: An important use of data obtained from microarray measurements is the classification of tumor types with respect to genes that are either up or down regulated in specific cancer types. A number of algorithms have been proposed to obtain such classifications. These algorithms usually require parameter optimization to obtain accurate results depending on the type of data. Additionally, it is highly critical to find an optimal set of markers among those up or down regulated genes that can be clinically utilized to build assays for the diagnosis or to follow progression of specific cancer types. In this paper, we employ a mixed integer programming based classification algorithm named hyper-box enclosure method (HBE) for the classification of some cancer types with a minimal set of predictor genes. This optimization based method which is a user friendly and efficient classifier may allow the clinicians to diagnose and follow progression of certain cancer types. Methodology/Principal Findings: We apply HBE algorithm to some well known data sets such as leukemia, prostate cancer, diffuse large B-cell lymphoma (DLBCL), small round blue cell tumors (SRBCT) to find some predictor genes that can be utilized for diagnosis and prognosis in a robust manner with a high accuracy. Our approach does not require any modification or parameter optimization for each data set. Additionally, information gain attribute evaluator, relief attribute evaluator and correlation-based feature selection methods are employed for the gene selection. The results are compared with those from other studies and biological roles of selected genes in corresponding cancer type are described. Conclusions/Significance: The performance of our algorithm overall was better than the other algorithms reported in the literature and classifiers found in WEKA data-mining package. Since it does not require a parameter optimization and it performs consistently very high prediction rate on different type of data sets, HBE method is an effective and consistent tool for cancer type prediction with a small number of gene markers.
引用
收藏
页数:10
相关论文
共 50 条
  • [21] Spatial clustering based gene selection for gene expression analysis in microarray data classification
    Dhas, P. Edwin
    Lalitha, S.
    Govindaraj, Annalakshmi
    Jyoshna, B.
    AUTOMATIKA, 2024, 65 (01) : 152 - 158
  • [22] Classification of Microarray Gene Expression Data Using an Infiltration Tactics Optimization (ITO) Algorithm
    Zahoor, Javed
    Zafar, Kashif
    GENES, 2020, 11 (07) : 1 - 28
  • [23] A STUDY ON GENE SELECTION AND CLASSIFICATION ALGORITHMS FOR CLASSIFICATION OF MICROARRAY GENE EXPRESSION DATA
    Chin, Yeo Lee
    Deris, Safaai
    JURNAL TEKNOLOGI, 2005, 43
  • [24] Tumor classification ranking from microarray data
    Rattikorn Hewett
    Phongphun Kijsanayothin
    BMC Genomics, 9
  • [25] Tumor classification ranking from microarray data
    Hewett, Rattikorn
    Kijsanayothin, Phongphun
    BMC GENOMICS, 2008, 9 (Suppl 2)
  • [26] An efficient approach for classification of gene expression microarray data
    Sreepada, Rama Syamala
    Vipsita, Swati
    Mohapatra, Puspanjali
    2014 FOURTH INTERNATIONAL CONFERENCE OF EMERGING APPLICATIONS OF INFORMATION TECHNOLOGY (EAIT), 2014, : 344 - 348
  • [27] Dimension reduction for classification with gene expression microarray data
    Dai, Jian J.
    Lieu, Linh
    Rocke, David
    STATISTICAL APPLICATIONS IN GENETICS AND MOLECULAR BIOLOGY, 2006, 5
  • [28] Cancer Classification Based on Microarray Gene Expression Data Using Deep Learning
    Guillen, Pablo
    Ebalunode, Jerry
    2016 INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE & COMPUTATIONAL INTELLIGENCE (CSCI), 2016, : 1403 - 1405
  • [29] SVM-based tumor classification with gene expression data
    Wang, Shulin
    Wang, Ji
    Chen, Huowang
    Zhang, Boyun
    ADVANCED DATA MINING AND APPLICATIONS, PROCEEDINGS, 2006, 4093 : 864 - 870
  • [30] Feature selection and ranking of key genes for tumor classification: Using microarray gene expression data
    Mukkamala, Srinivas
    Liu, Qingzhong
    Veeraghattam, Rajeev
    Sung, Andrew H.
    ARTIFICIAL INTELLIGENCE AND SOFT COMPUTING - ICAISC 2006, PROCEEDINGS, 2006, 4029 : 951 - 961