Optimization Based Tumor Classification from Microarray Gene Expression Data

Cited by: 55
Authors
Dagliyan, Onur [1]
Uney-Yuksektepe, Fadime [2]
Kavakli, I. Halil [1]
Turkay, Metin [3]
Affiliations
[1] Koc Univ, Dept Chem & Biol Engn, Istanbul, Turkey
[2] Istanbul Kultur Univ, Dept Ind Engn, Istanbul, Turkey
[3] Koc Univ, Dept Ind Engn, Istanbul, Turkey
Source
PLOS ONE | 2011 / Volume 6 / Issue 02
Keywords
BAYESIAN VARIABLE SELECTION; PARTIAL LEAST-SQUARES; B-CELL LYMPHOMAS; PROSTATE-CANCER; LOGISTIC-REGRESSION; PREDICTION; LEUKEMIA; BINDING; IDENTIFICATION; ORGANIZATION;
DOI
10.1371/journal.pone.0014579
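Chinese Library Classification (CLC)
O [Mathematical Sciences and Chemistry]; P [Astronomy and Earth Sciences]; Q [Biological Sciences]; N [General Natural Sciences];
Subject classification codes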
07 ; 0710 ; 09 ;
Abstract
Background: An important use of data obtained from microarray measurements is the classification of tumor types with respect to genes that are either up- or down-regulated in specific cancer types. A number of algorithms have been proposed to obtain such classifications. These algorithms usually require parameter optimization to obtain accurate results, depending on the type of data. Additionally, it is critical to find an optimal set of markers among those up- or down-regulated genes that can be clinically utilized to build assays for diagnosing specific cancer types or following their progression. In this paper, we employ a mixed-integer-programming-based classification algorithm, the hyper-box enclosure (HBE) method, to classify some cancer types with a minimal set of predictor genes. This optimization-based method, which is a user-friendly and efficient classifier, may allow clinicians to diagnose certain cancer types and follow their progression.
Methodology/Principal Findings: We apply the HBE algorithm to some well-known data sets, such as leukemia, prostate cancer, diffuse large B-cell lymphoma (DLBCL), and small round blue cell tumors (SRBCT), to find predictor genes that can be utilized for diagnosis and prognosis in a robust manner with high accuracy. Our approach does not require any modification or parameter optimization for each data set. Additionally, the information gain attribute evaluator, the relief attribute evaluator, and correlation-based feature selection are employed for gene selection. The results are compared with those from other studies, and the biological roles of the selected genes in the corresponding cancer types are described.
Conclusions/Significance: The overall performance of our algorithm was better than that of the other algorithms reported in the literature and of the classifiers found in the WEKA data-mining package. Since it does not require parameter optimization and consistently achieves a very high prediction rate on different types of data sets, the HBE method is an effective and consistent tool for cancer type prediction with a small number of gene markers.
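The abstract names information gain as one of the gene-selection criteria. As a rough illustration of that idea only, the sketch below ranks genes by the information gain of a single threshold split on expression level. It is a simplified stand-in, not the WEKA evaluator or the authors' pipeline: the function names, the mean-expression threshold, and the toy expression values and class labels are all assumptions made for this example.

```python
import math
from collections import Counter


def entropy(labels):
    """Shannon entropy (in bits) of a list of class labels."""
    n = len(labels)
    counts = Counter(labels)
    return -sum((c / n) * math.log2(c / n) for c in counts.values())


def information_gain(values, labels, threshold):
    """Entropy reduction from splitting samples at `threshold`.

    Samples with expression <= threshold go to one side, the rest to
    the other. Empty sides contribute nothing to the weighted entropy.
    """
    low = [y for x, y in zip(values, labels) if x <= threshold]
    high = [y for x, y in zip(values, labels) if x > threshold]
    n = len(labels)
    remainder = sum(len(part) / n * entropy(part)
                    for part in (low, high) if part)
    return entropy(labels) - remainder


def rank_genes(expression, labels):
    """Return (gene, gain) pairs sorted by decreasing information gain.

    Assumption: each gene is split at its mean expression level, a
    deliberately crude choice made to keep the sketch short.
    """
    scores = {}
    for gene, values in expression.items():
        threshold = sum(values) / len(values)
        scores[gene] = information_gain(values, labels, threshold)
    return sorted(scores.items(), key=lambda item: item[1], reverse=True)


# Hypothetical toy data: four samples, two classes, two genes.
toy_expression = {
    "gene_a": [0.1, 0.2, 0.9, 1.1],  # separates the two classes cleanly
    "gene_b": [0.5, 0.4, 0.6, 0.5],  # nearly uninformative
}
toy_labels = ["ALL", "ALL", "AML", "AML"]

ranking = rank_genes(toy_expression, toy_labels)
```

In this toy setting `gene_a` ranks first because its split yields two pure groups, which is the kind of gene a small marker panel would favor. A real analysis would use a principled discretization and cross-validation rather than a fixed mean threshold.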
Pages: 10
Related papers
50 in total
  • [41] RPCA-Based Tumor Classification Using Gene Expression Data
    Liu, Jin-Xing
    Xu, Yong
    Zheng, Chun-Hou
    Kong, Heng
    Lai, Zhi-Hui
    IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2015, 12 (04) : 964 - 970
  • [42] ANFIS-Based Wrapper Model Gene Selection for Cancer Classification on Microarray Gene Expression Data
    Mahmoudi, Sina
    Lahijan, Biyuk Sadeghi
    Kanan, Hamidreza Rashidy
    2013 13TH IRANIAN CONFERENCE ON FUZZY SYSTEMS (IFSC), 2013,
  • [43] A graph-theoretic technique for classification of normal and tumor tissues using gene expression microarray data
    Kim, Saejoon
    2007 ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY, VOLS 1-16, 2007, : 4621 - 4624
  • [44] A Comparative Study of Two Multiple Classification Methods Based on Partial Least Squares Using Tumor Microarray Gene Expression Data
    Jin Zhichao
    Gao Qingbin
    He Jia
    COMPREHENSIVE EVALUATION OF ECONOMY AND SOCIETY WITH STATISTICAL SCIENCE, 2009, : 1212 - 1222
  • [45] Optimized gene selection and classification of cancer from microarray gene expression data using deep learning
    Shah, Shamveel Hussain
    Iqbal, Muhammad Javed
    Ahmad, Iftikhar
    Khan, Suleman
    Rodrigues, Joel J. P. C.
    NEURAL COMPUTING & APPLICATIONS, 2020,
  • [46] Tumor classification from gene expression data: A coding-based multiclass learning approach
    Hüntemann, A
    González, JC
    Tapia, E
    BIOLOGICAL AND MEDICAL DATA ANALYSIS, PROCEEDINGS, 2005, 3745 : 211 - 222
  • [48] Cancer classification based on microarray gene expression data using a principal component accumulation method
    Liu JingJing
    Cai WenSheng
    Shao XueGuang
    Research Center for Analytical Sciences, College of Chemistry, Nankai University, Tianjin, China
    Science China(Chemistry), 2011, 54 (05) : 802 - 811
  • [49] Cancer classification based on microarray gene expression data using a principal component accumulation method
    Liu JingJing
    Cai WenSheng
    Shao XueGuang
    SCIENCE CHINA-CHEMISTRY, 2011, 54 (05) : 802 - 811
  • [50] SVM-ABC based cancer microarray (gene expression) hybrid method for data classification
    Gulande, Punam
    Awale, R. N.
    COMPUTATIONAL INTELLIGENCE, 2023, 39 (06) : 1054 - 1072