An efficient approach for classification of gene expression microarray data

被引:1
|
作者
Sreepada, Rama Syamala [1 ]
Vipsita, Swati [1 ]
Mohapatra, Puspanjali [1 ]
机构
[1] IIIT Bhubaneswar, Dept Comp Sci Engn, Bhubaneswar 751003, Orissa, India
关键词
Microarray; Feature extraction; feature selection; Probabilistic Neural Network; Genetic Algorithms; ALGORITHM;
D O I
10.1109/EAIT.2014.46
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Microarrays help in storing gene expression data from a cell. Each microarray describes features of each cell. The rows in microarray represent the samples and the columns represent the gene expression level of the cell. Microarray data is of high dimension due to which classification using conventional methods becomes tedious and inefficient. Therefore, reducing the dimension of long feature vector and extracting relevant features out of it becomes a very challenging task. This can be achieved using various techniques of feature extraction and/or feature selection. Design of an efficient classification model is another crucial task for any classification problem. In this paper, emphasis is given for significant feature extraction as well as efficient design of classifier. The task of microarray classification is done in two phases. In the first phase, a hybrid approach of Genetic Algorithm (GA) and Principal Component Analysis (PCA) is used for extracting relevant features. In the second phase, Probabilistic Neural Network (PNN) is used as the classifier and GA is implemented to optimize the topology of the PNN. The datasets used in the experiment are Colon Tumor, Diffuse Large B Cell Lymphoma (DLBCL) and Leukemia (ALL and AML). The proposed technique gave efficient results for the datasets used.
引用
收藏
页码:344 / 348
页数:5
相关论文
共 50 条
  • [21] Gene Expression Microarray Data Classification based on PCA and Cuttlefish Algorithm
    Dino, Hivi Ismat
    Hussein, Haval Ismael
    Hassan, Masoud Muhammed
    Eesa, Adel Sabry
    PROCEEDING OF THE 2ND 2022 INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND SOFTWARE ENGINEERING (CSASE 2022), 2022, : 277 - 282
  • [22] A graph-theoretic classification of gene expression microarray data of cancer
    Kim, Saejoon
    PROCEEDINGS OF THE FRONTIERS IN THE CONVERGENCE OF BIOSCIENCE AND INFORMATION TECHNOLOGIES, 2007, : 179 - 182
  • [23] Cancer Classification by Sparse Representation using Microarray Gene Expression Data
    Hang, Xiyi
    2008 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE WORKSHOPS, PROCEEDINGS, 2008, : 174 - 177
  • [24] Gene selection for tumor classification using microarray gone expression data
    Yendrapalli, K.
    Basnet, R.
    Mukkamala, S.
    Sung, A. H.
    WORLD CONGRESS ON ENGINEERING 2007, VOLS 1 AND 2, 2007, : 290 - +
  • [25] Informative gene discovery for cancer classification from microarray expression data
    Ng, M
    Chan, LW
    2005 IEEE WORKSHOP ON MACHINE LEARNING FOR SIGNAL PROCESSING (MLSP), 2005, : 393 - 398
  • [26] MIDClass: Microarray Data Classification by Association Rules and Gene Expression Intervals
    Giugno, Rosalba
    Pulvirenti, Alfredo
    Cascione, Luciano
    Pigola, Giuseppe
    Ferro, Alfredo
    PLOS ONE, 2013, 8 (08):
  • [27] Transductive Support Vector Machines for classification of microarray gene expression data
    Semolini, R
    Von Zuben, FJ
    PROCEEDINGS OF THE INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS 2003, VOLS 1-4, 2003, : 2946 - 2951
  • [28] A hybrid feature selection approach for microarray gene expression data
    Tan, Feng
    Fu, Xuezheng
    Wang, Hao
    Zhang, Yanqing
    Bourgeois, Anu
    COMPUTATIONAL SCIENCE - ICCS 2006, PT 2, PROCEEDINGS, 2006, 3992 : 678 - 685
  • [29] A mixture model approach for the analysis of microarray gene expression data
    Allison, David B.
    Gadbury, Gary L.
    Heo, Moonseong
    Fernández, José R.
    Lee, Cheol-Koo
    Prolla, Tomas A.
    Weindruch, Richard
    Computational Statistics and Data Analysis, 2002, 38 (05): : 1 - 20
  • [30] A mixture model approach for the analysis of microarray gene expression data
    Allison, DB
    Gadbury, GL
    Heo, MS
    Fernández, JR
    Lee, CK
    Prolla, TA
    Weindruch, R
    COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2002, 39 (01) : 1 - 20