Analysis of complexity indices for classification problems: Cancer gene expression data

被引:41
|
作者
Lorena, Ana C.
Costa, Ivan G. [1 ]
Spolaor, Newton
de Souto, Marcilio C. P. [1 ]
机构
[1] Univ Fed Pernambuco, Ctr Informat, Recife, PE, Brazil
关键词
Classification; Gene expression data; Complexity indices; Linear separability; BREAST-CANCER; MICROARRAY; SENSITIVITY; PREDICTION; ALGORITHMS; SELECTION; RANKING;
D O I
10.1016/j.neucom.2011.03.054
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Currently, cancer diagnosis at a molecular level has been made possible through the analysis of gene expression data. More specifically, one usually uses machine learning (ML) techniques to build, from cancer gene expression data, automatic diagnosis models (classifiers). Cancer gene expression data often present some characteristics that can have a negative impact in the generalization ability of the classifiers generated. Some of these properties are data sparsity and an unbalanced class distribution. We investigate the results of a set of indices able to extract the intrinsic complexity information from the data. Such measures can be used to analyze, among other things, which particular characteristics of cancer gene expression data mostly impact the prediction ability of support vector machine classifiers. In this context, we also show that, by applying a proper feature selection procedure to the data, one can reduce the influence of those characteristics in the error rates of the classifiers induced. (C) 2011 Elsevier B.V. All rights reserved.
引用
收藏
页码:33 / 42
页数:10
相关论文
共 50 条
  • [31] Toward a Measure of Classification Complexity in Gene Expression Signatures
    Kamath, Vidya
    Yeatman, Timothy J.
    Eschrich, Steven A.
    2008 30TH ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY, VOLS 1-8, 2008, : 5704 - +
  • [32] GENE EXPRESSION DATA CLASSIFICATION AND PATTERN ANALYSIS USING DATA DRIVEN APPROACH
    Ramisa, Aiman Jabeen
    Hossain, Ananna
    Islam, S. K. Md Injamul
    Swadesh, Ponuel Mollah
    Islam, Md Toushif
    Rahman, Md Anisur
    Parvez, Mohammad Zavid
    PROCEEDINGS OF 2021 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS (ICMLC), 2021, : 82 - 90
  • [33] Optimized kernel machines for cancer classification using gene expression data
    Xiong, H
    Chen, XW
    Proceedings of the 2005 IEEE Symposium on Computational Intelligence in Bioinformatics and Computational Biology, 2005, : 268 - 274
  • [34] Cancer Classification of Gene Expression Data using Machine Learning Models
    De Guia, Joseph M.
    Devaraj, Madhavi
    Vea, Larry A.
    2018 IEEE 10TH INTERNATIONAL CONFERENCE ON HUMANOID, NANOTECHNOLOGY, INFORMATION TECHNOLOGY, COMMUNICATION AND CONTROL, ENVIRONMENT AND MANAGEMENT (HNICEM), 2018,
  • [35] A graph-theoretic classification of gene expression microarray data of cancer
    Kim, Saejoon
    PROCEEDINGS OF THE FRONTIERS IN THE CONVERGENCE OF BIOSCIENCE AND INFORMATION TECHNOLOGIES, 2007, : 179 - 182
  • [36] AN ADAPTIVE WAVELET DENOISING AND CLASSIFICATION OF MULTICLASS CANCER GENE EXPRESSION DATA
    Bhaskar, Navya S.
    Devi, Aswathy T.
    2017 INTERNATIONAL CONFERENCE ON INTELLIGENT COMPUTING, INSTRUMENTATION AND CONTROL TECHNOLOGIES (ICICICT), 2017, : 780 - 786
  • [37] Cancer Classification by Sparse Representation using Microarray Gene Expression Data
    Hang, Xiyi
    2008 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE WORKSHOPS, PROCEEDINGS, 2008, : 174 - 177
  • [38] Informative gene discovery for cancer classification from microarray expression data
    Ng, M
    Chan, LW
    2005 IEEE WORKSHOP ON MACHINE LEARNING FOR SIGNAL PROCESSING (MLSP), 2005, : 393 - 398
  • [39] Classification of breast cancer using microarray gene expression data: A survey
    Abd-Elnaby, Muhammed
    Alfonse, Marco
    Roushdy, Mohamed
    JOURNAL OF BIOMEDICAL INFORMATICS, 2021, 117
  • [40] Pan-cancer classification on gene expression data by neural network
    Yu, Kijin
    Kim, Bong-Hyun
    Lee, Peter Chang Whan
    CANCER RESEARCH, 2019, 79 (13)