Benchmarking Classification Models for Cancer Prediction from Gene Expression Data: A Novel Approach and New Findings

Times cited: 0
Authors
Ramani, R. Geetha [1]
Jacob, Shomona Gracia [1]
Affiliations
[1] Anna Univ, Madras 600025, Tamil Nadu, India
Source
STUDIES IN INFORMATICS AND CONTROL | 2013 / Vol. 22 / No. 02
Keywords
Cancer prediction; Gene Expression; Feature Relevance; Multi-class classification; MICROARRAY DATA; SELECTION;
DOI
Not available
Chinese Library Classification number
TP [Automation Technology, Computer Technology];
Subject classification code
0812;
Abstract
Gene selection from gene expression data for cancer prediction has been an area of intensive research, aiming to identify the minimal and optimal set of candidate genes that yields accurate predictive performance. The two major problems encountered in this process are the high dimensionality of the data relative to the small number of instances and the need to categorize records under multiple classes. In this paper we propose a novel approach called Rank-Weight Feature Selection that combines the filtering capacity of more than one feature selection algorithm to detect the minimal set of predictive genes that yields higher predictor performance in categorizing and predicting diverse oncogenic gene expression data. The filtered features (genes) are weighted by the number of feature relevance algorithms that report them as significant. The ranked genes are then used to validate the proposed method with ten classifiers over five diverse gene expression datasets. The results show that the proposed approach, using the most relevant and minimal set of genes, achieves higher predictive performance with fewer features than previously reported results. The paper also recommends classifiers based on their accuracy and reliability in predicting cancer data.
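The weighting step described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract does not name the feature relevance algorithms, any cut-off on the weight, or how ties are broken, so the filter names, the `min_weight` parameter, the alphabetical tie-break, and the gene labels below are all hypothetical. The sketch only shows the stated idea that a gene's weight is the number of filters reporting it as significant, and that genes are then ranked by that weight.

```python
from collections import Counter


def rank_weight_select(selections, min_weight=1):
    """Rank genes by how many filters selected them.

    selections: dict mapping a filter name to the genes it reported as
        significant (hypothetical inputs; the abstract names no filters).
    min_weight: assumed cut-off; genes selected by fewer filters are dropped.
    Returns a list of (gene, weight) pairs, highest weight first.
    """
    weights = Counter()
    for selected in selections.values():
        # Each filter contributes at most one vote per gene.
        weights.update(set(selected))
    kept = [gene for gene, w in weights.items() if w >= min_weight]
    # Sort by descending weight; ties broken alphabetically (an assumption).
    kept.sort(key=lambda gene: (-weights[gene], gene))
    return [(gene, weights[gene]) for gene in kept]


# Hypothetical outputs of three feature relevance filters.
selections = {
    "filter_a": ["G1", "G2", "G3"],
    "filter_b": ["G2", "G3", "G4"],
    "filter_c": ["G3", "G5"],
}

ranked = rank_weight_select(selections, min_weight=2)
print(ranked)  # [('G3', 3), ('G2', 2)]
```

In a full pipeline the ranked genes would then be passed to the classifiers being benchmarked; that stage is omitted here because the abstract does not specify the classifiers or the evaluation protocol.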
Pages: 133-142
Page count: 10
Related papers
50 in total
  • [41] A new feature selection approach for optimizing prediction models, applied to breast cancer subtype classification
    Pham Quang Huy
    Ngom, Alioune
    Rueda, Luis
    2016 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE (BIBM), 2016, : 1535 - 1541
  • [42] VALIDATION OF CLASSIFICATION MODELS AND DATA REDUCTION METHODS BASED ON GENE EXPRESSION DATA
    Rafiee, Mohammad
    Rafiei, Fatemeh
    Tabatabaei, Seyyed Mohammad
    AlaviMajd, Hamid
    Rafiei, Ali
    Khodakarim, Soheila
    JP JOURNAL OF BIOSTATISTICS, 2019, 16 (02) : 79 - 90
  • [44] A Novel Aggregated Multiple Imputation Approach for Enhanced Survival Prediction and Classification on Breast Cancer and Lung Cancer Data
    Deepa, P.
    Gunavathi, C.
    IEEE ACCESS, 2024, 12 : 189102 - 189121
  • [45] Computational intelligence approach for gene expression data mining and classification
    Wang, ZY
    Kung, SY
    Zhang, JY
    Khan, J
    Xuan, JH
    Wang, Y
    2003 INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, VOL III, PROCEEDINGS, 2003, : 449 - 452
  • [46] An epicurean learning approach to gene-expression data classification
    Albrecht, A
    Vinterbo, SA
    Ohno-Machado, L
    ARTIFICIAL INTELLIGENCE IN MEDICINE, 2003, 28 (01) : 75 - 87
  • [47] Meta-learning approach to gene expression data classification
    de Souza, Bruno Feres
    Soares, Carlos
    de Carvalho, Andre C. P. L. F.
    INTERNATIONAL JOURNAL OF INTELLIGENT COMPUTING AND CYBERNETICS, 2009, 2 (02) : 285 - 303
  • [48] Analyzing RNA-Seq Gene Expression Data for Cancer Classification Through ML Approach
    Wahid, Abdul
    Banday, M. Tariq
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2023, 14 (09) : 798 - 810
  • [49] Prediction of Child Tumours from Microarray Gene Expression Data Through Parallel Gene Selection and Classification on Spark
    Lokeswari, Y. V.
    Jacob, Shomona Gracia
    COMPUTATIONAL INTELLIGENCE IN DATA MINING, CIDM 2016, 2017, 556 : 651 - 661
  • [50] Prediction of RNA Methylation Status From Gene Expression Data Using Classification and Regression Methods
    Xue, Hao
    Wei, Zhen
    Chen, Kunqi
    Tang, Yujiao
    Wu, Xiangyu
    Su, Jionglong
    Meng, Jia
    EVOLUTIONARY BIOINFORMATICS, 2020, 16