Novel machine learning approach for classification of high-dimensional microarray data

Authors
Rabia Aziz Musheer
C. K. Verma
Namita Srivastava
Affiliations
[1] VIT University Bhopal, Department of SASL (Mathematics)
[2] Maulana Azad National Institute of Technology, Department of Mathematics and Computer Application
Source
Soft Computing | 2019 / Vol. 23
Keywords
Independent component analysis (ICA); Artificial bee colony (ABC); Naïve Bayes (NB); Cancer classification
Abstract
Independent component analysis (ICA) is a powerful technique for reducing the dimensionality of large datasets in many applications, and it has been used for feature extraction from microarray gene expression data in numerous works. One of the merits of ICA is that the number of extracted features is always equal to the number of samples. When ICA is applied to microarray data, however, it faces the challenge of finding the best subset of genes (features) among the extracted features. To resolve this problem, we propose a new artificial bee colony (ABC)-based feature selection approach for microarray data. The approach has two stages: an ICA-based extraction stage that reduces the size of the data, and an ABC-based wrapper stage that optimizes the reduced feature vectors. To validate the approach, extensive experiments compared the performance of ICA + ABC with results from recently published and previously suggested gene selection methods for the Naïve Bayes (NB) classifier. A statistical hypothesis test was applied to this comparison on six benchmark microarray cancer classification datasets. The experimental results show that the proposed approach improves on all the compared algorithms for the NB classifier at a certain level of significance.
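The wrapper stage described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: it assumes the ICA components have already been extracted (a synthetic matrix stands in for them), uses a simplified binary ABC in which each food source is a keep/drop mask, and scores each mask by the leave-one-out accuracy of a small Gaussian Naive Bayes. The names `nb_loo_accuracy` and `abc_select`, and all parameter values, are hypothetical choices made for illustration.

```python
import math
import random

def nb_loo_accuracy(X, y, mask):
    """Leave-one-out accuracy of Gaussian Naive Bayes on the selected columns."""
    cols = [j for j, keep in enumerate(mask) if keep]
    if not cols:
        return 0.0  # an empty subset cannot classify anything
    correct = 0
    for i, x_test in enumerate(X):
        best_label, best_score = None, -math.inf
        for label in sorted(set(y)):
            rows = [X[k] for k in range(len(X)) if k != i and y[k] == label]
            if not rows:
                continue
            score = math.log(len(rows) / (len(X) - 1))  # log prior
            for j in cols:
                vals = [r[j] for r in rows]
                mu = sum(vals) / len(vals)
                var = sum((v - mu) ** 2 for v in vals) / len(vals) + 1e-6
                score += -0.5 * math.log(2 * math.pi * var) - (x_test[j] - mu) ** 2 / (2 * var)
            if score > best_score:
                best_label, best_score = label, score
        correct += best_label == y[i]
    return correct / len(X)

def abc_select(X, y, n_sources=6, n_cycles=15, limit=4, seed=0):
    """Simplified binary ABC: each food source is a keep/drop mask over features."""
    rng = random.Random(seed)
    n_feat = len(X[0])

    def random_mask():
        return [rng.random() < 0.5 for _ in range(n_feat)]

    # First source keeps every feature, so the search never ends below that baseline.
    sources = [[True] * n_feat] + [random_mask() for _ in range(n_sources - 1)]
    scores = [nb_loo_accuracy(X, y, m) for m in sources]
    trials = [0] * n_sources
    best = max(range(n_sources), key=scores.__getitem__)
    best_mask, best_score = list(sources[best]), scores[best]

    def try_neighbour(i):
        cand = list(sources[i])
        j = rng.randrange(n_feat)
        cand[j] = not cand[j]  # flip one feature in or out
        s = nb_loo_accuracy(X, y, cand)
        if s > scores[i]:
            sources[i], scores[i], trials[i] = cand, s, 0
        else:
            trials[i] += 1

    for _ in range(n_cycles):
        for i in range(n_sources):  # employed bees: one local move per source
            try_neighbour(i)
        weights = [s + 1e-9 for s in scores]  # onlookers favour fitter sources
        for i in rng.choices(range(n_sources), weights=weights, k=n_sources):
            try_neighbour(i)
        best = max(range(n_sources), key=scores.__getitem__)
        if scores[best] > best_score:  # memorise the best subset so far
            best_mask, best_score = list(sources[best]), scores[best]
        for i in range(n_sources):  # scouts: abandon sources that stopped improving
            if trials[i] > limit:
                sources[i] = random_mask()
                scores[i] = nb_loo_accuracy(X, y, sources[i])
                trials[i] = 0
    return best_mask, best_score

# Synthetic stand-in for ICA components: column 0 separates the classes, the rest is noise.
data_rng = random.Random(1)
y = [0] * 10 + [1] * 10
X = [[4.0 * label + data_rng.gauss(0, 1)] + [data_rng.gauss(0, 1) for _ in range(5)]
     for label in y]
mask, score = abc_select(X, y)
print("selected components:", [j for j, keep in enumerate(mask) if keep])
print("leave-one-out accuracy:", score)
```

In a fuller pipeline the rows of `X` would come from an ICA decomposition of the expression matrix rather than from random numbers. Seeding one food source with the all-features mask is a design choice of this sketch: it guarantees the returned subset scores at least as well as using every component.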
Pages: 13409-13421
Page count: 12