PRFE-driven gene selection with multi-classifier ensemble for cancer classification

被引:0
|
作者
Behuria, Smitirekha [1 ]
Swain, Sujata [1 ]
Bandyopadhyay, Anjan [1 ]
Al-Sadoon, Mohammad Khalid [2 ]
Mallik, Saurav [3 ,4 ]
机构
[1] Kalinga Inst Ind Technol, Sch Comp Engn, Bhubaneswar 751024, Odisha, India
[2] King Saud Univ, Coll Sci, Dept Zool, POB 2455, Riyadh 11451, Saudi Arabia
[3] Harvard TH Chan Sch Publ Hlth, Dept Environm Hlth, Boston, MA 02115 USA
[4] Univ Arizona, Dept Pharmacol & Toxicol, Tucson, MA 85721 USA
关键词
Principal recursive feature eliminator (PRFE); Recursive feature elimination; Long short-term memory; LightGBM; CatBoost; Convolutional neural network; Gene expression analysis; BREAST-CANCER; EXPRESSION; ALGORITHM;
D O I
10.1016/j.eij.2025.100637
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this era, cancer remains a paramount concern due to its pervasive impact on individuals and societies, persistent challenges in treatment and prevention, and the ongoing need for global collaboration and innovation to improve outcomes and reduce its burden. Cancer marked by uncontrolled cell growth is a leading global cause of mortality, necessitating advanced methods for accurate diagnosis. This study introduces an innovative unsupervised feature selection technique Principal Recursive Feature Eliminator (PRFE) for selection of genes and cancer classification. Subsequently, seven different classifiers-Support Vector Machine, Random Forest, CatBoost, Light Gradient Boosting Method, Artificial Neural Network, Convolutional Neural Network, Long Short-Term Memory are used to increase the model's robustness. The proposed approach is evaluated on nine benchmark gene expression datasets with a combination of two different algorithms. A series of experiments are conducted to assess the proposed method, focusing on the selected features and identifying optimal classifiers. We have calculated F1-Score, accuracy, recall, and precision. The suggested strategy performs better than expected, as the results highlight its potential to improve cancer classification techniques with an accuracy of 99.98%. We conclude from this analysis that, across many datasets, the CatBoost and CNN model outperforms the other models. This research contributes to the ongoing efforts to improve diagnostic precision and treatment outcomes in cancer research.
引用
收藏
页数:15
相关论文
共 50 条
  • [41] An Ensemble of Cooperative Parallel Metaheuristics for Gene Selection in Cancer Classification
    Boucheham, Anouar
    Batouche, Mohamed
    Meshoul, Souham
    BIOINFORMATICS AND BIOMEDICAL ENGINEERING (IWBBIO 2015), PT II, 2015, 9044 : 301 - 312
  • [42] A New Multi-classifier Ensemble Algorithm Based on D-S Evidence Theory
    Kaiyi Zhao
    Li Li
    Zeqiu Chen
    Ruizhi Sun
    Gang Yuan
    Jiayao Li
    Neural Processing Letters, 2022, 54 : 5005 - 5021
  • [43] A New Multi-classifier Ensemble Algorithm Based on D-S Evidence Theory
    Zhao, Kaiyi
    Li, Li
    Chen, Zeqiu
    Sun, Ruizhi
    Yuan, Gang
    Li, Jiayao
    NEURAL PROCESSING LETTERS, 2022, 54 (06) : 5005 - 5021
  • [44] A Novel SPDF Ensemble Classifier for Cancer Classification
    Zhang, Chunying
    Wu, Fang
    Tong, Tuopeng
    Chen, Sun
    Song, Kai
    Ma, Min
    Zheng, Guangqiang
    2013 NINTH INTERNATIONAL CONFERENCE ON NATURAL COMPUTATION (ICNC), 2013, : 1037 - 1041
  • [45] Cancer Classification Utilizing Voting Classifier with Ensemble Feature Selection Method and Transcriptomic Data
    Khatun, Rabea
    Akter, Maksuda
    Islam, Md. Manowarul
    Uddin, Md. Ashraf
    Talukder, Md. Alamin
    Kamruzzaman, Joarder
    Azad, Akm
    Paul, Bikash Kumar
    Almoyad, Muhammad Ali Abdulllah
    Aryal, Sunil
    Moni, Mohammad Ali
    GENES, 2023, 14 (09)
  • [46] Hybrid Feature Selection and Ensemble Learning Methods for Gene Selection and Cancer Classification
    Qasem, Sultan Noman
    Saeed, Faisal
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2021, 12 (02) : 193 - 200
  • [47] Optimizing multi-classifier fusion for seabed sediment classification using machine learning
    Anokye, Michael
    Cui, Xiaodong
    Yang, Fanlin
    Wang, Ping
    Sun, Yuewen
    Ma, Hadong
    Amoako, Emmanuel Oduro
    INTERNATIONAL JOURNAL OF DIGITAL EARTH, 2024, 17 (01)
  • [48] A mixture generative adversarial network with category multi-classifier for hyperspectral image classification
    Li, Hengchao
    Wang, Weiye
    Ye, Shaohui
    Deng, Yangjun
    Zhang, Fan
    Du, Qian
    REMOTE SENSING LETTERS, 2020, 11 (11) : 983 - 992
  • [49] Classification Based on Multi-classifier of SVM fusion for Steel Strip Surface Defects
    Gao Yi
    Yang Yanxi
    2013 32ND CHINESE CONTROL CONFERENCE (CCC), 2013, : 3617 - 3622
  • [50] An automatic methodology for construction of multi-classifier systems based on the combination of selection and fusion
    de Lima T.P.F.
    da Silva A.J.
    Ludermir T.B.
    de Oliveira W.R.
    Progress in Artificial Intelligence, 2014, 2 (4) : 205 - 215