PRFE-driven gene selection with multi-classifier ensemble for cancer classification

被引:0
|
作者
Behuria, Smitirekha [1 ]
Swain, Sujata [1 ]
Bandyopadhyay, Anjan [1 ]
Al-Sadoon, Mohammad Khalid [2 ]
Mallik, Saurav [3 ,4 ]
机构
[1] Kalinga Inst Ind Technol, Sch Comp Engn, Bhubaneswar 751024, Odisha, India
[2] King Saud Univ, Coll Sci, Dept Zool, POB 2455, Riyadh 11451, Saudi Arabia
[3] Harvard TH Chan Sch Publ Hlth, Dept Environm Hlth, Boston, MA 02115 USA
[4] Univ Arizona, Dept Pharmacol & Toxicol, Tucson, MA 85721 USA
关键词
Principal recursive feature eliminator (PRFE); Recursive feature elimination; Long short-term memory; LightGBM; CatBoost; Convolutional neural network; Gene expression analysis; BREAST-CANCER; EXPRESSION; ALGORITHM;
D O I
10.1016/j.eij.2025.100637
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this era, cancer remains a paramount concern due to its pervasive impact on individuals and societies, persistent challenges in treatment and prevention, and the ongoing need for global collaboration and innovation to improve outcomes and reduce its burden. Cancer marked by uncontrolled cell growth is a leading global cause of mortality, necessitating advanced methods for accurate diagnosis. This study introduces an innovative unsupervised feature selection technique Principal Recursive Feature Eliminator (PRFE) for selection of genes and cancer classification. Subsequently, seven different classifiers-Support Vector Machine, Random Forest, CatBoost, Light Gradient Boosting Method, Artificial Neural Network, Convolutional Neural Network, Long Short-Term Memory are used to increase the model's robustness. The proposed approach is evaluated on nine benchmark gene expression datasets with a combination of two different algorithms. A series of experiments are conducted to assess the proposed method, focusing on the selected features and identifying optimal classifiers. We have calculated F1-Score, accuracy, recall, and precision. The suggested strategy performs better than expected, as the results highlight its potential to improve cancer classification techniques with an accuracy of 99.98%. We conclude from this analysis that, across many datasets, the CatBoost and CNN model outperforms the other models. This research contributes to the ongoing efforts to improve diagnostic precision and treatment outcomes in cancer research.
引用
收藏
页数:15
相关论文
共 50 条
  • [31] Specificity enhancement in classification of breast MRI lesion based on multi-classifier
    Keyvanfard, Farzaneh
    Shoorehdeli, Mahdi Aliyari
    Teshnehlab, Mohammad
    Nie, Ke
    Su, Min-Ying
    NEURAL COMPUTING & APPLICATIONS, 2013, 22 : S35 - S45
  • [32] Effects of Diversity Measures on the Design of Ensemble Classifiers by Multiobjective Genetic Fuzzy Rule Selection with a Multi-classifier Coding Scheme
    Nojima, Yusuke
    Ishibuchi, Hisao
    HYBRID ARTIFICIAL INTELLIGENCE SYSTEMS, 2008, 5271 : 755 - 763
  • [33] Automatic classification of thyroid histopathology images using multi-classifier system
    Angel Arul Jothi J
    Mary Anita Rajam V
    Multimedia Tools and Applications, 2017, 76 : 18711 - 18730
  • [34] Automatic classification of thyroid histopathology images using multi-classifier system
    Jothi, Angel Arul J.
    Rajam, Mary Anita, V
    MULTIMEDIA TOOLS AND APPLICATIONS, 2017, 76 (18) : 18711 - 18730
  • [35] Interactive patent classification based on multi-classifier fusion and active learning
    Zhang, Xiaoyu
    Neurocomputing, 2014, 127 (01) : 200 - 205
  • [36] Specificity enhancement in classification of breast MRI lesion based on multi-classifier
    Farzaneh Keyvanfard
    Mahdi Aliyari Shoorehdeli
    Mohammad Teshnehlab
    Ke Nie
    Min-Ying Su
    Neural Computing and Applications, 2013, 22 : 35 - 45
  • [37] A multi-classifier approach to dialogue act classification using function words
    O'Shea, J. (j.d.oshea@mmu.ac.uk), 1600, Springer Verlag (7270 LNCS):
  • [38] Evolutionary Classifier and Cluster Selection Approach for Ensemble Classification
    Jan, Zohaib Md
    Verma, Brijesh
    ACM TRANSACTIONS ON KNOWLEDGE DISCOVERY FROM DATA, 2020, 14 (01)
  • [39] Interactive patent classification based on multi-classifier fusion and active learning
    Zhang, Xiaoyu
    NEUROCOMPUTING, 2014, 127 : 200 - 205
  • [40] A kind of Combination Feature Division and Diversity Measure of Multi-classifier Selective Ensemble Algorithm
    Wang, Yan
    Wang, Xiu-xia
    Lai, Sheng
    ADVANCED RESEARCH ON MECHANICAL ENGINEERING, INDUSTRY AND MANUFACTURING ENGINEERING, PTS 1 AND 2, 2011, 63-64 : 55 - 58