Blood cancer prediction using leukemia microarray gene data and hybrid logistic vector trees model

Authors
Vaibhav Rupapara
Furqan Rustam
Wajdi Aljedaani
Hina Fatima Shahzad
Ernesto Lee
Imran Ashraf
Affiliations
[1] Florida International University, School of Computing and Information Sciences
[2] Khwaja Fareed University of Engineering and Information Technology, Department of Computer Science
[3] University of North Texas, Department of Computer Science and Engineering
[4] Broward College, Department of Computer Science
[5] Yeungnam University, Department of Information and Communication Engineering
Abstract
Blood cancer has been a growing concern over the last decade and requires early diagnosis so that proper treatment can begin. The diagnostic process is costly and time-consuming, involving medical experts and several tests. An automatic system for accurate prediction is therefore of significant importance. Diagnosing blood cancer from leukemia microarray gene data with machine learning has become an important area of medical research. Despite these research efforts, accuracy and efficiency still need improvement. This study proposes a supervised machine learning approach for blood cancer prediction. It uses a leukemia microarray gene dataset containing 22,283 genes. ADASYN resampling and Chi-squared (Chi2) feature selection are used to address the class imbalance and high dimensionality of the dataset: ADASYN generates synthetic samples so that the target classes are balanced, and Chi2 selects the best features out of the 22,283 for training the learning models. For classification, a hybrid logistic vector trees classifier (LVTrees) is proposed, which combines logistic regression, a support vector classifier, and an extra tree classifier. In addition to extensive experiments on the datasets, performance is compared with state-of-the-art methods to determine the significance of the proposed approach. With ADASYN and Chi2, LVTrees outperforms all other models, achieving 100% accuracy. A statistical t-test is also performed to show the efficacy of the proposed approach, and results from k-fold cross-validation further support the superiority of the proposed model.
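The two later stages named in the abstract, Chi2 feature selection and the combination of three classifiers, can be illustrated with a minimal standard-library sketch. It is not the authors' implementation. The function names (`chi2_scores`, `select_top_k`, `soft_vote`) and the toy data are hypothetical, and the abstract does not state how LVTrees combines its three base models, so averaging their class probabilities (soft voting) is an assumption made here for illustration. ADASYN resampling is not shown.

```python
from collections import defaultdict


def chi2_scores(X, y):
    """Chi-squared score of each non-negative feature against the class label.

    Observed value: sum of the feature within each class.
    Expected value: class frequency times the feature's overall sum.
    """
    n_samples = len(X)
    n_features = len(X[0])
    class_counts = defaultdict(int)
    observed = defaultdict(lambda: [0.0] * n_features)
    for row, label in zip(X, y):
        class_counts[label] += 1
        for j, value in enumerate(row):
            observed[label][j] += value
    totals = [sum(row[j] for row in X) for j in range(n_features)]

    scores = []
    for j in range(n_features):
        score = 0.0
        for label, count in class_counts.items():
            expected = totals[j] * count / n_samples
            if expected > 0:
                score += (observed[label][j] - expected) ** 2 / expected
        scores.append(score)
    return scores


def select_top_k(scores, k):
    """Indices of the k highest-scoring features, best first."""
    ranked = sorted(range(len(scores)), key=lambda j: scores[j], reverse=True)
    return ranked[:k]


def soft_vote(probabilities):
    """Average per-model class probabilities and return (winning class, mean vector).

    Assumption: soft voting stands in for the unspecified LVTrees combination rule.
    """
    n_models = len(probabilities)
    n_classes = len(probabilities[0])
    mean = [sum(p[c] for p in probabilities) / n_models for c in range(n_classes)]
    winner = max(range(n_classes), key=lambda c: mean[c])
    return winner, mean


# Toy expression matrix: 4 samples, 3 "genes"; gene 2 carries no class signal.
X = [[1, 0, 3], [2, 0, 3], [0, 4, 3], [0, 5, 3]]
y = [0, 0, 1, 1]
scores = chi2_scores(X, y)
kept = select_top_k(scores, 2)

# Hypothetical class probabilities from three base models for one sample.
label, mean = soft_vote([[0.6, 0.4], [0.3, 0.7], [0.4, 0.6]])
```

On the toy data, the constant third gene scores zero and is dropped, which is the behaviour a Chi2 filter relies on when reducing thousands of genes to a small subset. In the soft vote, two of the three hypothetical models favour class 1, and the averaged probabilities agree.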
Related papers
50 in total
  • [41] Knowledge-based analysis of microarray gene expression data by using support vector machines
    Brown, MPS
    Grundy, WN
    Lin, D
    Cristianini, N
    Sugnet, CW
    Furey, TS
    Ares, M
    Haussler, D
    PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2000, 97 (01) : 262 - 267
  • [42] A New hybrid Feature selection-Classification model to Improve Cancer Sample Classification Accuracy in Microarray Gene Expression Data
    Bandyopadhyay, Ritaban
    Sharma, Arijt Das
    Dasgupta, Bidya
    Ghosh, Ankita
    Das, Chandra
    Bose, Shilpi
    2023 INTERNATIONAL CONFERENCE ON COMPUTER, ELECTRICAL & COMMUNICATION ENGINEERING, ICCECE, 2023,
  • [43] Diagnosis of acute myeloid leukaemia on microarray gene expression data using categorical gradient boosted trees
    Angelakis, Athanasios
    Soulioti, Ioanna
    Filippakis, Michael
    HELIYON, 2023, 9 (10)
  • [44] Design Model of Deep Stacking Network for Breast Cancer Prediction Using Microarray
    Hanifah, Nurul
    Wasito, Ito
    Sabarguna, Boy Subiroso
    ADVANCED SCIENCE LETTERS, 2018, 24 (08) : 6095 - 6096
  • [45] Ensemble Classifiers for Acute Leukemia Classification Using Microarray Gene Expression Data under uncertainty
    Gamal, Mona
    Zaied, Abdel Nasser H.
    Rushdy, Ehab
    Neutrosophic Sets and Systems, 2022, 49 : 164 - 183
  • [46] Extreme value distribution based gene selection criteria for discriminant microarray data analysis using logistic regression
    Li, WT
    Sun, FZ
    Grosse, I
    JOURNAL OF COMPUTATIONAL BIOLOGY, 2004, 11 (2-3) : 215 - 226
  • [47] Cancer Classification Based on Microarray Gene Expression Data Using Deep Learning
    Guillen, Pablo
    Ebalunode, Jerry
    2016 INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE & COMPUTATIONAL INTELLIGENCE (CSCI), 2016, : 1403 - 1405
  • [48] Deep learning techniques for cancer classification using microarray gene expression data
    Gupta, Surbhi
    Gupta, Manoj K.
    Shabaz, Mohammad
    Sharma, Ashutosh
    FRONTIERS IN PHYSIOLOGY, 2022, 13
  • [49] Gene subset selection in microarray data using entropic filtering for cancer classification
    Navarro, Felix F. Gonzalez
    Munoz, Lluis A. Belanche
    EXPERT SYSTEMS, 2009, 26 (01) : 113 - 124
  • [50] Cancer classification by gradient LDA technique using microarray gene expression data
    Sharma, Alok
    Paliwal, Kuldip K.
    DATA & KNOWLEDGE ENGINEERING, 2008, 66 (02) : 338 - 347