Blood cancer prediction using leukemia microarray gene data and hybrid logistic vector trees model

被引:0
|
作者
Vaibhav Rupapara
Furqan Rustam
Wajdi Aljedaani
Hina Fatima Shahzad
Ernesto Lee
Imran Ashraf
机构
[1] Florida International University,School of Computing and Information Sciences
[2] Khwaja Fareed University of Engineering and Information Technology,Department of Computer Science
[3] University of North Texas,Department of Computer Science and Engineering
[4] Broward College,Department of Computer Science
[5] Yeungnam University,Department of Information and Communication Engineering
来源
关键词
D O I
暂无
中图分类号
学科分类号
摘要
Blood cancer has been a growing concern during the last decade and requires early diagnosis to start proper treatment. The diagnosis process is costly and time-consuming involving medical experts and several tests. Thus, an automatic diagnosis system for its accurate prediction is of significant importance. Diagnosis of blood cancer using leukemia microarray gene data and machine learning approach has become an important medical research today. Despite research efforts, desired accuracy and efficiency necessitate further enhancements. This study proposes an approach for blood cancer disease prediction using the supervised machine learning approach. For the current study, the leukemia microarray gene dataset containing 22,283 genes, is used. ADASYN resampling and Chi-squared (Chi2) features selection techniques are used to resolve imbalanced and high-dimensional dataset problems. ADASYN generates artificial data to make the dataset balanced for each target class, and Chi2 selects the best features out of 22,283 to train learning models. For classification, a hybrid logistics vector trees classifier (LVTrees) is proposed which utilizes logistic regression, support vector classifier, and extra tree classifier. Besides extensive experiments on the datasets, performance comparison with the state-of-the-art methods has been made for determining the significance of the proposed approach. LVTrees outperform all other models with ADASYN and Chi2 techniques with a significant 100% accuracy. Further, a statistical significance T-test is also performed to show the efficacy of the proposed approach. Results using k-fold cross-validation prove the supremacy of the proposed model.
引用
收藏
相关论文
共 50 条
  • [1] Blood cancer prediction using leukemia microarray gene data and hybrid logistic vector trees model
    Rupapara, Vaibhav
    Rustam, Furqan
    Aljedaani, Wajdi
    Shahzad, Hina Fatima
    Lee, Ernesto
    Ashraf, Imran
    SCIENTIFIC REPORTS, 2022, 12 (01)
  • [2] Hybrid ant lion mutated ant colony optimizer technique for Leukemia prediction using microarray gene data
    Santhakumar, D.
    Logeswari, S.
    JOURNAL OF AMBIENT INTELLIGENCE AND HUMANIZED COMPUTING, 2021, 12 (02) : 2965 - 2973
  • [3] Hybrid ant lion mutated ant colony optimizer technique for Leukemia prediction using microarray gene data
    D. Santhakumar
    S. Logeswari
    Journal of Ambient Intelligence and Humanized Computing, 2021, 12 : 2965 - 2973
  • [4] Improving prediction of blood cancer using leukemia microarray gene data and Chi2 features with weighted convolutional neural network
    Alabdulqader, Ebtisam Abdullah
    Alarfaj, Aisha Ahmed
    Umer, Muhammad
    Eshmawi, Ala' Abdulmajid
    Alsubai, Shtwai
    Kim, Tai-hoon
    Ashraf, Imran
    SCIENTIFIC REPORTS, 2024, 14 (01):
  • [5] Prediction of blood cancer using leukemia gene expression data and sparsity-based gene selection methods
    Mehrabani, Sanaz
    Soroush, Morteza Zangeneh
    Kheiri, Negin
    Sheikhpour, Razieh
    Bahrami, Mahshid
    IRANIAN JOURNAL OF PEDIATRIC HEMATOLOGY AND ONCOLOGY, 2023, 13 (01) : 13 - 21
  • [6] Estimation and prediction in generalized half logistic lifetime model using hybrid censored data
    Soni, Sakshi
    Shukla, Ashish Kumar
    Kumar, Kapil
    INTERNATIONAL JOURNAL OF QUALITY & RELIABILITY MANAGEMENT, 2023, 40 (09) : 2041 - 2063
  • [7] Hybrid Ant Lion Mutated Ant Colony Optimizer Technique With Particle Swarm Optimization for Leukemia Prediction Using Microarray Gene Data
    Mahesh, T. R.
    Santhakumar, D.
    Balajee, A.
    Shreenidhi, H. S.
    Kumar, V. Vinoth
    Rajkumar Annand, Jonnakuti
    IEEE ACCESS, 2024, 12 : 10910 - 10919
  • [8] Gene Subset Selection for Leukemia Classification Using Microarray Data
    Fajila, Mohamed Nisper Fathima
    CURRENT BIOINFORMATICS, 2019, 14 (04) : 353 - 358
  • [9] Lung cancer prediction from microarray data by gene expression programming
    Azzawi, Hasseeb
    Hou, Jingyu
    Xiang, Yong
    Alanni, Russul
    IET SYSTEMS BIOLOGY, 2016, 10 (05) : 168 - 178
  • [10] Classification and diagnostic prediction of cancers using gene microarray data analysis
    Osareh, Alireza
    Shadgar, Bita
    Journal of Applied Sciences, 2009, 9 (03) : 459 - 468