Blood cancer prediction using leukemia microarray gene data and hybrid logistic vector trees model

被引:0
|
作者
Vaibhav Rupapara
Furqan Rustam
Wajdi Aljedaani
Hina Fatima Shahzad
Ernesto Lee
Imran Ashraf
机构
[1] Florida International University,School of Computing and Information Sciences
[2] Khwaja Fareed University of Engineering and Information Technology,Department of Computer Science
[3] University of North Texas,Department of Computer Science and Engineering
[4] Broward College,Department of Computer Science
[5] Yeungnam University,Department of Information and Communication Engineering
来源
关键词
D O I
暂无
中图分类号
学科分类号
摘要
Blood cancer has been a growing concern during the last decade and requires early diagnosis to start proper treatment. The diagnosis process is costly and time-consuming involving medical experts and several tests. Thus, an automatic diagnosis system for its accurate prediction is of significant importance. Diagnosis of blood cancer using leukemia microarray gene data and machine learning approach has become an important medical research today. Despite research efforts, desired accuracy and efficiency necessitate further enhancements. This study proposes an approach for blood cancer disease prediction using the supervised machine learning approach. For the current study, the leukemia microarray gene dataset containing 22,283 genes, is used. ADASYN resampling and Chi-squared (Chi2) features selection techniques are used to resolve imbalanced and high-dimensional dataset problems. ADASYN generates artificial data to make the dataset balanced for each target class, and Chi2 selects the best features out of 22,283 to train learning models. For classification, a hybrid logistics vector trees classifier (LVTrees) is proposed which utilizes logistic regression, support vector classifier, and extra tree classifier. Besides extensive experiments on the datasets, performance comparison with the state-of-the-art methods has been made for determining the significance of the proposed approach. LVTrees outperform all other models with ADASYN and Chi2 techniques with a significant 100% accuracy. Further, a statistical significance T-test is also performed to show the efficacy of the proposed approach. Results using k-fold cross-validation prove the supremacy of the proposed model.
引用
收藏
相关论文
共 50 条
  • [31] A Survey on Hybrid Feature Selection Methods in Microarray Gene Expression Data for Cancer Classification
    Almugren, Nada
    Alshamlan, Hala
    IEEE ACCESS, 2019, 7 : 78533 - 78548
  • [32] A Hybrid Approach for Biomarker Discovery from Microarray Gene Expression Data for Cancer Classification
    Peng, Yanxiong
    Li, Wenyuan
    Liu, Ying
    CANCER INFORMATICS, 2006, 2 : 301 - 311
  • [33] Classifying Cancer Types Based on Microarray Gene Expressions using Conformal Prediction
    Ismailoglu, Firat
    29TH IEEE CONFERENCE ON SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS (SIU 2021), 2021,
  • [34] Extraction of the cancer information from microarray of gene expression using Support Vector Machines
    Wilinski, A
    PHOTONICS APPLICATIONS IN ASTRONOMY, COMMUNICATIONS, INDUSTRY, AND HIGH-ENERGY PHYSICS EXPERIMENTS IV, 2006, 6159
  • [35] A Hybrid Barnacles Mating Optimizer Algorithm With Support Vector Machines for Gene Selection of Microarray Cancer Classification
    Houssein, Essam H.
    Abdelminaam, Diaa Salama
    Hassan, Hager N.
    Al-Sayed, Mustafa M.
    Nabil, Emad
    IEEE ACCESS, 2021, 9 : 64895 - 64905
  • [36] Gene Selection for Cancer Classification from Microarray Data Using Data Overlap Measure
    Sarbazi-Azad, Saeed
    Abadeh, Mohammad Saniee
    2018 25TH IRANIAN CONFERENCE ON BIOMEDICAL ENGINEERING AND 2018 3RD INTERNATIONAL IRANIAN CONFERENCE ON BIOMEDICAL ENGINEERING (ICBME), 2018, : 257 - 262
  • [37] Selecting informative genes from leukemia gene expression data using a hybrid approach for cancer classification
    Mohamad, Mohd Saberi
    Deris, Safaai
    Hashim, Siti Zaiton Mohd
    3RD KUALA LUMPUR INTERNATIONAL CONFERENCE ON BIOMEDICAL ENGINEERING 2006, 2007, 15 : 528 - +
  • [38] Gene Selection Using Hybrid Multi-Objective Cuckoo Search Algorithm With Evolutionary Operators for Cancer Microarray Data
    Othman, Mohd Shahizan
    Kumaran, Shamini Raja
    Yusuf, Lizawati Mi
    IEEE ACCESS, 2020, 8 : 186348 - 186361
  • [39] Hybrid feature selection using micro genetic algorithm on microarray gene expression data
    Pragadeesh, C.
    Jeyaraj, Rohana
    Siranjeevi, K.
    Abishek, R.
    Jeyakumar, G.
    JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2019, 36 (03) : 2241 - 2246
  • [40] A MICROARRAY GENE EXPRESSION DATA CLASSIFICATION USING HYBRID BACK PROPAGATION NEURAL NETWORK
    Vimaladevi, M.
    Kalaavathi, B.
    GENETIKA-BELGRADE, 2014, 46 (03): : 1013 - 1026