Breast Cancer Prediction Using Different Classification Algorithms with Various Feature Selection Strategies

Cited by: 0
Authors
Sabha, Mohamad [1]
Tugrul, Bulent [1]
Affiliation
[1] Ankara Univ, Dept Comp Engn, Ankara, Turkey
Keywords
Breast Cancer; Feature Selection; Classification; Data Mining
DOI
10.1109/ICICOS53627.2021.9651867
Chinese Library Classification number
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract
Breast cancer is one of the most dangerous diseases threatening women's lives. If the disease is not detected in its early stages, it can result in the death of the patient. The term breast cancer refers to a malignant tumor that arises from the uncontrolled growth of breast cells and that can spread to other parts of the patient's body. Cancer in general results from the abnormal growth of cells in the body. Tumors are generally classified into two types: benign (non-cancerous) and malignant (cancerous). The earlier the cancer is diagnosed, the better the patient's chance of recovery. Accurately predicting the presence of breast cancer in patients has long been an important problem for cancer researchers, and Machine Learning (ML) and Data Mining (DM) have drawn sustained interest in the scientific community in the hope that they can yield accurate results. In this study, we aimed to predict the tumor at an early stage using several classification algorithms. After the dataset was collected and the outliers and skewness in the data were removed, different classification algorithms were applied, with a focus on the effect of the feature selection step in the model-building phase. Across multiple experiments, the best overall accuracy, 98.25%, was obtained by a Support Vector Machine (SVM) classifier with features selected by Recursive Feature Elimination (RFE) using a Random Forest (RF).
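The best-performing combination named in the abstract (RFE driven by a Random Forest, followed by an SVM classifier) could be sketched roughly as follows. This is only an illustration under stated assumptions, not the authors' implementation: the abstract does not name the dataset, the number of features retained, the train/test split, or any hyperparameters, so the scikit-learn built-in Wisconsin breast cancer dataset, the choice of 10 features, the 80/20 split, and the RBF kernel are all placeholders. The outlier and skewness handling mentioned in the abstract is not reproduced here.

```python
from sklearn.datasets import load_breast_cancer
from sklearn.ensemble import RandomForestClassifier
from sklearn.feature_selection import RFE
from sklearn.model_selection import train_test_split
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler
from sklearn.svm import SVC

# Assumption: scikit-learn's built-in Wisconsin dataset stands in for the
# (unnamed) dataset used in the paper.
X, y = load_breast_cancer(return_X_y=True)

# Assumption: a stratified 80/20 hold-out split.
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.2, stratify=y, random_state=0
)

# RFE repeatedly fits the Random Forest and drops the least important
# feature until n_features_to_select remain (10 is an arbitrary choice).
selector = RFE(
    estimator=RandomForestClassifier(n_estimators=100, random_state=0),
    n_features_to_select=10,
)

# The selected features are standardized and passed to an SVM classifier.
model = make_pipeline(selector, StandardScaler(), SVC(kernel="rbf"))
model.fit(X_train, y_train)

accuracy = model.score(X_test, y_test)
n_selected = int(model.named_steps["rfe"].support_.sum())
print(f"selected features: {n_selected}, test accuracy: {accuracy:.4f}")
```

Placing the selector inside the pipeline means the feature ranking is learned from the training split only, which avoids leaking test data into the selection step. The accuracy printed by this sketch should not be expected to match the 98.25% reported above, since the preprocessing and settings differ.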
Pages: 6
Related papers
50 in total
  • [41] Drug and Nondrug Classification Based on Deep Learning with Various Feature Selection Strategies
    Yu, Long
    Sun, Xia
    Tian, Shengwei
    Shi, Xinyu
    Yan, Yilin
    CURRENT BIOINFORMATICS, 2018, 13 (03) : 253 - 259
  • [42] Sparse feature selection for classification and prediction of metastasis in endometrial cancer
    Mehmet Eren Ahsen
    Todd P. Boren
    Nitin K. Singh
    Burook Misganaw
    David G. Mutch
    Kathleen N. Moore
    Floor J. Backes
    Carolyn K. McCourt
    Jayanthi S. Lea
    David S. Miller
    Michael A. White
    Mathukumalli Vidyasagar
    BMC Genomics, 18
  • [43] Sparse feature selection for classification and prediction of metastasis in endometrial cancer
    Ahsen, Mehmet Eren
    Boren, Todd P.
    Singh, Nitin K.
    Misganaw, Burook
    Mutch, David G.
    Moore, Kathleen N.
    Backes, Floor J.
    McCourt, Carolyn K.
    Lea, Jayanthi S.
    Miller, David S.
    White, Michael A.
    Vidyasagar, Mathukumalli
    BMC GENOMICS, 2017, 18
  • [44] Sparse Feature Selection for Classification and Prediction of Metastasis in Endometrial Cancer
    Ahsen, Mehmet Eren
    Boren, Todd P.
    Singh, Nitin K.
    Misganaw, Burook
    Lea, Jayanthi S.
    Miller, David S.
    White, Michael A.
    Vidyasagar, Mathukumalli
    PROCEEDINGS OF THE 7TH ACM INTERNATIONAL CONFERENCE ON BIOINFORMATICS, COMPUTATIONAL BIOLOGY, AND HEALTH INFORMATICS, 2016, : 522 - 524
  • [45] Enhancement of Breast Cancer Classification Using Bat Feature Selection with Recurrent Deep Learning
    Jaafar, Ali Nafaa
    Journal of Computing and Information Technology, 2024, 32 (03) : 195 - 215
  • [46] Breast Cancer Diagnosis using Simultaneous Feature Selection and Classification: A Genetic Programming Approach
    Bhardwaj, Harshit
    Sakalle, Aditi
    Tiwari, Aruna
    Verma, Madhushi
    Bhardwaj, Arpit
    2018 IEEE SYMPOSIUM SERIES ON COMPUTATIONAL INTELLIGENCE (IEEE SSCI), 2018, : 2186 - 2192
  • [47] Detection and classification of breast cancer using logistic regression feature selection and GMDH classifier
    Khandezamin Z.
    Naderan M.
    Rashti M.J.
    Journal of Biomedical Informatics, 2020, 111
  • [48] Feature selection and classification approaches in gene expression of breast cancer
    Ghosh, Sarada
    Samanta, Guruprasad
    De la Sen, Manuel
    AIMS BIOPHYSICS, 2021, 8 (04): : 372 - 384
  • [49] Survival Prediction and Feature Selection in Patients with Breast Cancer Using Support Vector Regression
    Goli, Shahrbanoo
    Mahjub, Hossein
    Faradmal, Javad
    Mashayekhi, Hoda
    Soltanian, Ali-Reza
    COMPUTATIONAL AND MATHEMATICAL METHODS IN MEDICINE, 2016, 2016
  • [50] Breast cancer prediction with transcriptome profiling using feature selection and machine learning methods
    Eskandar Taghizadeh
    Sahel Heydarheydari
    Alihossein Saberi
    Shabnam JafarpoorNesheli
    Seyed Masoud Rezaeijo
    BMC Bioinformatics, 23