Breast Cancer Prediction: Importance of Feature Selection

被引:0
|
作者
Prateek [1 ]
机构
[1] QR 1012,SECT 4-C, Bokaro Steel City, Jharkhand, India
关键词
Machine learning; KNN; Feature selection; SVM; Logistic regression; Naive Bayes; Classification; Prediction algorithms; Breast cancer; CLASSIFICATION RULES; DIAGNOSIS;
D O I
10.1007/978-981-13-6861-5_62
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
In today's world, breast cancer is one of the most widespread causes of death in women. According to an estimation, approximately 40,920 women would die in 2018 just because of breast cancer, which is a highly alarming number. Such alarming numbers could be reduced if the cancer is diagnosed at an early stage. With the advent of technology, making such predictions has become an easier task. Machine learning is one of the latest trends, which enables to make predictions related to diseases based on physical or behavioral characteristics. In this paper, we use various machine learning algorithms like decision trees, k-nearest neighbor (KNN), logistic regression, neural networks (NNs), naive Bayes, random forest, and support vector machine (SVM). The outcome is then compared based on the precision, recall, and F1 score. Furthermore, we identify the least important features in the dataset, implement all these algorithms again after removing those features, and then compare the outcomes for the two implementation stages in order to understand the importance of feature selection in breast cancer prediction.
引用
收藏
页码:733 / 742
页数:10
相关论文
共 50 条
  • [41] Prediction of lymphedema occurrence in patients with breast cancer using the optimized combination of ensemble learning algorithm and feature selection
    Anaram Yaghoobi Notash
    Aidin Yaghoobi Notash
    Zahra Omidi
    Shahpar Haghighat
    BMC Medical Informatics and Decision Making, 22
  • [42] Integrating ensemble systems biology feature selection and bimodal deep neural network for breast cancer prognosis prediction
    Cheng, Li-Hsin
    Hsu, Te-Cheng
    Lin, Che
    SCIENTIFIC REPORTS, 2021, 11 (01)
  • [43] An enhanced soft-computing based strategy for efficient feature selection for timely breast cancer prediction: Wisconsin Diagnostic Breast Cancer dataset case
    Singh, Law Kumar
    Khanna, Munish
    Singh, Rekha
    MULTIMEDIA TOOLS AND APPLICATIONS, 2024, 83 (31) : 76607 - 76672
  • [44] Feature Selection for Link Prediction
    Xu, Ye
    Rockmore, Dan
    PROCEEDINGS OF THE 5TH PH.D. WORKSHOP ON INFORMATION AND KNOWLEDGE, 2012, : 25 - 32
  • [45] Feature selection in bankruptcy prediction
    Tsai, Chih-Fong
    KNOWLEDGE-BASED SYSTEMS, 2009, 22 (02) : 120 - 127
  • [46] Score-based causal feature selection for cancer risk prediction
    Huang, Shanshan
    Li, Qingsong
    Wang, Lei
    Wang, Yuanhao
    Liu, Li
    2023 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, ICME, 2023, : 198 - 203
  • [47] A comparative study on feature selection for a risk prediction model for colorectal cancer
    Cueto-Lopez, Nahum
    Teresa Garcia-Ordas, Maria
    Davila-Batista, Veronica
    Moreno, Victor
    Aragones, Nduria
    Alaiz-Rodriguez, Rocio
    COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE, 2019, 177 : 219 - 229
  • [48] On the Feature Selection Methods and Reject Option Classifiers for Robust Cancer Prediction
    Waseem, Muhammad Hammad
    Nadeem, Malik Sajjad Ahmed
    Abbas, Assad
    Shaheen, Aliya
    Aziz, Wajid
    Anjum, Adeel
    Manzoor, Umar
    Balubaid, Muhammad A.
    Shim, Seong-O
    IEEE ACCESS, 2019, 7 : 141072 - 141082
  • [49] Oriented Feature Selection SVM Applied to Cancer Prediction in Precision Medicine
    Shen, Yang
    Wu, Chunxue
    Liu, Cong
    Wu, Yan
    Xiong, Naixue
    IEEE ACCESS, 2018, 6 : 48510 - 48521
  • [50] S3LR: Novel feature selection approach for Microarray-Based breast cancer recurrence prediction
    Erekat, Asala N.
    Khasawneh, Mohammad T.
    EXPERT SYSTEMS WITH APPLICATIONS, 2024, 241