Breast Cancer Prediction: Importance of Feature Selection

Cited by: 0
Authors
Prateek [1]
Affiliations
[1] QR 1012,SECT 4-C, Bokaro Steel City, Jharkhand, India
Keywords
Machine learning; KNN; Feature selection; SVM; Logistic regression; Naive Bayes; Classification; Prediction algorithms; Breast cancer; CLASSIFICATION RULES; DIAGNOSIS;
DOI
10.1007/978-981-13-6861-5_62
Chinese Library Classification number
TP301 [Theory, Methods];
Discipline classification code
081202;
Abstract
Breast cancer is one of the most widespread causes of death in women. According to one estimate, approximately 40,920 women were expected to die of breast cancer in 2018, an alarming number that could be reduced if the cancer were diagnosed at an early stage. Advances in technology have made such predictions easier. Machine learning, one of the latest trends, makes it possible to predict diseases from physical or behavioral characteristics. In this paper, we apply several machine learning algorithms: decision trees, k-nearest neighbor (KNN), logistic regression, neural networks (NNs), naive Bayes, random forest, and support vector machine (SVM). The outcomes are then compared on precision, recall, and F1 score. Furthermore, we identify the least important features in the dataset, run all of these algorithms again after removing those features, and compare the outcomes of the two implementation stages to understand the importance of feature selection in breast cancer prediction.
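The two-stage comparison described in the abstract can be illustrated with a minimal sketch. Everything here is an assumption made for illustration, not the paper's method or data: the six-row toy dataset, the 1-nearest-neighbor classifier with leave-one-out evaluation, and the hypothetical `separation_score` used to rank features are stand-ins, since the abstract does not say how the least important features were identified.

```python
from statistics import mean, pstdev

# Toy data (hypothetical, not the paper's dataset): column 0 separates the
# two classes, column 1 is large-scale noise unrelated to the label.
X = [(1.0, 5.0), (1.2, 1.0), (0.8, 9.0), (3.0, 5.2), (3.2, 1.1), (2.8, 9.1)]
y = [0, 0, 0, 1, 1, 1]


def precision_recall_f1(y_true, y_pred, positive=1):
    """Return (precision, recall, F1) for the positive class."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == positive and p == positive)
    fp = sum(1 for t, p in pairs if t != positive and p == positive)
    fn = sum(1 for t, p in pairs if t == positive and p != positive)
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    total = precision + recall
    f1 = 2 * precision * recall / total if total else 0.0
    return precision, recall, f1


def loo_1nn_predictions(X, y):
    """Leave-one-out predictions from a 1-nearest-neighbor classifier."""
    preds = []
    for i, xi in enumerate(X):
        # Nearest other row by squared Euclidean distance.
        nearest = min(
            (j for j in range(len(X)) if j != i),
            key=lambda j: sum((a - b) ** 2 for a, b in zip(xi, X[j])),
        )
        preds.append(y[nearest])
    return preds


def separation_score(X, y, col):
    """Assumed importance measure: class-mean gap in units of overall std."""
    values = [row[col] for row in X]
    pos = [row[col] for row, label in zip(X, y) if label == 1]
    neg = [row[col] for row, label in zip(X, y) if label == 0]
    return abs(mean(pos) - mean(neg)) / pstdev(values)


def drop_column(X, col):
    """Return a copy of X without the given column."""
    return [tuple(v for k, v in enumerate(row) if k != col) for row in X]


# Stage 1: evaluate with all features.
stage1 = precision_recall_f1(y, loo_1nn_predictions(X, y))

# Stage 2: remove the lowest-scoring feature and evaluate again.
least_important = min(range(len(X[0])), key=lambda c: separation_score(X, y, c))
X_reduced = drop_column(X, least_important)
stage2 = precision_recall_f1(y, loo_1nn_predictions(X_reduced, y))

print("all features:   ", stage1)
print("after selection:", stage2)
```

The toy data is deliberately extreme: the noisy column dominates the distance computation, so the distance-based classifier improves once it is removed. Real datasets and the other classifiers listed in the abstract may respond quite differently.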
Pages: 733 - 742
Number of pages: 10
Related papers
50 in total
  • [1] Importance of feature selection and data visualization towards prediction of breast cancer
    Krishnamurthi R.
    Aggrawal N.
    Sharma L.
    Srivastava D.
    Sharma S.
    Recent Patents on Computer Science, 2019, 12 (04) : 317 - 328
  • [2] Feature Selection Facilitated Classification For Breast Cancer Prediction
    Arunadevi, J.
    Ganeshamoorthi, K.
    PROCEEDINGS OF THE 2019 3RD INTERNATIONAL CONFERENCE ON COMPUTING METHODOLOGIES AND COMMUNICATION (ICCMC 2019), 2019, : 560 - 563
  • [3] RoughSet based Feature Selection for Prediction of Breast Cancer
    Bhukya, Hanumanthu
    Sadanandam, M.
    WIRELESS PERSONAL COMMUNICATIONS, 2023, 130 (03) : 2197 - 2214
  • [4] RoughSet based Feature Selection for Prediction of Breast Cancer
    Hanumanthu Bhukya
    M Sadanandam
    Wireless Personal Communications, 2023, 130 : 2197 - 2214
  • [5] Evaluation of Feature Selection Techniques for Breast Cancer Risk Prediction
    Lopez, Nahum Cueto
    Garcia-Ordas, Maria Teresa
    Vitelli-Storelli, Facundo
    Fernandez-Navarro, Pablo
    Palazuelos, Camilo
    Alaiz-Rodriguez, Rocio
    INTERNATIONAL JOURNAL OF ENVIRONMENTAL RESEARCH AND PUBLIC HEALTH, 2021, 18 (20)
  • [6] Breast Cancer Prediction using Feature Selection and Ensemble Voting
    Nguyen, Quang H.
    Do, Trang T. T.
    Wang, Yijing
    Heng, Sin Swee
    Chen, Kelly
    Ang, Wei Hao Max
    Philip, Conceicao Edwin
    Singh, Misha
    Pham, Hung N.
    Nguyen, Binh P.
    Chua, Matthew C. H.
    PROCEEDINGS OF 2019 INTERNATIONAL CONFERENCE ON SYSTEM SCIENCE AND ENGINEERING (ICSSE), 2019, : 250 - 254
  • [7] ALGORITHM SELECTION AND IMPORTANCE OF MACHINE LEARNING IN PREDICTION OF BREAST CANCER
    Babu, B. Sankara
    Bethu, Srikanth
    Rao, P. S. V. Srinivasa
    Sowmya, V
    JOURNAL OF MECHANICS OF CONTINUA AND MATHEMATICAL SCIENCES, 2019, 14 (06) : 283 - 315
  • [8] Particle Swarm Optimization Feature Selection for Breast Cancer Recurrence Prediction
    Sakri, Sapiah Binti
    Rashid, Nuraini Binti Abdul
    Zain, Zuhaira Muhammad
    IEEE ACCESS, 2018, 6 : 29637 - 29647
  • [9] Fire ant optimisation feature selection method for breast cancer prediction
    Poonguzhali, N.
    Priya, D. Mohana
    Vijayalakshmi, S.
    INTERNATIONAL JOURNAL OF COMPUTER APPLICATIONS IN TECHNOLOGY, 2022, 69 (02) : 112 - 122
  • [10] A Comparative Study for Breast Cancer Prediction using Machine Learning and Feature Selection
    Dhanya, R.
    Paul, Irene Rose
    Akula, Sai Sindhu
    Sivakumar, Madhumathi
    Nair, Jyothisha J.
    PROCEEDINGS OF THE 2019 INTERNATIONAL CONFERENCE ON INTELLIGENT COMPUTING AND CONTROL SYSTEMS (ICCS), 2019, : 1049 - 1055