Breast Carcinoma Prediction Through Integration of Machine Learning Models

Authors
Martinez-Licort, Rosmeri [1]
Leon, Carlos de la Cruz [2, 3]
Agarwal, Deevyankar [2]
Sahelices, Benjamin [1]
de la Torre, Isabel [2]
Miramontes-Gonzalez, Jose Pablo [4, 5]
Amoon, Mohammed [6]
Affiliations
[1] Univ Valladolid, Dept Comp Sci, GCME Res Grp, Valladolid 47011, Spain
[2] Univ Valladolid, Dept Signal Theory Commun & Telemat Engn, Valladolid 47011, Spain
[3] CARTIF Technol Ctr, Valladolid 47151, Spain
[4] Univ Valladolid, Fac Med, Dept Med, Valladolid 47005, Spain
[5] Rio Hortega Univ Hosp, Internal Med Serv, Valladolid 47012, Spain
[6] King Saud Univ, Community Coll, Dept Comp Sci, Riyadh 11437, Saudi Arabia
Source
IEEE ACCESS | 2024 / Vol. 12
Keywords
Support vector machines; Breast cancer; Data models; Training; Accuracy; Principal component analysis; Analytical models; Ensemble learning; Machine learning; Majority voting; Cancer diagnosis; Scheme
DOI
10.1109/ACCESS.2024.3431998
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Breast cancer poses a global health challenge, with high incidence and mortality rates. Early detection and precise diagnosis are crucial for patient prognosis. Machine learning (ML) models applied to mammary biopsy image data hold promise for efficient and accurate breast cancer diagnosis. In this study, we evaluate the performance of several ML algorithms: Logistic Regression (LR), Random Forest (RF), Naive Bayes (NB) and Support Vector Machine (SVM). We establish evaluation contexts by standardizing the data and reducing the correlation between variables. First, we select the best-performing parameters for each algorithm by building and evaluating the individual models. Then, we implement a combined model using weighted voting, where the weight of each model is determined by its performance on the test dataset. The final model combines the LR, RF and SVM models. SVM is the best-performing individual model, so it receives the highest weight in the final model. The final integrated model achieves an accuracy of 98%, a precision of 97%, a recall of 99%, an F1-score of 98% and an AUC of 0.98. Our weighted voting model compares favourably with the other models analysed. The approach is efficient and transparent in handling structured medical data. It is a prototype that will be refined and expanded to encompass larger real-world datasets.
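The weighted voting step described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the accuracy values, the normalization of scores into weights, the tie-breaking rule, and the function names `accuracy_weights` and `weighted_vote` are all assumptions made for illustration. The sketch only shows how per-model class votes might be combined so that the strongest model (here SVM) carries the largest weight.

```python
from collections import defaultdict


def accuracy_weights(scores):
    """Normalize per-model scores so that the resulting weights sum to 1."""
    total = sum(scores.values())
    if total <= 0:
        raise ValueError("scores must sum to a positive value")
    return {name: score / total for name, score in scores.items()}


def weighted_vote(predictions, weights):
    """Return the class label with the largest total weight for one sample.

    predictions maps model name -> predicted label,
    weights maps model name -> voting weight.
    Ties go to the smallest label (an arbitrary, illustrative choice).
    """
    tally = defaultdict(float)
    for name, label in predictions.items():
        tally[label] += weights[name]
    return max(sorted(tally), key=tally.get)


# Hypothetical per-model accuracies, NOT figures reported in the paper.
scores = {"LR": 0.96, "RF": 0.95, "SVM": 0.98}
weights = accuracy_weights(scores)

# One sample where SVM disagrees with the other two models (0 = benign, 1 = malignant).
sample_votes = {"LR": 0, "RF": 0, "SVM": 1}
print(weighted_vote(sample_votes, weights))
```

With weights this close to each other, two agreeing models still outvote the third; a single model dominates only when its weight exceeds the sum of the others. In practice a library such as scikit-learn offers a `VotingClassifier` with a `weights` parameter that covers the same idea.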
Pages: 134635 - 134650
Page count: 16
Related papers
50 in total
  • [31] Breast Cancer Classification Through Transfer Learning with Vision Transformer, PCA, and Machine Learning Models
    Gutierrez-Cardenas, Juan
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2024, 15 (04) : 1027 - 1036
  • [32] Breast Cancer Prediction Using Soft Voting Classifier Based on Machine Learning Models
    Hashim, Mohammed S.
    Yassin, Ali A.
    IAENG International Journal of Computer Science, 2023, 50 (02)
  • [33] Metabolism score and machine learning models for the prediction of esophageal squamous cell carcinoma progression
    Chen, Lu
    Zhang, Wenxin
    Shi, Huanying
    Zhu, Yongjun
    Chen, Haifei
    Wu, Zimei
    Zhong, Mingkang
    Shi, Xiaojin
    Li, Qunyi
    Wang, Tianxiao
    CANCER SCIENCE, 2024, 115 (09) : 3127 - 3142
  • [34] Prognostic prediction of breast cancer patients using machine learning models: a retrospective analysis
    Song, Xuchun
    Chu, Jiebin
    Guo, Zijie
    Wei, Qun
    Wang, Qingchuan
    Hu, Wenxian
    Wang, Linbo
    Zhao, Wenhe
    Zheng, Heming
    Lu, Xudong
    Zhou, Jichun
    GLAND SURGERY, 2024, 13 (09) : 1575 - 1587
  • [35] Comparison of Different Machine Learning Models in Prediction of Postirradiation Recurrence in Prostate Carcinoma Patients
    Marinkovic, Mladen
    Popovic, Marina
    Stojanovic-Rundic, Suzana
    Nikolic, Milos
    Cavic, Milena
    Gavrilovic, Dusica
    Teodorovic, Dusan
    Mitrovic, Nenad
    Mijatovic Teodorovic, Ljiljana
    BIOMED RESEARCH INTERNATIONAL, 2022, 2022
  • [36] Prediction of Surgical Upstaging Risk of Ductal Carcinoma In Situ Using Machine Learning Models
    Hashiba, Kimberlee A.
    Mercaldo, Sarah
    Venkatesh, Sheila L.
    Bahl, Manisha
    JOURNAL OF BREAST IMAGING, 2023, 5 (06) : 695 - 702
  • [37] Multisource Data Integration and Comparative Analysis of Machine Learning Models for On-Street Parking Prediction
    Inam, Saba
    Mahmood, Azhar
    Khatoon, Shaheen
    Alshamari, Majed
    Nawaz, Nazia
    SUSTAINABILITY, 2022, 14 (12)
  • [38] Enhancing prognostic accuracy for breast adenoid cystic carcinoma using machine learning models
    Alshwayyat, S.
    Abu Al Hawa, M. Bashar
    Sawan, S.
    Alshwayyat, T. A.
    Alshwayyat, M.
    Sawan, L. Jihad
    ANNALS OF ONCOLOGY, 2024, 35 : S340 - S340
  • [39] Optimizing chickpea yield prediction under wilt disease through synergistic integration of biophysical and image parameters using machine learning models
    Singh, R. N.
    Krishnan, P.
    Bharadwaj, C.
    Sah, Sonam
    Das, B.
    SCIENTIFIC REPORTS, 2025, 15 (01)
  • [40] Machine Learning Algorithms for Breast Cancer Prediction
    Kumar, K. M. E. Senthil
    Akalya, A.
    Kanimozhi, V.
    JOURNAL OF POPULATION THERAPEUTICS AND CLINICAL PHARMACOLOGY, 2023, 30 (07) : E245 - E250