Breast Carcinoma Prediction Through Integration of Machine Learning Models

被引:0
|
作者
Martinez-Licort, Rosmeri [1 ]
Leon, Carlos de la Cruz [2 ,3 ]
Agarwal, Deevyankar [2 ]
Sahelices, Benjamin [1 ]
de la Torre, Isabel [2 ]
Miramontes-Gonzalez, Jose Pablo [4 ,5 ]
Amoon, Mohammed [6 ]
机构
[1] Univ Valladolid, Dept Comp Sci, GCME Res Grp, Valladolid 47011, Spain
[2] Univ Valladolid, Dept Signal Theory Commun & Telemat Engn, Valladolid 47011, Spain
[3] CARTIF Technol Ctr, Valladolid 47151, Spain
[4] Univ Valladolid, Fac Med, Dept Med, Valladolid 47005, Spain
[5] Rio Hortega Univ Hosp, Internal Med Serv, Valladolid 47012, Spain
[6] King Saud Univ, Community Coll, Dept Comp Sci, Riyadh 11437, Saudi Arabia
来源
IEEE ACCESS | 2024年 / 12卷
关键词
Support vector machines; Breast cancer; Data models; Training; Accuracy; Principal component analysis; Analytical models; Ensemble learning; Machine learning; ensemble learning; machine learning; majority voting; principal component analysis; CANCER DIAGNOSIS; SCHEME;
D O I
10.1109/ACCESS.2024.3431998
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Breast cancer poses a global health challenge, with high incidence and mortality rates. Early detection and precise diagnosis are crucial for patient prognosis. Machine learning (ML) models applied to mammary biopsy image data hold promise for achieving an efficient and accurate breast cancer diagnosis. In this study, we evaluated the performance of several ML algorithms, including Logistic Regression (LR), Random Forest (RF), Naive Bayes (NB) and Support Vector Machine (SVM). We establish evaluation contexts by implementing data standardization and reducing the correlation between variables. Firstly, we select the best-performing parameters for each algorithm by building and evaluating the individual models. Then, we implement a combined model using weighted voting, where the weights of each model are determined based on its performance on the test dataset. The final model is constructed by combining the LR, RF and SVM models. We find that SVM is the best-performance individual model, so it has the highest weight in the final model. The final integrated model achieves an accuracy of 98%, a precision of 97%, a recall of 99%, an F1-score of 98% and an AUC of 0.98. Our weighted voting model compares favourably with the other models analysed. This approach demonstrates its efficiency and transparency in handling structured medical data. It is a prototype that will be refined and expanded to encompass larger real-world datasets.
引用
收藏
页码:134635 / 134650
页数:16
相关论文
共 50 条
  • [41] Prediction of Wind Power with Machine Learning Models
    Karaman, Omer Ali
    APPLIED SCIENCES-BASEL, 2023, 13 (20):
  • [42] Machine Learning Models for Inpatient Glucose Prediction
    Zale, Andrew
    Mathioudakis, Nestoras
    CURRENT DIABETES REPORTS, 2022, 22 (08) : 353 - 364
  • [43] Machine Learning Models for Inpatient Glucose Prediction
    Andrew Zale
    Nestoras Mathioudakis
    Current Diabetes Reports, 2022, 22 : 353 - 364
  • [44] Machine Learning Models for Stock Price Prediction
    Nassif, Ali Bou
    AlaaEddin, Maha
    Sahib, Amna Akram
    2020 SEVENTH INTERNATIONAL CONFERENCE ON INFORMATION TECHNOLOGY TRENDS (ITT 2020), 2020, : 67 - 71
  • [45] Interpretable machine learning models for crime prediction
    Zhang, Xu
    Liu, Lin
    Lan, Minxuan
    Song, Guangwen
    Xiao, Luzi
    Chen, Jianguo
    COMPUTERS ENVIRONMENT AND URBAN SYSTEMS, 2022, 94
  • [46] On the Efficiency of Machine Learning Models in Malaria Prediction
    Mbaye, Ousseynou
    Mouhamadou, Lamine B. A.
    Alassane, S. Y.
    PUBLIC HEALTH AND INFORMATICS, PROCEEDINGS OF MIE 2021, 2021, 281 : 437 - 441
  • [47] Explainability of Machine Learning Models for Bankruptcy Prediction
    Park, Min Sue
    Son, Hwijae
    Hyun, Chongseok
    Hwang, Hyung Ju
    IEEE ACCESS, 2021, 9 : 124887 - 124899
  • [48] Machine Learning Models for Multirotor Performance Prediction
    Cornelius, Jason
    Schmitz, Sven
    JOURNAL OF AIRCRAFT, 2024, 61 (04): : 1303 - 1313
  • [49] Machine learning models for the prediction of xeniobiotic metabolism
    Kops, Christina de Bruyn
    Sicho, Martin
    Plonka, Wojtek
    Mazzolari, Angelica
    Kochev, Nikolay
    Jeliazkova, Nina
    Pedretti, Alessandro
    Svozil, Daniel
    Testa, Bernard
    Vistoli, Giulio
    Kirchmair, Johannes
    ABSTRACTS OF PAPERS OF THE AMERICAN CHEMICAL SOCIETY, 2018, 256
  • [50] Improved prediction of gene expression through integrating cell signalling models with machine learning
    Nada Al taweraqi
    Ross D. King
    BMC Bioinformatics, 23