Breast Carcinoma Prediction Through Integration of Machine Learning Models

被引:0
|
作者
Martinez-Licort, Rosmeri [1 ]
Leon, Carlos de la Cruz [2 ,3 ]
Agarwal, Deevyankar [2 ]
Sahelices, Benjamin [1 ]
de la Torre, Isabel [2 ]
Miramontes-Gonzalez, Jose Pablo [4 ,5 ]
Amoon, Mohammed [6 ]
机构
[1] Univ Valladolid, Dept Comp Sci, GCME Res Grp, Valladolid 47011, Spain
[2] Univ Valladolid, Dept Signal Theory Commun & Telemat Engn, Valladolid 47011, Spain
[3] CARTIF Technol Ctr, Valladolid 47151, Spain
[4] Univ Valladolid, Fac Med, Dept Med, Valladolid 47005, Spain
[5] Rio Hortega Univ Hosp, Internal Med Serv, Valladolid 47012, Spain
[6] King Saud Univ, Community Coll, Dept Comp Sci, Riyadh 11437, Saudi Arabia
来源
IEEE ACCESS | 2024年 / 12卷
关键词
Support vector machines; Breast cancer; Data models; Training; Accuracy; Principal component analysis; Analytical models; Ensemble learning; Machine learning; ensemble learning; machine learning; majority voting; principal component analysis; CANCER DIAGNOSIS; SCHEME;
D O I
10.1109/ACCESS.2024.3431998
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Breast cancer poses a global health challenge, with high incidence and mortality rates. Early detection and precise diagnosis are crucial for patient prognosis. Machine learning (ML) models applied to mammary biopsy image data hold promise for achieving an efficient and accurate breast cancer diagnosis. In this study, we evaluated the performance of several ML algorithms, including Logistic Regression (LR), Random Forest (RF), Naive Bayes (NB) and Support Vector Machine (SVM). We establish evaluation contexts by implementing data standardization and reducing the correlation between variables. Firstly, we select the best-performing parameters for each algorithm by building and evaluating the individual models. Then, we implement a combined model using weighted voting, where the weights of each model are determined based on its performance on the test dataset. The final model is constructed by combining the LR, RF and SVM models. We find that SVM is the best-performance individual model, so it has the highest weight in the final model. The final integrated model achieves an accuracy of 98%, a precision of 97%, a recall of 99%, an F1-score of 98% and an AUC of 0.98. Our weighted voting model compares favourably with the other models analysed. This approach demonstrates its efficiency and transparency in handling structured medical data. It is a prototype that will be refined and expanded to encompass larger real-world datasets.
引用
收藏
页码:134635 / 134650
页数:16
相关论文
共 50 条
  • [21] Enhancing Vault Prediction and ICL Sizing Through Advanced Machine Learning Models
    Zhu, Jun
    Li, Fen -Fen
    Li, Gao-Xiang
    Jiang, Shang -Yang
    Cheng, Dan
    Bao, Fang -Jun
    Wu, Shuang-Qing
    Dai, Qi
    Ye, Yu-Feng
    JOURNAL OF REFRACTIVE SURGERY, 2024, 40 (03) : e126 - e132
  • [22] Prediction of the Debonding Failure of Beams Strengthened with FRP through Machine Learning Models
    Hu, Tianyu
    Zhang, Hong
    Zhou, Jianting
    BUILDINGS, 2023, 13 (03)
  • [23] Prediction of bioavailability and toxicity of complex chemical mixtures through machine learning models
    Cipullo, S.
    Snapir, B.
    Prpich, G.
    Campo, P.
    Coulon, F.
    CHEMOSPHERE, 2019, 215 : 388 - 395
  • [24] Integration of machine learning and first principles models
    Rajulapati, Lokesh
    Chinta, Sivadurgaprasad
    Shyamala, Bala
    Rengaswamy, Raghunathan
    AICHE JOURNAL, 2022, 68 (06)
  • [25] Advancing Breast Cancer Diagnosis through Breast Mass Images, Machine Learning, and Regression Models
    Zaylaa, Amira J.
    Kourtian, Sylva
    SENSORS, 2024, 24 (07)
  • [26] Identification of Secondary Breast Cancer in Vital Organs through the Integration of Machine Learning and Microarrays
    Riaz, Faisal
    Abid, Fazeel
    Din, Ikram Ud
    Kim, Byung-Seo
    Almogren, Ahmad
    Ul Durar, Shajara
    ELECTRONICS, 2022, 11 (12)
  • [27] Poverty Prediction Through Machine Learning
    Huang Zixi
    2021 2ND INTERNATIONAL CONFERENCE ON E-COMMERCE AND INTERNET TECHNOLOGY (ECIT 2021), 2021, : 314 - 324
  • [28] Explainable artificial intelligence for stroke prediction through comparison of deep learning and machine learning models
    Moulaei, Khadijeh
    Afshari, Lida
    Moulaei, Reza
    Sabet, Babak
    Mousavi, Seyed Mohammad
    Afrash, Mohammad Reza
    SCIENTIFIC REPORTS, 2024, 14 (01):
  • [29] The Enhancement of Machine Learning-Based Engine Models Through the Integration of Analytical Functions
    Brusa, Alessandro
    Shethia, Fenil Panalal
    Petrone, Boris
    Cavina, Nicolo
    Moro, Davide
    Galasso, Giovanni
    Kitsopanidis, Ioannis
    ENERGIES, 2024, 17 (21)
  • [30] Enhancing cure rate analysis through integration of machine learning models: a comparative study
    Aselisewine, Wisdom
    Pal, Suvra
    STATISTICS AND COMPUTING, 2024, 34 (04)