Enhancing Prediction Accuracy in Gastric Cancer Using High-Confidence Machine Learning Models for Class Imbalance

被引:0
|
作者
Jamil, Danish [1 ,2 ]
Palaniappan, Sellappan [1 ]
Naseem, Muhammad [2 ]
Lokman, Asiah [1 ]
机构
[1] Malaysia Univ Sci & Technol, Dept Informat Technol, Kuala Lumpur, Malaysia
[2] Sir Syed Univ Engn & Technol, Dept Software Engn, Karachi, Pakistan
关键词
class imbalance; gastric cancer; decision support system; machine learning; prediction accuracy; naive bayes; logistic regression; medical diagnostics; positive predictive value;
D O I
10.12720/jait.14.6.1410-1424
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Gastric Cancer (GC) diagnosis and prognosis present significant challenges in the clinical industry. To address the issue of low prediction accuracy resulting from imbalanced positive and negative GC cases, this study proposes a medical Decision Support System (DSS) based on supervised Machine Learning (ML) methods. Four ML models, including Naive Bayes (NB), Logistic Regression (LR), and Multilayer Perceptron (MLP), were employed in this study. The impact of data imbalance on GC prediction was assessed through two procedures. Among the ML models, the MLP model demonstrated the best performance in weighted GC prediction, achieving a sensitivity of 0.930 and a Positive Predictive Value (PPV) of 0.932 for balanced predictions, and a sensitivity of 0.918 and a PPV of 0.908 for unbalanced predictions. The NB model showed promise in handling the data imbalance issue, achieving a sensitivity of 0.722 and a PPV of 0.420 on the unbalanced dataset. Additionally, a DSS was developed specifically for the NB and LR models to improve prediction accuracy. The proposed method significantly improved the sensitivity of optimistic GC case prediction, with the Naive Bayes model achieving a sensitivity of 0.936 and the Logistic Regression model achieving a sensitivity of 0.8306. These improvements enhance the reliability and efficiency of GC diagnostics, offering valuable decision support in healthcare. This research provides insights into addressing class imbalance in GC likelihood prediction and has potential implications for clinical practice.
引用
收藏
页码:1410 / 1424
页数:15
相关论文
共 50 条
  • [11] Comparison of machine learning models for the prediction of cancer cells using MHC class I complexes
    Mateo, N.
    Canon, Alvaro David Orjuela
    Charry, Oscar Julian Perdomo
    16TH INTERNATIONAL SYMPOSIUM ON MEDICAL INFORMATION PROCESSING AND ANALYSIS, 2020, 11583
  • [12] Search for High-Confidence Blazar Candidates and Their MWL Counterparts in the Fermi-LAT Catalog Using Machine Learning
    Einecke, Sabrina
    GALAXIES, 2016, 4 (03):
  • [13] Using Class Imbalance Learning for Software Defect Prediction
    Wang, Shuo
    Yao, Xin
    IEEE TRANSACTIONS ON RELIABILITY, 2013, 62 (02) : 434 - 443
  • [14] Handling Class Imbalance in Machine Learning-based Prediction Models: A Case Study in Asthma Management
    Budiarto, Arif
    Sheikh, Aziz
    Wilson, Andrew
    Price, David B.
    Shah, Syed Ahmar
    2023 45TH ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE & BIOLOGY SOCIETY, EMBC, 2023,
  • [15] Class Imbalance Issue in Software Defect Prediction Models by various Machine Learning Techniques: An Empirical Study
    Pandey, Sushant Kumar
    Tripathi, Anil Kumar
    2021 8TH INTERNATIONAL CONFERENCE ON SMART COMPUTING AND COMMUNICATIONS (ICSCC), 2021, : 58 - 63
  • [16] Enhancing soil particle content prediction accuracy: advanced hyperspectral analysis and machine learning models
    Wang, Xiao
    Ding, Jianli
    Han, Lijing
    Tan, Jiao
    Ge, Xiangyu
    JOURNAL OF SOILS AND SEDIMENTS, 2024, 24 (10) : 3443 - 3458
  • [17] Diabetes mellitus risk prediction in the presence of class imbalance using flexible machine learning methods
    Somayeh Sadeghi
    Davood Khalili
    Azra Ramezankhani
    Mohammad Ali Mansournia
    Mahboubeh Parsaeian
    BMC Medical Informatics and Decision Making, 22
  • [18] Diabetes mellitus risk prediction in the presence of class imbalance using flexible machine learning methods
    Sadeghi, Somayeh
    Khalili, Davood
    Ramezankhani, Azra
    Mansournia, Mohammad Ali
    Parsaeian, Mahboubeh
    BMC MEDICAL INFORMATICS AND DECISION MAKING, 2022, 22 (01)
  • [19] Significance of Accuracy Levels in Cancer Prediction using Machine Learning Techniques
    Kumar, Ajay
    Sushil, Rama
    Tiwari, Arvind Kumar
    BIOSCIENCE BIOTECHNOLOGY RESEARCH COMMUNICATIONS, 2019, 12 (03): : 741 - 747
  • [20] Enhancing prediction accuracy of concrete compressive strength using stacking ensemble machine learning
    Zhao, Yunpeng
    Goulias, Dimitrios
    Saremi, Setare
    COMPUTERS AND CONCRETE, 2023, 32 (03): : 233 - 246