Establishing machine learning models to predict the early risk of gastric cancer based on lifestyle factors

Cited by: 19
Authors
Afrash, Mohammad Reza [1]
Shafiee, Mohsen [2]
Kazemi-Arpanahi, Hadi [3]
Affiliations
[1] Smart Univ Med Sci, Dept Artificial Intelligence, Tehran, Iran
[2] Abadan Univ Med Sci, Dept Nursing, Abadan, Iran
[3] Abadan Univ Med Sci, Dept Hlth Informat Technol, Abadan, Iran
Keywords
Machine learning; Gastric cancer; Behavioral lifestyle; Prevention; Prognosis; Endoscopic submucosal dissection; Neural network; Decision tree; Surgery
DOI
10.1186/s12876-022-02626-x
Chinese Library Classification (CLC) number
R57 [Diseases of the digestive system and abdomen]
Abstract
Background: Gastric cancer is one of the leading causes of death worldwide. Screening for gastric cancer greatly relies on endoscopy and pathology biopsy, which are invasive and pose financial burdens. Thus, the prevention of the disease by modifying lifestyle-related behaviors and dietary habits or even the prevention of risk factor formation is of great importance. This study aimed to construct an inexpensive, non-invasive, fast, and high-precision diagnostic model using six machine learning (ML) algorithms to classify patients at high or low risk of developing gastric cancer by analyzing individual lifestyle factors.
Methods: This retrospective study used the data of 2029 individuals from the gastric cancer database of Ayatollah Taleghani Hospital in Abadan City, Iran. The data were randomly separated into training and test sets (ratio 0.7:0.3). Six ML methods, including multilayer perceptron (MLP), support vector machine (SVM) (linear kernel), SVM (RBF kernel), k-nearest neighbors (KNN) (K = 1, 3, 7, 9), random forest (RF), and eXtreme Gradient Boosting (XGBoost), were trained to construct prognostic models before and after performing the relief feature selection method. Finally, to evaluate the models' performance, the metrics derived from the confusion matrix were calculated via a test split and cross-validation.
Results: This study found 11 important influence factors for the risk of gastric cancer, such as Helicobacter pylori infection, high salt intake, and chronic atrophic gastritis, among other factors. Comparisons indicated that the XGBoost had the best performance for the risk prediction of gastric cancer.
Conclusions: The results suggest that based on simple baseline patient data, the ML techniques have the potential to start the prescreening of gastric cancer and identify high-risk individuals who should proceed with invasive examinations. Our model could also considerably lessen the number of cases that need endoscopic surveillance. Future studies are required to validate the efficacy of the models in a larger and multicenter population.
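The evaluation procedure described in the Methods can be illustrated with a minimal sketch. This is not the authors' code: it covers only the random 0.7:0.3 split, a plain KNN classifier at the K values listed in the abstract (1, 3, 7, 9), and metrics derived from the confusion matrix. The toy data and every function name (`split_train_test`, `knn_predict`, `confusion_metrics`, `make_toy_rows`) are hypothetical, and Relief feature selection, cross-validation, and the other five algorithms are omitted.

```python
import random
from collections import Counter


def split_train_test(rows, test_ratio=0.3, seed=0):
    """Shuffle rows and split them into (train, test) at the given ratio."""
    rng = random.Random(seed)
    shuffled = list(rows)
    rng.shuffle(shuffled)
    n_test = round(len(shuffled) * test_ratio)
    return shuffled[n_test:], shuffled[:n_test]


def knn_predict(train, x, k):
    """Majority vote among the k training rows nearest to x (squared Euclidean)."""
    nearest = sorted(
        train, key=lambda row: sum((a - b) ** 2 for a, b in zip(row[0], x))
    )[:k]
    return Counter(label for _, label in nearest).most_common(1)[0][0]


def confusion_metrics(y_true, y_pred):
    """Confusion-matrix counts and the usual metrics derived from them."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(t == 1 and p == 1 for t, p in pairs)
    tn = sum(t == 0 and p == 0 for t, p in pairs)
    fp = sum(t == 0 and p == 1 for t, p in pairs)
    fn = sum(t == 1 and p == 0 for t, p in pairs)

    def ratio(num, den):
        return num / den if den else 0.0

    return {
        "tp": tp, "tn": tn, "fp": fp, "fn": fn,
        "accuracy": ratio(tp + tn, len(pairs)),
        "sensitivity": ratio(tp, tp + fn),
        "specificity": ratio(tn, tn + fp),
        "precision": ratio(tp, tp + fp),
    }


def make_toy_rows(n=200, seed=1):
    """Hypothetical data: three binary flags; label 1 if two or more are set,
    with about 10% of labels flipped as noise. Not the study's dataset."""
    rng = random.Random(seed)
    rows = []
    for _ in range(n):
        features = tuple(rng.randint(0, 1) for _ in range(3))
        label = 1 if sum(features) >= 2 else 0
        if rng.random() < 0.1:
            label = 1 - label
        rows.append((features, label))
    return rows


if __name__ == "__main__":
    train, test = split_train_test(make_toy_rows())
    y_true = [label for _, label in test]
    for k in (1, 3, 7, 9):  # K values listed in the abstract
        y_pred = [knn_predict(train, x, k) for x, _ in test]
        m = confusion_metrics(y_true, y_pred)
        print(k, round(m["accuracy"], 3), round(m["sensitivity"], 3),
              round(m["specificity"], 3))
```

With 2029 records, a 0.3 test ratio in this sketch yields 609 test rows and 1420 training rows; the abstract does not report the study's actual split sizes.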
Pages: 13
Related papers
50 in total
  • [21] Modeling Epidemiology Data with Machine Learning Technique to Detect Risk Factors for Gastric Cancer
    Mohammadnezhad, Kimia
    Sahebi, Mahmod Reza
    Alatab, Sudabeh
    Sadjadi, Alireza
    JOURNAL OF GASTROINTESTINAL CANCER, 2024, 55 (01) : 287 - 296
  • [22] Risk Factors and Machine Learning-Based Prediction Models for Early Readmission after Thoracoabdominal Aortic Dissection
    Bolourani, Siavash
    Patel, Vihas M.
    Etkin, Yana
    Landis, Gregg
    Mussa, Firas
    JOURNAL OF THE AMERICAN COLLEGE OF SURGEONS, 2020, 231 (04) : S356 - S356
  • [23] Analysis of Risk Factors for Cervical Cancer Based on Machine Learning Methods
    Deng, Xiaoyu
    Luo, Yan
    Wang, Cong
    PROCEEDINGS OF 2018 5TH IEEE INTERNATIONAL CONFERENCE ON CLOUD COMPUTING AND INTELLIGENCE SYSTEMS (CCIS), 2018, : 631 - 635
  • [24] Machine learning: A non-invasive prediction method for gastric cancer based on a survey of lifestyle behaviors
    Jiang, Siqing
    Gao, Haojun
    He, Jiajin
    Shi, Jiaqi
    Tong, Yuling
    Wu, Jian
    FRONTIERS IN ARTIFICIAL INTELLIGENCE, 2022, 5
  • [25] The impact of lifestyle factors on the risk of gastric cancer in a cross-sectional study
    Esmaili, M.
    Oghalaie, A.
    Mohajerani, N.
    Saberi, S.
    Talebkhan, Y.
    Ebrahimzadeh, F.
    Samadi, T.
    Karimi, T.
    Nahvijou, A.
    Tashakoripour, M.
    Abdirad, A.
    Hosseini, M. Eshagh
    Mohagheghi, M.
    Mohammadi, M.
    HELICOBACTER, 2014, 19 : 101 - 102
  • [27] Machine learning was used to predict risk factors for distant metastasis of pancreatic cancer and prognosis analysis
    Yao, Qianyun
    Jia, Weili
    Chen, Siyan
    Wang, Qingqing
    Liu, Zhekui
    Liu, Danping
    Ji, Xincai
    JOURNAL OF CANCER RESEARCH AND CLINICAL ONCOLOGY, 2023, 149 (12) : 10279 - 10291
  • [28] Personalized immune subtypes based on machine learning predict response to checkpoint blockade in gastric cancer
    Huang, Weibin
    Zhang, Yuhui
    Chen, Songyao
    Yin, Haofan
    Liu, Guangyao
    Zhang, Huaqi
    Xu, Jiannan
    Yu, Jishang
    Xia, Yujian
    He, Yulong
    Zhang, Changhua
    BRIEFINGS IN BIOINFORMATICS, 2023, 24 (01)
  • [29] Machine learning and network-based models to identify genetic risk factors to the progression and survival of colorectal cancer
    Hossain, Md Jakir
    Chowdhury, Utpala Nanda
    Islam, M. Babul
    Uddin, Shahadat
    Ahmed, Mohammad Boshir
    Quinn, Julian M. W.
    Moni, Mohammad Ali
    COMPUTERS IN BIOLOGY AND MEDICINE, 2021, 135
  • [30] Cervical Cancer Risk Prediction Model and Analysis of Risk Factors based on Machine Learning
    Yang, Wenying
    Gou, Xin
    Xu, Tongqing
    Yi, Xiping
    Jiang, Maohong
    ICBBT 2019: 2019 11TH INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICAL TECHNOLOGY, 2019, : 50 - 54