A web-based tool for cancer risk prediction for middle-aged and elderly adults using machine learning algorithms and self-reported questions

被引:0
|
作者
Xiao, Xingjian [1 ]
Yi, Xiaohan [1 ]
Soe, Nyi Nyi [2 ,3 ]
Latt, Phyu Mon [2 ,3 ]
Lin, Luotao [4 ]
Chen, Xuefen [1 ]
Song, Hualing [1 ]
Sun, Bo [5 ]
Zhao, Hailei [1 ]
Xu, Xianglong [1 ,2 ,3 ,6 ,7 ]
机构
[1] Shanghai Univ Tradit Chinese Med, Sch Publ Hlth, Shanghai, Peoples R China
[2] Monash Univ, Fac Med Nursing & Hlth Sci, Sch Translat Med, Clayton, Vic, Australia
[3] Alfred Hlth, Melbourne Sexual Hlth Ctr, Artificial Intelligence & Modelling Epidemiol Prog, Carlton, Vic, Australia
[4] Univ New Mexico, Dept Individual Family & Community Educ, Nutr & Dietet Program, Albuquerque, NM USA
[5] Shanghai Univ Tradit Chinese Med, LongHua Hosp, Endoscopy Ctr, Shanghai, Peoples R China
[6] Shanghai Univ Tradit Chinese Med, Bijie Inst, Bijie, Peoples R China
[7] Bijie Dist Ctr Dis Control & Prevent, Doctoral Workstat, Bijie, Peoples R China
关键词
Cancer; Pan-cancer; Prediction; Web-based; Risk; Co-management; Co-prevention; Middle-aged; China; Machine learning; HEALTH;
D O I
10.1016/j.annepidem.2024.12.003
中图分类号
R1 [预防医学、卫生学];
学科分类号
1004 ; 120402 ;
摘要
Background: From a global perspective, China is one of the countries with higher incidence and mortality rates for cancer. Objective: Our objective is to create an online cancer risk prediction tool for middle-aged and elderly Chinese adults by leveraging machine learning algorithms and self-reported data. Method: Drawing from a cohort of 19,798 participants aged 45 and above from the China Health and Retirement Longitudinal Study (2011 - 2018), we employed nine machine learning algorithms (LR: Logistic Regression, Adaboost: Adaptive Boosting, SVM: Support Vector Machine, RF: Random Forest, GNB: Gaussian Naive Bayes, GBM: Gradient Boosting Machine, LGBM: Light Gradient Boosting Machine, XGBoost: eXtreme Gradient Boosting, KNN: K - Nearest Neighbors), which are mainly used for classification and regression tasks, to construct predictive models for various cancers. Utilizing non-invasive self-reported predictors encompassing demographic, educational, marital, lifestyle, health history, and other factors, we focused on predicting "Cancer or Malignant Tumour" outcomes. The types of cancers that can be predicted mainly include lung cancer, breast cancer, cervical cancer, colorectal cancer, gastric cancer, esophageal cancer, and other rare cancers. Results: The developed tool, MyCancerRisk, demonstrated significant performance, with the Random Forest algorithm achieving an AUC of 0.75 and ACC of 0.99 using self-reported variables. Key predictors identified include age, self-rated health, sleep patterns, household heating sources, childhood health status, living conditions, and smoking habits. Conclusion: MyCancerRisk aims to serve as a preventative screening tool, encouraging individuals to undergo testing and adopt healthier behaviours to mitigate the public health impact of cancer. Our study also sheds light on unconventional predictors, such as housing conditions, offering valuable insights for refining cancer prediction models.
引用
收藏
页码:27 / 35
页数:9
相关论文
共 48 条
  • [21] RISK FACTORS FOR OSTEOPOROSIS AND FRACTURES IN MIDDLE-AGED WOMEN WITH SELF-REPORTED OSTEOPOROSIS: A CROSS-SECTIONAL POPULATION-BASED STUDY
    Pedro, A. O.
    Gueldini de Moraes, A. V.
    Costa-Paiva, L.
    AGING CLINICAL AND EXPERIMENTAL RESEARCH, 2022, 34 (SUPPL 1) : S446 - S446
  • [22] Machine learning-based prediction of sarcopenia in community-dwelling middle-aged and older adults: findings from the CHARLS
    Wang, Zongjie
    Wu, Yafei
    Zhu, Junmin
    Fang, Ya
    PSYCHOGERIATRICS, 2025, 25 (01)
  • [23] Web-Based Patient Self-Reported Outcome After Radiotherapy in Adolescents and Young Adults With Cancer: Survey on Acceptance of Digital Tools
    Vogel, Marco M. E.
    Eitz, Kerstin A.
    Combs, Stephanie E.
    JMIR MHEALTH AND UHEALTH, 2021, 9 (01):
  • [24] Applying machine learning to predict bowel preparation adequacy in elderly patients for colonoscopy: development and validation of a web-based prediction tool
    Liu, Jianying
    Jiang, Wei
    Yu, Yahong
    Gong, Jiali
    Chen, Guie
    Yang, Yuxing
    Wang, Chao
    Sun, Dalong
    Lu, Xuefeng
    ANNALS OF MEDICINE, 2025, 57 (01)
  • [25] Predictive modeling and web-based tool for cervical cancer risk assessment: A comparative study of machine learning models
    Chauhan, Ritu
    Goel, Anika
    Alankar, Bhavya
    Kaur, Harleen
    METHODSX, 2024, 12
  • [26] Self-Reported Physical Activity Among Middle-Aged Cancer Survivors in the United States: Behavioral Risk Factor Surveillance System Survey, 2009
    Nayak, Pratibha
    Holmes, Holly M.
    Nguyen, Hoang T.
    Elting, Linda S.
    PREVENTING CHRONIC DISEASE, 2014, 11
  • [27] The association between polluted fuel use and self-reported insomnia symptoms among middle-aged and elderly Indian adults: a cross-sectional study based on LASI, wave 1
    Siqi Leng
    Yuming Jin
    Michael V. Vitiello
    Ye Zhang
    Rong Ren
    Lin Lu
    Jie Shi
    Xiangdong Tang
    BMC Public Health, 23
  • [28] The association between polluted fuel use and self-reported insomnia symptoms among middle-aged and elderly Indian adults: a cross-sectional study based on LASI, wave 1
    Leng, Siqi
    Jin, Yuming
    Vitiello, Michael V.
    Zhang, Ye
    Ren, Rong
    Lu, Lin
    Shi, Jie
    Tang, Xiangdong
    BMC PUBLIC HEALTH, 2023, 23 (01)
  • [29] Association of Regular Leisure-Time Physical Activity with Self-Reported Body Mass Index and Obesity Risk among Middle-Aged and Older Adults in Taiwan
    Chen, Yun-Tsung
    Lee, Po-Fu
    Lin, Chi-Fang
    Chang, Andy
    Chung, Yu-Chun
    Yeh, Chi-Pei
    Chen, Shih-Chang
    Ho, Chien-Chang
    HEALTHCARE, 2021, 9 (12)
  • [30] Association between self-reported child maltreatment and risk of hospital-treated infectious diseases in middle-aged and older adults: A UK Biobank cohort study
    Huang, Yifang
    Lin, Ruilang
    Wang, Wenhao
    Pan, Lulu
    Huang, Chen
    Yu, Yongfu
    Qin, Guoyou
    Bao, Zhijun
    Zheng, Xueying
    PREVENTIVE MEDICINE, 2024, 189