A review of statistical and machine learning methods for modeling cancer risk using structured clinical data

被引:89
|
作者
Richter, Aaron N. [1 ,2 ]
Khoshgoftaar, Taghi M. [1 ]
机构
[1] Florida Atlantic Univ, Boca Raton, FL 33431 USA
[2] Modernizing Med Inc, Boca Raton, FL 33431 USA
关键词
Cancer prediction; Cancer recurrence; Cancer relapse; Data mining; Machine learning; Electronic health records; HEPATOCELLULAR-CARCINOMA; PREDICTIVE MODELS; PROGNOSTIC INDEX; LOCAL RECURRENCE; SURGERY; IDENTIFICATION; NOMOGRAM; FUTURE;
D O I
10.1016/j.artmed.2018.06.002
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Advancements are constantly being made in oncology, improving prevention and treatment of cancers. To help reduce the impact and deadliness of cancers, they must be detected early. Additionally, there is a risk of cancers recurring after potentially curative treatments are performed. Predictive models can be built using historical patient data to model the characteristics of patients that developed cancer or relapsed. These models can then be deployed into clinical settings to determine if new patients are at high risk for cancer development or recurrence. For large-scale predictive models to be built, structured data must be captured for a wide range of diverse patients. This paper explores current methods for building cancer risk models using structured clinical patient data. Trends in statistical and machine learning techniques are explored, and gaps are identified for future research. The field of cancer risk prediction is a high-impact one, and research must continue for these models to be embraced for clinical decision support of both practitioners and patients.
引用
收藏
页码:1 / 14
页数:14
相关论文
共 50 条
  • [31] Breast cancer risk prediction using machine learning: a systematic review
    Hussain, Sadam
    Ali, Mansoor
    Naseem, Usman
    Nezhadmoghadam, Fahimeh
    Jatoi, Munsif Ali
    Gulliver, T. Aaron
    Tamez-Pena, Jose Gerardo
    FRONTIERS IN ONCOLOGY, 2024, 14
  • [32] A review of ADHD detection studies with machine learning methods using rsfMRI data
    Taspinar, Gurcan
    Ozkurt, Nalan
    NMR IN BIOMEDICINE, 2024, 37 (08)
  • [33] Modeling the productivity of mechanized CTL harvesting with statistical machine learning methods
    Liski, Eero
    Jounela, Pekka
    Korpunen, Heikki
    Sosa, Amanda
    Lindroos, Ola
    Jylha, Paula
    INTERNATIONAL JOURNAL OF FOREST ENGINEERING, 2020, 31 (03) : 253 - 262
  • [34] Statistical learning theory of structured data
    Pastore, Mauro
    Rotondo, Pietro
    Erba, Vittorio
    Gherardi, Marco
    PHYSICAL REVIEW E, 2020, 102 (03)
  • [35] Medical Diagnosis Using Machine Learning: A Statistical Review
    Bhavsar, Kaustubh Arun
    Singla, Jimmy
    Al-Otaibi, Yasser D.
    Song, Oh-Young
    Bin Zikriya, Yousaf
    Bashir, Ali Kashif
    CMC-COMPUTERS MATERIALS & CONTINUA, 2021, 67 (01): : 107 - 125
  • [36] Exploring Gene Expression and Clinical Data for Identifying Prostate Cancer Severity Levels using Machine Learning Methods
    Al Marouf, Ahmed
    Alhajj, Reda
    Rokne, Jon G.
    Ghose, Sunita
    Bismar, Tarek A.
    2023 IEEE CANADIAN CONFERENCE ON ELECTRICAL AND COMPUTER ENGINEERING, CCECE, 2023,
  • [37] Predicting High-Risk Prostate Cancer Using Machine Learning Methods
    Barlow, Henry
    Mao, Shunqi
    Khushi, Matloob
    DATA, 2019, 4 (03)
  • [38] A review of machine learning methods used for educational data
    Ersozlu, Zara
    Taheri, Sona
    Koch, Inge
    EDUCATION AND INFORMATION TECHNOLOGIES, 2024, 29 (16) : 22125 - 22145
  • [39] A Review of Methods Used in Machine Learning and Data Analysis
    Wu, Qingyang
    ICMLC 2019: 2019 11TH INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND COMPUTING, 2019, : 37 - 45
  • [40] A simulation study on missing data imputation for dichotomous variables using statistical and machine learning methods
    Yingfeng Ge
    Zhiwei Li
    Jinxin Zhang
    Scientific Reports, 13