Dementia risk predictions from German claims data using methods of machine learning

被引:12
|
作者
Reinke, Constantin [1 ]
Doblhammer, Gabriele [1 ,2 ]
Schmid, Matthias [2 ,3 ]
Welchowski, Thomas [3 ]
机构
[1] Univ Rostock, Inst Sociol & Demog, Ulmenstr 69, D-18057 Rostock, Germany
[2] German Ctr Neurodegenerat Dis, Bonn, Germany
[3] Univ Bonn, Med Fac, Inst Med Biometry Informat & Epidemiol IMBIE, Bonn, Germany
关键词
calibration; dementia; discrimination; Germany; health claims data; machine learning; risk factors; ALZHEIMERS-DISEASE; COGNITIVE IMPAIRMENT; PREVALENCE; EPIDEMIOLOGY; VALIDATION; DECLINE;
D O I
10.1002/alz.12663
中图分类号
R74 [神经病学与精神病学];
学科分类号
摘要
Introduction We examined whether German claims data are suitable for dementia risk prediction, how machine learning (ML) compares to classical regression, and what the important predictors for dementia risk are. Methods We analyzed data from the largest German health insurance company, including 117,895 dementia-free people age 65+. Follow-up was 10 years. Predictors were: 23 age-related diseases, 212 medical prescriptions, 87 surgery codes, as well as age and sex. Statistical methods included logistic regression (LR), gradient boosting (GBM), and random forests (RFs). Results Discriminatory power was moderate for LR (C-statistic = 0.714; 95% confidence interval [CI] = 0.708-0.720) and GBM (C-statistic = 0.707; 95% CI = 0.700-0.713) and lower for RF (C-statistic = 0.636; 95% CI = 0.628-0.643). GBM had the best model calibration. We identified antipsychotic medications and cerebrovascular disease but also a less-established specific antibacterial medical prescription as important predictors. Discussion Our models from German claims data have acceptable accuracy and may provide cost-effective decision support for early dementia screening.
引用
收藏
页码:477 / 486
页数:10
相关论文
共 50 条
  • [1] USING SUPERVISED MACHINE LEARNING METHODS TO IDENTIFY INDIVIDUALS AT RISK OF DEMENTIA
    Divecha, Ayushi
    Bannon, Sarah
    Dams-O'Connor, Kristen
    INNOVATION IN AGING, 2024, 8 : 425 - 425
  • [2] Machine Learning Methods for Disease Prediction with Claims Data
    Christensen, Tanner
    Frandsen, Abraham
    Glazier, Seth
    Humpherys, Jeffrey
    Kartchner, David
    2018 IEEE INTERNATIONAL CONFERENCE ON HEALTHCARE INFORMATICS (ICHI), 2018, : 467 - 471
  • [3] Exploration of machine learning methods for maritime risk predictions
    Knapp, Sabine
    van de Velden, Michel
    MARITIME POLICY & MANAGEMENT, 2024, 51 (07) : 1443 - 1473
  • [4] Detecting Unrecognized Dementia Using Deep Learning Methods with Korean National Claims Data
    Yoo, K. B.
    Nam, J. W.
    Lee, K. M.
    Kang, T. H.
    Yoon, J. H.
    EUROPEAN JOURNAL OF PUBLIC HEALTH, 2024, 34
  • [5] Gravity Predictions in Data-Missing Areas Using Machine Learning Methods
    Liu, Yubin
    Zhang, Yi
    Pang, Qipei
    Liu, Sulan
    Li, Shaobo
    Shi, Xuguo
    Bian, Shaofeng
    Wu, Yunlong
    REMOTE SENSING, 2024, 16 (22)
  • [6] Interpretable machine learning methods for predictions in systems biology from omics data
    Sidak, David
    Schwarzerova, Jana
    Weckwerth, Wolfram
    Waldherr, Steffen
    FRONTIERS IN MOLECULAR BIOSCIENCES, 2022, 9
  • [7] Macroeconomic Predictions Using Payments Data and Machine Learning
    Chapman, James T. E.
    Desai, Ajit
    FORECASTING, 2023, 5 (04): : 652 - 683
  • [8] Machine learning and ligand binding predictions: A review of data, methods, and obstacles
    Ellingson, Sally R.
    Davis, Brian
    Allen, Jonathan
    BIOCHIMICA ET BIOPHYSICA ACTA-GENERAL SUBJECTS, 2020, 1864 (06):
  • [9] IDENTIFYING POTENTIAL LONG-COVID PATIENTS USING MACHINE LEARNING: A GERMAN CLAIMS DATA ANALYSIS
    Pacis, S.
    Bolzani, A.
    Maywald, U.
    Wilke, T.
    VALUE IN HEALTH, 2023, 26 (12) : S415 - S416
  • [10] Assessment of Healthcare Claims Rejection Risk Using Machine Learning
    Chimmad, Anundhara
    Saripalli, Prasad
    Tirumala, Venu
    2017 IEEE 19TH INTERNATIONAL CONFERENCE ON E-HEALTH NETWORKING, APPLICATIONS AND SERVICES (HEALTHCOM), 2017,