Predicting the risk of diabetes complications using machine learning and social administrative data in a country with ethnic inequities in health: Aotearoa New Zealand

被引:0
|
作者
Nghiem, Nhung [1 ]
Wilson, Nick [2 ]
Krebs, Jeremy [3 ]
Tran, Truyen [4 ]
机构
[1] Univ Otago Wellington, Dept Publ Hlth, Wellington 6021, Wellington, New Zealand
[2] Australian Natl Univ, John Curtin Sch Med Res, Canberra, ACT 2601, Australia
[3] Univ Otago Wellington, Dept Med, Wellington 6021, Wellington, New Zealand
[4] Deakin Univ, Appl Artificial Intelligence Inst A2I2, Geelong, Vic 3216, Australia
关键词
Machine learning; Diabetes complications; Cardiovascular disease; Risk prediction; Health and social administrative data; CARDIOVASCULAR-DISEASE; PRIMARY-CARE; BIG DATA; COMORBIDITY; DERIVATION; ALGORITHM;
D O I
10.1186/s12911-024-02678-x
中图分类号
R-058 [];
学科分类号
摘要
BackgroundIn the age of big data, linked social and administrative health data in combination with machine learning (ML) is being increasingly used to improve prediction in chronic disease, e.g., cardiovascular diseases (CVD). In this study we aimed to apply ML methods on extensive national-level health and social administrative datasets to assess the utility of these for predicting future diabetes complications, including by ethnicity.MethodsFive ML models were used to predict CVD events among all people with known diabetes in the population of New Zealand, utilizing nationwide individual-level administrative data.ResultsThe Xgboost ML model had the best predictive power for predicting CVD events three years into the future among the population with diabetes (N = 145,600). The optimization procedure also found limited improvement in prediction by ethnicity (using area under the receiver operating curve, [AUC]). The results indicated no trade-off between model predictive performance and equity gap of prediction by ethnicity (that is improving model prediction and reducing performance gaps by ethnicity can be achieved simultaneously). The list of variables of importance was different among different models/ethnic groups, for example: age, deprivation (neighborhood-level), having had a hospitalization event, and the number of years living with diabetes.Discussion and conclusionsWe provide further evidence that ML with administrative health data can be used for meaningful future prediction of health outcomes. As such, it could be utilized to inform health planning and healthcare resource allocation for diabetes management and the prevention of CVD events. Our results may suggest limited scope for developing prediction models by ethnic group and that the major ways to reduce inequitable health outcomes is probably via improved delivery of prevention and management to those groups with diabetes at highest need.
引用
收藏
页数:13
相关论文
共 50 条
  • [41] MobDBTest: A machine learning based system for predicting diabetes risk using mobile devices
    Sowjanya, K.
    Singhal, Ayush
    Choudhary, Chaitali
    2015 IEEE INTERNATIONAL ADVANCE COMPUTING CONFERENCE (IACC), 2015, : 397 - 402
  • [42] Explainable artificial intelligence models for predicting risk of suicide using health administrative data in Quebec
    Kharrat, Fatemeh Gholi Zadeh
    Gagne, Christian
    Lesage, Alain
    Gariepy, Genevieve
    Pelletier, Jean-Francois
    Brousseau-Paradis, Camille
    Rochette, Louis
    Pelletier, Eric
    Levesque, Pascale
    Mohammed, Mada
    Wang, JianLi
    PLOS ONE, 2024, 19 (04):
  • [43] The enrolment gap: who is not enrolling with primary health organizations in Aotearoa New Zealand and what are the implications? An exploration of 2015-2019 administrative data
    Irurzun-Lopez, Maite
    Jeffreys, Mona
    Cumming, Jacqueline
    INTERNATIONAL JOURNAL FOR EQUITY IN HEALTH, 2021, 20 (01)
  • [44] Machine Learning For Risk Prediction After Heart Failure Emergency Department Visit or Hospital Admission Using Administrative Health Data
    Fine, Nowell M.
    Kalmady, Sunil V.
    Sun, Weijie
    Greiner, Russ
    Howlett, Jonathan G.
    White, James A.
    Mcalister, Finlay A.
    Ezekowitz, Justin A.
    Kaul, Padma
    PLOS DIGITAL HEALTH, 2024, 3 (10):
  • [45] A machine learning approach to identify distinct subgroups of veterans at risk for hospitalization or death using administrative and electronic health record data
    Parikh, Ravi B.
    Linn, Kristin A.
    Yan, Jiali
    Maciejewski, Matthew L.
    Rosland, Ann-Marie
    Volpp, Kevin G.
    Groeneveld, Peter W.
    Navathe, Amol S.
    PLOS ONE, 2021, 16 (02):
  • [46] Machine learning risk estimation and prediction of death in continuing care facilities using administrative data
    Shahidi, Faezehsadat
    Rennert-May, Elissa
    D'Souza, Adam G.
    Crocker, Alysha
    Faris, Peter
    Leal, Jenine
    SCIENTIFIC REPORTS, 2023, 13 (01)
  • [47] Machine learning risk estimation and prediction of death in continuing care facilities using administrative data
    Faezehsadat Shahidi
    Elissa Rennert-May
    Adam G. D’Souza
    Alysha Crocker
    Peter Faris
    Jenine Leal
    Scientific Reports, 13
  • [48] Predicting high health-cost users among people with cardiovascular disease using machine learning and nationwide linked social administrative datasets
    Nhung Nghiem
    June Atkinson
    Binh P. Nguyen
    An Tran-Duy
    Nick Wilson
    Health Economics Review, 13
  • [49] Predicting high health-cost users among people with cardiovascular disease using machine learning and nationwide linked social administrative datasets
    Nghiem, Nhung
    Atkinson, June
    Nguyen, Binh P.
    Tran-Duy, An
    Wilson, Nick
    HEALTH ECONOMICS REVIEW, 2023, 13 (01)
  • [50] Social impacts and costs of schizophrenia: a national cohort study using New Zealand linked administrative data
    Gibb, Sheree
    Brewer, Naomi
    Bowden, Nicholas
    NEW ZEALAND MEDICAL JOURNAL, 2021, 134 (1537) : 66 - 83