Predicting the risk of diabetes complications using machine learning and social administrative data in a country with ethnic inequities in health: Aotearoa New Zealand

被引:0
|
作者
Nghiem, Nhung [1 ]
Wilson, Nick [2 ]
Krebs, Jeremy [3 ]
Tran, Truyen [4 ]
机构
[1] Univ Otago Wellington, Dept Publ Hlth, Wellington 6021, Wellington, New Zealand
[2] Australian Natl Univ, John Curtin Sch Med Res, Canberra, ACT 2601, Australia
[3] Univ Otago Wellington, Dept Med, Wellington 6021, Wellington, New Zealand
[4] Deakin Univ, Appl Artificial Intelligence Inst A2I2, Geelong, Vic 3216, Australia
关键词
Machine learning; Diabetes complications; Cardiovascular disease; Risk prediction; Health and social administrative data; CARDIOVASCULAR-DISEASE; PRIMARY-CARE; BIG DATA; COMORBIDITY; DERIVATION; ALGORITHM;
D O I
10.1186/s12911-024-02678-x
中图分类号
R-058 [];
学科分类号
摘要
BackgroundIn the age of big data, linked social and administrative health data in combination with machine learning (ML) is being increasingly used to improve prediction in chronic disease, e.g., cardiovascular diseases (CVD). In this study we aimed to apply ML methods on extensive national-level health and social administrative datasets to assess the utility of these for predicting future diabetes complications, including by ethnicity.MethodsFive ML models were used to predict CVD events among all people with known diabetes in the population of New Zealand, utilizing nationwide individual-level administrative data.ResultsThe Xgboost ML model had the best predictive power for predicting CVD events three years into the future among the population with diabetes (N = 145,600). The optimization procedure also found limited improvement in prediction by ethnicity (using area under the receiver operating curve, [AUC]). The results indicated no trade-off between model predictive performance and equity gap of prediction by ethnicity (that is improving model prediction and reducing performance gaps by ethnicity can be achieved simultaneously). The list of variables of importance was different among different models/ethnic groups, for example: age, deprivation (neighborhood-level), having had a hospitalization event, and the number of years living with diabetes.Discussion and conclusionsWe provide further evidence that ML with administrative health data can be used for meaningful future prediction of health outcomes. As such, it could be utilized to inform health planning and healthcare resource allocation for diabetes management and the prevention of CVD events. Our results may suggest limited scope for developing prediction models by ethnic group and that the major ways to reduce inequitable health outcomes is probably via improved delivery of prevention and management to those groups with diabetes at highest need.
引用
收藏
页数:13
相关论文
共 50 条
  • [31] Predicting return to work after traumatic brain injury using machine learning and administrative data
    Van Deynse, Helena
    Cools, Wilfried
    De Deken, Viktor-Jan
    Depreitere, Bart
    Hubloue, Ives
    Kimpe, Eva
    Moens, Maarten
    Pien, Karen
    Tisseghem, Ellen
    Van Belleghem, Griet
    Putman, Koen
    INTERNATIONAL JOURNAL OF MEDICAL INFORMATICS, 2023, 178
  • [32] Predicting postoperative pulmonary infection risk in patients with diabetes using machine learning
    Zhao, Chunxiu
    Xiang, Bingbing
    Zhang, Jie
    Yang, Pingliang
    Liu, Qiaoli
    Wang, Shun
    FRONTIERS IN PHYSIOLOGY, 2024, 15
  • [33] Predicting Diabetes Diseases Using Mixed Data and Supervised Machine Learning Algorithms
    Daanouni, Othmane
    Cherradi, Bouchaib
    Tmiri, Amal
    4TH INTERNATIONAL CONFERENCE ON SMART CITY APPLICATIONS (SCA' 19), 2019,
  • [34] Is parent education a factor in identifying autism/takiwatanga in an ethnic cohort of Pacific children in Aotearoa, New Zealand? A national cross-sectional study using linked administrative data
    Kokaua, Jesse
    Kolose-Pulefolau, Betty
    Ruhe, Troy
    Aldridge, Faith
    Foliaki, Siale
    Kokaua, Liam
    Mapusua, Talai
    Dacombe, Joanne
    Richards, Rosalina
    Blakelock, Russell
    Bowden, Nicholas
    AUTISM, 2024, 28 (07) : 1667 - 1676
  • [35] Estimation of Prevalence and Incremental Costs of Systemic Lupus Erythematosus in a Middle-Income Country Using Machine Learning on Administrative Health Data
    Castro-Villarreal, Santiago
    Beltran-Ostos, Adriana
    Valencia, Carlos F.
    VALUE IN HEALTH REGIONAL ISSUES, 2021, 26 : 98 - 104
  • [36] The enrolment gap: who is not enrolling with primary health organizations in Aotearoa New Zealand and what are the implications? An exploration of 2015–2019 administrative data
    Maite Irurzun-Lopez
    Mona Jeffreys
    Jacqueline Cumming
    International Journal for Equity in Health, 20
  • [37] Identification of postoperative complications using electronic health record data and machine learning
    Bronsert, Michael
    Singh, Abhinav B.
    Henderson, William G.
    Hammermeister, Karl
    Meguid, Robert A.
    Colborn, Kathryn L.
    AMERICAN JOURNAL OF SURGERY, 2020, 220 (01): : 114 - 119
  • [38] Predicting the Risk of Inpatient Hypoglycemia With Machine Learning Using Electronic Health Records
    Ruan, Yue
    Bellot, Alexis
    Moysova, Zuzana
    Tan, Garry D.
    Lumb, Alistair
    Davies, Jim
    van der Schaar, Mihaela
    Rea, Rustam
    DIABETES CARE, 2020, 43 (07) : 1504 - 1511
  • [39] PrOsteoporosis: predicting osteoporosis risk using NHANES data and machine learning approach
    Si, Zebing
    Zhang, Di
    Wang, Huajun
    Zheng, Xiaofei
    BMC RESEARCH NOTES, 2025, 18 (01)
  • [40] Predicting neonatal respiratory distress syndrome and hypoglycaemia prior to discharge: Leveraging health administrative data and machine learning
    Betts, Kim S.
    Kisely, Steve
    Alati, Rosa
    JOURNAL OF BIOMEDICAL INFORMATICS, 2021, 114