Predicting type 1 diabetes in children using electronic health records in primary care in the UK: development and validation of a machine-learning algorithm

被引:6
|
作者
Daniel R. [1 ]
Jones H. [1 ]
Gregory J.W. [1 ]
Shetty A. [2 ]
Francis N. [3 ]
Paranjothy S. [4 ]
Townson J. [5 ]
机构
[1] Division of Population Medicine, School of Medicine, Cardiff University, Cardiff
[2] The Noah's Ark Children's Hospital for Wales, Department of Paediatric Diabetes and Endocrinology, Cardiff and Vale University Health Board, Cardiff
[3] Primary Care Research Centre, University of Southampton, Southampton
[4] Public Health Directorate, NHS Grampian, Aberdeen
[5] Centre for Trials Research, Cardiff University, Cardiff
来源
The Lancet Digital Health | 2024年 / 6卷 / 06期
关键词
Compendex;
D O I
10.1016/S2589-7500(24)00050-5
中图分类号
学科分类号
摘要
Background: Children presenting to primary care with suspected type 1 diabetes should be referred immediately to secondary care to avoid life-threatening diabetic ketoacidosis. However, early recognition of children with type 1 diabetes is challenging. Children might not present with classic symptoms, or symptoms might be attributed to more common conditions. A quarter of children present with diabetic ketoacidosis, a proportion unchanged over 25 years. Our aim was to investigate whether a machine-learning algorithm could lead to earlier detection of type 1 diabetes in primary care. Methods: We developed the predictive algorithm using Welsh primary care electronic health records (EHRs) linked to the Brecon Dataset, a register of children newly diagnosed with type 1 diabetes. Children were included from their first primary care record within the study period of Jan 1, 2000, to Dec 31, 2016, until either type 1 diabetes diagnosis, they turned 15 years of age, or study end. We developed an ensemble learner (SuperLearner) using 26 potential predictors. Validation of the algorithm was done in English EHRs from the Clinical Practice Research Datalink (primary care) and Hospital Episode Statistics, focusing on the ability of the algorithm to identify children who went on to develop type 1 diabetes and the time by which diagnosis could be anticipated. Findings: The development dataset comprised 34 754 400 primary care contacts, relating to 952 402 children, and the validation dataset comprised 43 089 103 primary care contacts, relating to 1 493 328 children. Of these, 1829 (0·19%) children younger than 15 years in the development dataset, and 1516 (0·10%) in the validation dataset had a reliable date of type 1 diabetes diagnosis. If set to give an alert in 10% of contacts, an estimated 71·6% (95% CI 68·8–74·4) of the children with type 1 diabetes would receive an alert by the algorithm in the 90 days before diagnosis, with diagnosis anticipated, on average, by an estimated 9·34 days (95% CI 7·77–10·9). Interpretation: If implemented into primary care settings, this predictive algorithm could substantially reduce the proportion of patients with new-onset type 1 diabetes presenting in diabetic ketoacidosis. Acceptability of alert thresholds should be explored in primary care. Funding: Diabetes UK. © 2024 The Author(s). Published by Elsevier Ltd. This is an Open Access article under the CC BY-NC 4.0 license
引用
收藏
页码:e386 / e395
页数:9
相关论文
共 50 条
  • [21] Development and validation of a pancreatic cancer prediction model from electronic health records using machine learning
    Appelbaum, Limor
    Cambronero, Jose Pablo
    Pollick, Karla
    Silva, George
    Stevens, Jennifer P.
    Mamon, Harvey J.
    Kaplan, Irving D.
    Rinard, Martin
    JOURNAL OF CLINICAL ONCOLOGY, 2020, 38 (04)
  • [22] Development and validation of a rheumatoid arthritis case definition: a machine learning approach using data from primary care electronic medical records
    Pham, Anh N. Q.
    Barber, Claire E. H.
    Drummond, Neil
    Jasper, Lisa
    Klein, Doug
    Lindeman, Cliff
    Widdifield, Jessica
    Williamson, Tyler
    Jones, C. Allyson
    BMC MEDICAL INFORMATICS AND DECISION MAKING, 2024, 24 (01)
  • [23] Pregnancy losses in women with Type 1 or Type 2 diabetes in the UK: an investigation using primary care records
    McGrogan, A.
    Snowball, J.
    de Vries, C. S.
    DIABETIC MEDICINE, 2014, 31 (03) : 357 - 365
  • [24] Anticipating type 1 diabetes in children using machine learning and routinely-collected data in primary care
    Townson, J.
    Daniel, R.
    Jones, H.
    Francis, N. A.
    Paranjothy, S.
    Baldwin, B.
    Thayer, D.
    Gregory, J. W.
    DIABETIC MEDICINE, 2022, 39
  • [25] Machine Learning for Personalized Medicine: Predicting Primary Myocardial Infarction from Electronic Health Records
    Weiss, Jeremy C.
    Natarajan, Sriraam
    Peissig, Peggy L.
    McCarty, Catherine A.
    Page, David
    AI MAGAZINE, 2012, 33 (04) : 33 - 45
  • [26] Natural Language Processing and Machine Learning for Identifying Incident Stroke From Electronic Health Records: Algorithm Development and Validation
    Zhao, Yiqing
    Fu, Sunyang
    Bielinski, Suzette J.
    Decker, Paul A.
    Chamberlain, Alanna M.
    Roger, Veronique L.
    Liu, Hongfang
    Larson, Nicholas B.
    JOURNAL OF MEDICAL INTERNET RESEARCH, 2021, 23 (03)
  • [27] Phenotyping Hereditary hemorrhagic telangiectasia using UK primary care Electronic Health Records
    Buendia, Orlando
    EUROPEAN JOURNAL OF HUMAN GENETICS, 2023, 31 : 609 - 609
  • [28] Development and Validation of a Primary Care Electronic Health Record Phenotype to Study Migration and Health in the UK
    Pathak, Neha
    Zhang, Claire X.
    Boukari, Yamina
    Burns, Rachel
    Mathur, Rohini
    Gonzalez-Izquierdo, Arturo
    Denaxas, Spiros
    Sonnenberg, Pam
    Hayward, Andrew
    Aldridge, Robert W.
    INTERNATIONAL JOURNAL OF ENVIRONMENTAL RESEARCH AND PUBLIC HEALTH, 2021, 18 (24)
  • [29] Predicting Causes of Death from Structured Electronic Health Records Using Machine Learning
    Al-Garad, Mohammed
    Reeves, Ruth M.
    Desai, Rishi J.
    LeNoue-Newton, Michele
    Park, Daniel
    Wang, Shirley V.
    Maro, Judith C.
    Fuller, Candace C.
    Lin, Kueiyu Joshua
    Hernandez-Munoz, Jose J.
    Kuzucan, Aida
    Wang, Xi
    Pillai, Haritha
    Ngan, Kerry
    Whitaker, Jill
    Deere, Jessica
    McLemore, Michael F.
    Westerman, Dax
    Matheny, Michael E.
    PHARMACOEPIDEMIOLOGY AND DRUG SAFETY, 2024, 33 : 71 - 72
  • [30] Predicting Acute Kidney Injury: A Machine Learning Approach Using Electronic Health Records
    Abdullah, Sheikh S.
    Rostamzadeh, Neda
    Sedig, Kamran
    Garg, Amit X.
    McArthur, Eric
    INFORMATION, 2020, 11 (08)