Predicting type 1 diabetes in children using electronic health records in primary care in the UK: development and validation of a machine-learning algorithm

Cited by: 6
Authors
Daniel, Rhian [1]
Jones, Hywel [1]
Gregory, John W. [1]
Shetty, Ambika [2]
Francis, Nick [3]
Paranjothy, Shantini [4]
Townson, Julia [5]
Affiliations
[1] Division of Population Medicine, School of Medicine, Cardiff University, Cardiff
[2] The Noah's Ark Children's Hospital for Wales, Department of Paediatric Diabetes and Endocrinology, Cardiff and Vale University Health Board, Cardiff
[3] Primary Care Research Centre, University of Southampton, Southampton
[4] Public Health Directorate, NHS Grampian, Aberdeen
[5] Centre for Trials Research, Cardiff University, Cardiff
Source
The Lancet Digital Health | 2024 / Volume 6 / Issue 06
Keywords
Compendex;
DOI
10.1016/S2589-7500(24)00050-5
Abstract
Background: Children presenting to primary care with suspected type 1 diabetes should be referred immediately to secondary care to avoid life-threatening diabetic ketoacidosis. However, early recognition of children with type 1 diabetes is challenging. Children might not present with classic symptoms, or symptoms might be attributed to more common conditions. A quarter of children present with diabetic ketoacidosis, a proportion unchanged over 25 years. Our aim was to investigate whether a machine-learning algorithm could lead to earlier detection of type 1 diabetes in primary care.

Methods: We developed the predictive algorithm using Welsh primary care electronic health records (EHRs) linked to the Brecon Dataset, a register of children newly diagnosed with type 1 diabetes. Children were included from their first primary care record within the study period of Jan 1, 2000, to Dec 31, 2016, until either type 1 diabetes diagnosis, they turned 15 years of age, or study end. We developed an ensemble learner (SuperLearner) using 26 potential predictors. Validation of the algorithm was done in English EHRs from the Clinical Practice Research Datalink (primary care) and Hospital Episode Statistics, focusing on the ability of the algorithm to identify children who went on to develop type 1 diabetes and the time by which diagnosis could be anticipated.

Findings: The development dataset comprised 34 754 400 primary care contacts, relating to 952 402 children, and the validation dataset comprised 43 089 103 primary care contacts, relating to 1 493 328 children. Of these, 1829 (0·19%) children younger than 15 years in the development dataset, and 1516 (0·10%) in the validation dataset had a reliable date of type 1 diabetes diagnosis. If set to give an alert in 10% of contacts, an estimated 71·6% (95% CI 68·8–74·4) of the children with type 1 diabetes would receive an alert by the algorithm in the 90 days before diagnosis, with diagnosis anticipated, on average, by an estimated 9·34 days (95% CI 7·77–10·9).

Interpretation: If implemented into primary care settings, this predictive algorithm could substantially reduce the proportion of patients with new-onset type 1 diabetes presenting in diabetic ketoacidosis. Acceptability of alert thresholds should be explored in primary care.

Funding: Diabetes UK.

© 2024 The Author(s). Published by Elsevier Ltd. This is an Open Access article under the CC BY-NC 4.0 license.
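The Findings describe an operating point at which the algorithm alerts in 10% of contacts, then report the share of diagnosed children alerted in the 90 days before diagnosis and how far ahead of diagnosis the alert came. The sketch below illustrates that kind of calculation on toy data. It is not the authors' code: the function names, the `(child_id, day, score)` contact layout, the rule that ties at the cut-off also alert, and the choice to average lead time over alerted children only are all assumptions made for illustration, since the abstract does not specify these details.

```python
def alert_threshold(scores, alert_fraction=0.10):
    """Score cut-off at which roughly `alert_fraction` of contacts alert.

    Assumption: contacts scoring exactly at the cut-off also alert, so ties
    can push the realised alert rate slightly above `alert_fraction`.
    """
    if not scores:
        raise ValueError("scores must not be empty")
    ranked = sorted(scores, reverse=True)
    n_alerts = max(1, round(alert_fraction * len(ranked)))
    return ranked[n_alerts - 1]


def alert_summary(contacts, diagnosis_day, threshold, window=90):
    """Summarise alerts among diagnosed children.

    contacts: iterable of (child_id, day, score), one row per contact
              (hypothetical layout).
    diagnosis_day: dict mapping child_id -> day of diagnosis.
    Returns (share of diagnosed children with an alert inside the window
    before diagnosis, mean days between first such alert and diagnosis).
    """
    first_alert = {}
    for child, day, score in contacts:
        dx = diagnosis_day.get(child)
        if dx is None or score < threshold:
            continue  # child never diagnosed, or contact did not alert
        if dx - window <= day < dx:
            # keep the earliest alerting contact inside the window
            first_alert[child] = min(day, first_alert.get(child, day))
    n_cases = len(diagnosis_day)
    share = len(first_alert) / n_cases if n_cases else 0.0
    leads = [diagnosis_day[c] - d for c, d in first_alert.items()]
    # Assumption: average over alerted children only.
    mean_lead = sum(leads) / len(leads) if leads else 0.0
    return share, mean_lead


# Toy data: ten contacts across five children, three of whom are diagnosed.
contacts = [
    ("a", 100, 0.90), ("a", 110, 0.20),
    ("b", 50, 0.10), ("b", 95, 0.80),
    ("c", 10, 0.05), ("c", 20, 0.30),
    ("d", 5, 0.02), ("d", 6, 0.01),
    ("e", 7, 0.03), ("e", 8, 0.04),
]
diagnosis_day = {"a": 120, "b": 100, "c": 200}

threshold = alert_threshold([s for _, _, s in contacts], alert_fraction=0.2)
share, mean_lead = alert_summary(contacts, diagnosis_day, threshold)
```

Setting the cut-off from the ranked scores rather than from a fixed probability keeps the alert burden per contact explicit, which is the quantity the Interpretation suggests exploring with primary care clinicians.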
Pages: e386–e395
Page count: 9