Finding missed cases of familial hypercholesterolemia in health systems using machine learning

Authors
Juan M. Banda
Ashish Sarraju
Fahim Abbasi
Justin Parizo
Mitchel Pariani
Hannah Ison
Elinor Briskin
Hannah Wand
Sebastien Dubois
Kenneth Jung
Seth A. Myers
Daniel J. Rader
Joseph B. Leader
Michael F. Murray
Kelly D. Myers
Katherine Wilemon
Nigam H. Shah
Joshua W. Knowles
Affiliations
[1] Stanford University, Center for Biomedical Informatics Research
[2] Georgia State University, Department of Computer Science
[3] Stanford University, Cardiovascular Medicine and Cardiovascular Institute
[4] Atomo, Geisinger Health System
[5] Inc, Center for Genomic Health
[6] Perelman School of Medicine at the University of Pennsylvania
[7] The FH Foundation
[8] Genomic Medicine Institute
[9] Yale University
[10] Stanford Diabetes Research Center
Abstract
Familial hypercholesterolemia (FH) is an underdiagnosed dominant genetic condition that affects approximately 0.4% of the population and carries up to a 20-fold increased risk of coronary artery disease if untreated. Simple screening strategies have false positive rates greater than 95%. As part of the FH Foundation's FIND FH initiative, we developed a classifier to identify potential FH patients using electronic health record (EHR) data at Stanford Health Care. We trained a random forest classifier using data from known patients (n = 197) and matched non-cases (n = 6590). Our classifier obtained a positive predictive value (PPV) of 0.88 and a sensitivity of 0.75 on a held-out test set. We evaluated the accuracy of the classifier's predictions by chart review of 100 patients at risk of FH who were not included in the original dataset. The classifier correctly flagged 84% of patients at the highest probability threshold, with performance decreasing as the threshold was lowered. In external validation on 466 FH patients (236 with genetically proven FH) and 5000 matched non-cases from the Geisinger Healthcare System, our FH classifier achieved a PPV of 0.85. Our EHR-derived FH classifier is effective in finding candidate patients for further FH screening. Such machine learning-guided strategies can lead to effective identification of the highest-risk patients for enhanced management strategies.
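The two metrics reported in the abstract, PPV and sensitivity, both depend on the probability threshold at which a patient is flagged. The following is a minimal Python sketch of that relationship, not the authors' implementation: the function name `ppv_and_sensitivity`, the labels, and the scores are all hypothetical and made up for illustration. In practice the scores would come from a trained classifier such as a random forest.

```python
from typing import Sequence, Tuple


def ppv_and_sensitivity(
    y_true: Sequence[int], scores: Sequence[float], threshold: float
) -> Tuple[float, float]:
    """Return (PPV, sensitivity) when every score >= threshold is flagged.

    PPV         = TP / (TP + FP): fraction of flagged patients who are true cases.
    Sensitivity = TP / (TP + FN): fraction of true cases that are flagged.
    """
    tp = fp = fn = 0
    for label, score in zip(y_true, scores):
        flagged = score >= threshold
        if flagged and label == 1:
            tp += 1
        elif flagged and label == 0:
            fp += 1
        elif not flagged and label == 1:
            fn += 1
    ppv = tp / (tp + fp) if (tp + fp) else float("nan")
    sensitivity = tp / (tp + fn) if (tp + fn) else float("nan")
    return ppv, sensitivity


# Hypothetical toy data (NOT from the study): 1 = FH case, 0 = non-case.
labels = [1, 1, 1, 1, 0, 0, 0, 0, 0, 0]
scores = [0.95, 0.85, 0.60, 0.30, 0.70, 0.40, 0.35, 0.20, 0.10, 0.05]

for threshold in (0.8, 0.5, 0.25):
    ppv, sens = ppv_and_sensitivity(labels, scores, threshold)
    print(f"threshold={threshold:.2f}  PPV={ppv:.2f}  sensitivity={sens:.2f}")
```

On this toy data, lowering the threshold flags more true cases (higher sensitivity) but also more non-cases (lower PPV), which is the same kind of trade-off the chart review describes across probability thresholds.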