Finding missed cases of familial hypercholesterolemia in health systems using machine learning

被引:62
|
作者
Banda, Juan M. [1 ,2 ]
Sarraju, Ashish [3 ]
Abbasi, Fahim [3 ]
Parizo, Justin [3 ]
Pariani, Mitchel [3 ]
Ison, Hannah [3 ]
Briskin, Elinor [3 ]
Wand, Hannah [3 ]
Dubois, Sebastien [1 ]
Jung, Kenneth [1 ]
Myers, Seth A. [4 ]
Rader, Daniel J. [5 ,6 ]
Leader, Joseph B. [7 ]
Murray, Michael F. [8 ]
Myers, Kelly D. [4 ,6 ]
Wilemon, Katherine [6 ]
Shah, Nigam H. [1 ]
Knowles, Joshua W. [3 ,6 ,9 ]
机构
[1] Stanford Univ, Ctr Biomed Informat Res, Stanford, CA 94305 USA
[2] Georgia State Univ, Dept Comp Sci, Atlanta, GA 30303 USA
[3] Stanford Univ, Cardiovasc Med & Cardiovasc Inst, Stanford, CA 94305 USA
[4] Atomo Inc, Austin, TX USA
[5] Univ Penn, Perelman Sch Med, Philadelphia, PA 19104 USA
[6] FH Fdn, Pasadena, CA 91106 USA
[7] Geisinger Hlth Syst, Genom Med Inst, Forty Ft, PA USA
[8] Yale Univ, Ctr Genom Hlth, New Haven, CT USA
[9] Stanford Diabet Res Ctr, Stanford, CA 94305 USA
关键词
SAVANNA CHOICE; IDENTIFICATION; FUTURE; ZOO;
D O I
10.1038/s41746-019-0101-5
中图分类号
R19 [保健组织与事业(卫生事业管理)];
学科分类号
摘要
Familial hypercholesterolemia (FH) is an underdiagnosed dominant genetic condition affecting approximately 0.4% of the population and has up to a 20-fold increased risk of coronary artery disease if untreated. Simple screening strategies have false positive rates greater than 95%. As part of the FH Foundation's FIND FH initiative, we developed a classifier to identify potential FH patients using electronic health record (EHR) data at Stanford Health Care. We trained a random forest classifier using data from known patients (n = 197) and matched non-cases (n = 6590). Our classifier obtained a positive predictive value (PPV) of 0.88 and sensitivity of 0.75 on a held-out test-set. We evaluated the accuracy of the classifier's predictions by chart review of 100 patients at risk of FH not included in the original dataset. The classifier correctly flagged 84% of patients at the highest probability threshold, with decreasing performance as the threshold lowers. In external validation on 466 FH patients (236 with genetically proven FH) and 5000 matched non-cases from the Geisinger Healthcare System our FH classifier achieved a PPV of 0.85. Our EHR-derived FH classifier is effective in finding candidate patients for further FH screening. Such machine learning guided strategies can lead to effective identification of the highest risk patients for enhanced management strategies.
引用
收藏
页数:8
相关论文
共 50 条
  • [1] Finding missed cases of familial hypercholesterolemia in health systems using machine learning
    Juan M. Banda
    Ashish Sarraju
    Fahim Abbasi
    Justin Parizo
    Mitchel Pariani
    Hannah Ison
    Elinor Briskin
    Hannah Wand
    Sebastien Dubois
    Kenneth Jung
    Seth A. Myers
    Daniel J. Rader
    Joseph B. Leader
    Michael F. Murray
    Kelly D. Myers
    Katherine Wilemon
    Nigam H. Shah
    Joshua W. Knowles
    npj Digital Medicine, 2
  • [2] Applications of machine learning in familial hypercholesterolemia
    Luo, Ren-Fei
    Wang, Jing-Hui
    Hu, Li-Juan
    Fu, Qing-An
    Zhang, Si-Yi
    Jiang, Long
    FRONTIERS IN CARDIOVASCULAR MEDICINE, 2023, 10
  • [3] Improving the Detection of Potential Cases of Familial Hypercholesterolemia: Could Machine Learning Be Part of the Solution?
    Stevens, Christophe A. T.
    Vallejo-Vaz, Antonio J.
    Chora, Joana R.
    Barkas, Fotis
    Brandts, Julia
    Mahani, Alireza
    Abar, Leila
    Sharabiani, Mansour T. A.
    Ray, Kausik K.
    JOURNAL OF THE AMERICAN HEART ASSOCIATION, 2024, 13 (12):
  • [4] Using Healthcare Claims Data and Machine Learning to Identify Health Disparities for Individuals With Diagnosed and Undiagnosed Familial Hypercholesterolemia
    Ahmad, Zahid
    Xing, Chao
    Khera, Amit
    Huang, Chun-Yuan
    Brandt, Eric
    MacDougall, Dianne
    Ahmed, Catherine D.
    McGowan, Mary P.
    Wilemon, Katherine A.
    Myers, Kelly
    CIRCULATION, 2022, 146
  • [5] Using Healthcare Claims Data and Machine Learning to Identify Health Disparities for Individuals with Diagnosed and Undiagnosed Familial Hypercholesterolemia
    Ahmad, Zahid
    Xing, Chao
    Khera, Amit
    Huang, Chun-Yuan
    Brandt, Eric J.
    MacDougall, Diane E.
    Ahmed, Catherine D.
    McGowan, Mary P.
    Paige, Hilly
    Wilemon, Katherine A.
    Myers, Kelly D.
    METABOLISM-CLINICAL AND EXPERIMENTAL, 2023, 142 : S36 - S36
  • [6] Virtual genetic diagnosis for familial hypercholesterolemia powered by machine learning
    Pina, Ana
    Helgadottir, Saga
    Mancina, Rosellina Margherita
    Pavanello, Chiara
    Pirazzi, Carlo
    Montalcini, Tiziana
    Henriques, Roberto
    Calabresi, Laura
    Wiklund, Olov
    Macedo, M. Paula
    Valenti, Luca
    Volpe, Giovanni
    Romeo, Stefano
    EUROPEAN JOURNAL OF PREVENTIVE CARDIOLOGY, 2020, 27 (15) : 1639 - 1646
  • [7] A machine-learning algorithm using claims data to identify patients with homozygous familial hypercholesterolemia
    Gu, Jing
    Epland, Matthew
    Ma, Xinshuo
    Park, Jina
    Sanchez, Robert J.
    Li, Ying
    SCIENTIFIC REPORTS, 2024, 14 (01)
  • [8] FINDING UNDIAGNOSED PATIENTS WITH FAMILIAL HYPERCHOLESTEROLEMIA IN PRIMARY CARE USINGELECTRONIC HEALTH RECORDS
    Williamson, Latoya
    Wojcik, Cezary
    Taunton, Marshall
    McElheran, Kylie
    Howard, William
    Staszak, Dave
    Gidding, Samuel
    Wilemon, Katherine
    Myers, Kelly
    Shapiro, Michael D.
    JOURNAL OF THE AMERICAN COLLEGE OF CARDIOLOGY, 2020, 75 (11) : 3502 - 3502
  • [9] Prediction of hypercholesterolemia using machine learning techniques
    Pooyan Moradifar
    Mohammad Meskarpour Amiri
    Journal of Diabetes & Metabolic Disorders, 2023, 22 : 255 - 265
  • [10] Prediction of hypercholesterolemia using machine learning techniques
    Moradifar, Pooyan
    Amiri, Mohammad Meskarpour
    JOURNAL OF DIABETES AND METABOLIC DISORDERS, 2023, 22 (01) : 255 - 265