Finding missed cases of familial hypercholesterolemia in health systems using machine learning

被引:0
|
作者
Juan M. Banda
Ashish Sarraju
Fahim Abbasi
Justin Parizo
Mitchel Pariani
Hannah Ison
Elinor Briskin
Hannah Wand
Sebastien Dubois
Kenneth Jung
Seth A. Myers
Daniel J. Rader
Joseph B. Leader
Michael F. Murray
Kelly D. Myers
Katherine Wilemon
Nigam H. Shah
Joshua W. Knowles
机构
[1] Stanford University,Center for Biomedical Informatics Research
[2] Georgia State University,Department of Computer Science
[3] Stanford University,Cardiovascular Medicine and Cardiovascular Institute
[4] Atomo,Geisinger Health System
[5] Inc,Center for Genomic Health
[6] Perelman School of Medicine at the University of Pennsylvania,undefined
[7] The FH Foundation,undefined
[8] Genomic Medicine Institute,undefined
[9] Yale University,undefined
[10] Stanford Diabetes Research Center,undefined
来源
关键词
D O I
暂无
中图分类号
学科分类号
摘要
Familial hypercholesterolemia (FH) is an underdiagnosed dominant genetic condition affecting approximately 0.4% of the population and has up to a 20-fold increased risk of coronary artery disease if untreated. Simple screening strategies have false positive rates greater than 95%. As part of the FH Foundation′s FIND FH initiative, we developed a classifier to identify potential FH patients using electronic health record (EHR) data at Stanford Health Care. We trained a random forest classifier using data from known patients (n = 197) and matched non-cases (n = 6590). Our classifier obtained a positive predictive value (PPV) of 0.88 and sensitivity of 0.75 on a held-out test-set. We evaluated the accuracy of the classifier′s predictions by chart review of 100 patients at risk of FH not included in the original dataset. The classifier correctly flagged 84% of patients at the highest probability threshold, with decreasing performance as the threshold lowers. In external validation on 466 FH patients (236 with genetically proven FH) and 5000 matched non-cases from the Geisinger Healthcare System our FH classifier achieved a PPV of 0.85. Our EHR-derived FH classifier is effective in finding candidate patients for further FH screening. Such machine learning guided strategies can lead to effective identification of the highest risk patients for enhanced management strategies.
引用
收藏
相关论文
共 50 条
  • [41] Developing a Hybrid Risk Assessment Tool for Familial Hypercholesterolemia: A Machine Learning Study of Chinese Arteriosclerotic Cardiovascular Disease Patients
    Wang, Lei
    Guo, Jian
    Tian, Zhuang
    Seery, Samuel
    Jin, Ye
    Zhang, Shuyang
    FRONTIERS IN CARDIOVASCULAR MEDICINE, 2022, 9
  • [42] Tractor Assistance Systems Using Machine Learning
    Riedl, Johannes
    Riedl, Johannes, 1600, Springer Vieweg (13) : 50 - 55
  • [43] USING MACHINE LEARNING FOR INTRUSION DETECTION SYSTEMS
    Quang-Vinh Dang
    COMPUTING AND INFORMATICS, 2022, 41 (01) : 12 - 33
  • [44] Using Machine Learning to Secure IoT Systems
    Canedo, Janice
    Skjellum, Anthony
    2016 14TH ANNUAL CONFERENCE ON PRIVACY, SECURITY AND TRUST (PST), 2016,
  • [45] Improving Storage Systems Using Machine Learning
    Akgun, Ibrahim Umit
    Aydin, Ali Selman
    Burford, Andrew
    McNeill, Michael
    Arkhangelskiy, Michael
    Zadok, Erez
    ACM TRANSACTIONS ON STORAGE, 2023, 19 (01)
  • [46] Diagnostic Classification of Cases of Canine Leishmaniasis Using Machine Learning
    Ferreira, Tiago S.
    Santana, Ewaldo E. C.
    Jacob Junior, Antonio F. L.
    Silva Junior, Paulo F.
    Bastos, Luciana S.
    Silva, Ana L. A.
    Melo, Solange A.
    Cruz, Carlos A. M.
    Aquino, Vivianne S.
    Castro, Luis S. O.
    Lima, Guilherme O.
    Freire, Raimundo C. S.
    SENSORS, 2022, 22 (09)
  • [47] A Novel Software Engineering Approach Toward Using Machine Learning for Improving the Efficiency of Health Systems
    Moreb, Mohammed
    Mohammed, Tareq Abed
    Bayat, Oguz
    IEEE ACCESS, 2020, 8 : 23169 - 23178
  • [48] Finding optimal strategies for river quality assessment using machine learning and deep learning models
    Zamri, Nurnadiah
    Pairan, Mohamad Ammar
    Azman, Wan Nur Amira Wan
    Gao, Miaomiao
    MODELING EARTH SYSTEMS AND ENVIRONMENT, 2023, 9 (01) : 615 - 629
  • [49] Embedded Machine Learning Using Microcontrollers in Wearable and Ambulatory Systems for Health and Care Applications: A Review
    Diab, Maha S.
    Rodriguez-Villegas, Esther
    IEEE ACCESS, 2022, 10 : 98450 - 98474
  • [50] Finding optimal strategies for river quality assessment using machine learning and deep learning models
    Nurnadiah Zamri
    Mohamad Ammar Pairan
    Wan Nur Amira Wan Azman
    Miaomiao Gao
    Modeling Earth Systems and Environment, 2023, 9 : 615 - 629