Cluster analysis and visualisation of electronic health records data to identify undiagnosed patients with rare genetic diseases

被引:3
|
作者
Moynihan, Daniel [1 ]
Monaco, Sean [2 ]
Ting, Teck Wah [3 ,4 ]
Narasimhalu, Kaavya [4 ,5 ]
Hsieh, Jenny [4 ,6 ]
Kam, Sylvia [3 ,4 ]
Lim, Jiin Ying [3 ,4 ]
Lim, Weng Khong [4 ,7 ,8 ,9 ]
Davila, Sonia [4 ,7 ]
Bylstra, Yasmin [4 ,7 ]
Balakrishnan, Iswaree Devi [4 ,10 ]
Heng, Mark [11 ]
Chia, Elian [11 ]
Yeo, Khung Keong [10 ]
Goh, Bee Keow [12 ]
Gupta, Ritu [1 ]
Tan, Tele [1 ]
Baynam, Gareth [13 ,14 ]
Jamuar, Saumya Shekhar [3 ,4 ,7 ]
机构
[1] Curtin Univ, Perth, Australia
[2] Hlth Catalyst, South Jordan, UT USA
[3] KK Womens & Childrens Hosp, Dept Paediat, Genet Serv, 100 Bukit Timah Rd, Singapore 229899, Singapore
[4] SingHealth Duke NUS Genom Med Ctr, Singapore, Singapore
[5] Singapore Gen Hosp, Natl Neurosci Inst, Dept Neurol, Singapore, Singapore
[6] Singapore Gen Hosp, Dept Internal Med, Singapore, Singapore
[7] SingHealth Duke NUS Inst Precis Med, Singapore, Singapore
[8] Duke NUS Med Sch, Canc & Stem Cell Biol Program, Singapore, Singapore
[9] Genome Inst Singapore, Lab Genome Variat Analyt, Singapore, Singapore
[10] Natl Heart Ctr Singapore, Singapore, Singapore
[11] SingHealth Off Insights & Analyt, Singapore, Singapore
[12] KK Womens & Childrens Hosp, Data Analyt Off, Singapore, Singapore
[13] Perth Childrens Hosp, Rare Care Ctr, Perth, WA, Australia
[14] Western Australian Register Dev Anomalies, Perth, WA, Australia
关键词
FABRY-DISEASE;
D O I
10.1038/s41598-024-55424-8
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Rare genetic diseases affect 5-8% of the population but are often undiagnosed or misdiagnosed. Electronic health records (EHR) contain large amounts of data, which provide opportunities for analysing and mining. Data mining, in the form of cluster analysis and visualisation, was performed on a database containing deidentified health records of 1.28 million patients across 3 major hospitals in Singapore, in a bid to improve the diagnostic process for patients who are living with an undiagnosed rare disease, specifically focusing on Fabry Disease and Familial Hypercholesterolaemia (FH). On a baseline of 4 patients, we identified 2 additional patients with potential diagnosis of Fabry disease, suggesting a potential 50% increase in diagnosis. Similarly, we identified > 12,000 individuals who fulfil the clinical and laboratory criteria for FH but had not been diagnosed previously. This proof-of-concept study showed that it is possible to perform mining on EHR data albeit with some challenges and limitations.
引用
收藏
页数:9
相关论文
共 50 条
  • [21] Detection of probable dementia cases in undiagnosed patients using structured and unstructured electronic health records
    Shao, Yijun
    Zeng, Qing T.
    Chen, Kathryn K.
    Shutes-David, Andrew
    Thielke, Stephen M.
    Tsuang, Debby W.
    BMC MEDICAL INFORMATICS AND DECISION MAKING, 2019, 19 (1)
  • [22] Detection of probable dementia cases in undiagnosed patients using structured and unstructured electronic health records
    Yijun Shao
    Qing T. Zeng
    Kathryn K. Chen
    Andrew Shutes-David
    Stephen M. Thielke
    Debby W. Tsuang
    BMC Medical Informatics and Decision Making, 19
  • [23] Using Electronic Health Records To Identify Patients With Apparent Treatment Resistant Hypertension
    Solanki, Priyanka
    Ajmal, Imran
    Ding, Xiruo
    Cohen, Jordana
    Cohen, Debbie
    Herman, Daniel
    HYPERTENSION, 2020, 76
  • [24] Leveraging unstructured data to identify hereditary angioedema patients in electronic medical records
    Brouwer, Emily S.
    Bratton, Emily W.
    Near, Aimee M.
    Sanders, Lynn
    Mack, Christina D.
    ALLERGY ASTHMA AND CLINICAL IMMUNOLOGY, 2021, 17 (01):
  • [25] Leveraging unstructured data to identify hereditary angioedema patients in electronic medical records
    Emily S. Brouwer
    Emily W. Bratton
    Aimee M. Near
    Lynn Sanders
    Christina D. Mack
    Allergy, Asthma & Clinical Immunology, 17
  • [26] Putting patients in control of data from electronic health records
    New, John P.
    Leather, David
    Bakerly, Nawar Diar
    McCrae, John
    Gibson, J. Martin
    BMJ-BRITISH MEDICAL JOURNAL, 2018, 360 : j5554
  • [27] Artificial intelligence processing electronic health records to identify commonalities and comorbidities cluster at Immuno Center Humanitas
    Morandini, Pierandrea
    Laino, Maria Elena
    Paoletti, Giovanni
    Carlucci, Alessandro
    Tommasini, Tobia
    Angelotti, Giovanni
    Pepys, Jack
    Canonica, Giorgio Walter
    Heffler, Enrico
    Savevski, Victor
    Puggioni, Francesca
    CLINICAL AND TRANSLATIONAL ALLERGY, 2022, 12 (06)
  • [28] An algorithm-based approach to ascertain patients with rare diseases in electronic health records using hypereosinophilic syndrome as an example
    Requena, Gema
    Joksaite, Sandra
    Galwey, Nicholas
    Jakes, Rupert W. W.
    PHARMACOEPIDEMIOLOGY AND DRUG SAFETY, 2023, 32 (11) : 1261 - 1270
  • [29] Leveraging Electronic Health Record Studies to Identify Shared Genetic Architecture of Eye Diseases
    Scalici, Alexandra
    Miller-Fleming, Tyne W.
    Hirbo, Jibril
    Knapik, Ela W.
    Cox, Nancy J.
    GENETIC EPIDEMIOLOGY, 2022, 46 (07) : 529 - 530
  • [30] External Validation of an Algorithm to Identify Patients with High Data-Completeness in Electronic Health Records for Comparative Effectiveness Research
    Lin, Kueiyu Joshua
    Rosenthal, Gary E.
    Murphy, Shawn N.
    Mandl, Kenneth D.
    Jin, Yinzhu
    Glynn, Robert J.
    Schneeweiss, Sebastian
    CLINICAL EPIDEMIOLOGY, 2020, 12 : 133 - 141