Cluster analysis and visualisation of electronic health records data to identify undiagnosed patients with rare genetic diseases

被引:3
|
作者
Moynihan, Daniel [1 ]
Monaco, Sean [2 ]
Ting, Teck Wah [3 ,4 ]
Narasimhalu, Kaavya [4 ,5 ]
Hsieh, Jenny [4 ,6 ]
Kam, Sylvia [3 ,4 ]
Lim, Jiin Ying [3 ,4 ]
Lim, Weng Khong [4 ,7 ,8 ,9 ]
Davila, Sonia [4 ,7 ]
Bylstra, Yasmin [4 ,7 ]
Balakrishnan, Iswaree Devi [4 ,10 ]
Heng, Mark [11 ]
Chia, Elian [11 ]
Yeo, Khung Keong [10 ]
Goh, Bee Keow [12 ]
Gupta, Ritu [1 ]
Tan, Tele [1 ]
Baynam, Gareth [13 ,14 ]
Jamuar, Saumya Shekhar [3 ,4 ,7 ]
机构
[1] Curtin Univ, Perth, Australia
[2] Hlth Catalyst, South Jordan, UT USA
[3] KK Womens & Childrens Hosp, Dept Paediat, Genet Serv, 100 Bukit Timah Rd, Singapore 229899, Singapore
[4] SingHealth Duke NUS Genom Med Ctr, Singapore, Singapore
[5] Singapore Gen Hosp, Natl Neurosci Inst, Dept Neurol, Singapore, Singapore
[6] Singapore Gen Hosp, Dept Internal Med, Singapore, Singapore
[7] SingHealth Duke NUS Inst Precis Med, Singapore, Singapore
[8] Duke NUS Med Sch, Canc & Stem Cell Biol Program, Singapore, Singapore
[9] Genome Inst Singapore, Lab Genome Variat Analyt, Singapore, Singapore
[10] Natl Heart Ctr Singapore, Singapore, Singapore
[11] SingHealth Off Insights & Analyt, Singapore, Singapore
[12] KK Womens & Childrens Hosp, Data Analyt Off, Singapore, Singapore
[13] Perth Childrens Hosp, Rare Care Ctr, Perth, WA, Australia
[14] Western Australian Register Dev Anomalies, Perth, WA, Australia
关键词
FABRY-DISEASE;
D O I
10.1038/s41598-024-55424-8
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Rare genetic diseases affect 5-8% of the population but are often undiagnosed or misdiagnosed. Electronic health records (EHR) contain large amounts of data, which provide opportunities for analysing and mining. Data mining, in the form of cluster analysis and visualisation, was performed on a database containing deidentified health records of 1.28 million patients across 3 major hospitals in Singapore, in a bid to improve the diagnostic process for patients who are living with an undiagnosed rare disease, specifically focusing on Fabry Disease and Familial Hypercholesterolaemia (FH). On a baseline of 4 patients, we identified 2 additional patients with potential diagnosis of Fabry disease, suggesting a potential 50% increase in diagnosis. Similarly, we identified > 12,000 individuals who fulfil the clinical and laboratory criteria for FH but had not been diagnosed previously. This proof-of-concept study showed that it is possible to perform mining on EHR data albeit with some challenges and limitations.
引用
收藏
页数:9
相关论文
共 50 条
  • [41] Imputation of Missing Data in Electronic Health Records Based on Patients’ Similarities
    Ali Jazayeri
    Ou Stella Liang
    Christopher C. Yang
    Journal of Healthcare Informatics Research, 2020, 4 : 295 - 307
  • [42] Biases introduced by filtering electronic health records for patients with "complete data"
    Weber, Griffin M.
    Adams, William G.
    Bernstam, Elmer V.
    Bickel, Jonathan P.
    Fox, Kathe P.
    Marsolo, Keith
    Raghavan, Vijay A.
    Turchin, Alexander
    Zhou, Xiaobo
    Murphy, Shawn N.
    Mandl, Kenneth D.
    JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 2017, 24 (06) : 1134 - 1141
  • [43] Toward a Model-Based Patient Pre-Selection Tool to Identify Suspected Undiagnosed Albuminuria Using Electronic Health Records
    Svangard, Nils
    Hildeman, Anders G.
    Greasley, Peter J.
    Khader, Shameer
    Ambery, Philip D.
    JOURNAL OF THE AMERICAN SOCIETY OF NEPHROLOGY, 2022, 33 (11): : 981 - 981
  • [44] Using Electronic Health Records and Claims Data to Identify High-risk Patients Likely to Benefit From Palliative Care
    Guo, Aixia
    Foraker, Randi
    White, Patrick
    Chivers, Corey
    Courtright, Katherine
    Moore, Nathan
    AMERICAN JOURNAL OF MANAGED CARE, 2021, 27 (01): : E7 - +
  • [45] Challenges and opportunities beyond structured data in analysis of electronic health records
    Tayefi, Maryam
    Ngo, Phuong
    Chomutare, Taridzo
    Dalianis, Hercules
    Salvi, Elisa
    Budrionis, Andrius
    Godtliebsen, Fred
    WILEY INTERDISCIPLINARY REVIEWS-COMPUTATIONAL STATISTICS, 2021, 13 (06)
  • [46] Multimodal Data Analysis and Visualization to Study the Usage of Electronic Health Records
    Weibel, Nadir
    Ashfaq, Shazia
    Calvitti, Alan
    Hollan, James D.
    Agha, Zia
    PROCEEDINGS OF THE 2013 7TH INTERNATIONAL CONFERENCE ON PERVASIVE COMPUTING TECHNOLOGIES FOR HEALTHCARE AND WORKSHOPS (PERVASIVEHEALTH 2013), 2013, : 282 - 283
  • [47] Risk prediction of heart diseases in patients with breast cancer: A deep learning approach with longitudinal electronic health records data
    Zhou, Sicheng
    Blaes, Anne
    Shenoy, Chetan
    Sun, Ju
    Zhang, Rui
    ISCIENCE, 2024, 27 (07)
  • [48] Visual Analytics for Dimension Reduction and Cluster Analysis of High Dimensional Electronic Health Records
    Abdullah, Sheikh S.
    Rostamzadeh, Neda
    Sedig, Kamran
    Garg, Amit X.
    McArthur, Eric
    INFORMATICS-BASEL, 2020, 7 (02):
  • [49] Using Electronic Health Records to Build an Ophthalmologic Data Warehouse and Visualize Patients' Data
    Kortuem, Karsten U.
    Mueller, Michael
    Kern, Christoph
    Babenko, Alexander
    Mayer, Wolfgang J.
    Kampik, Anselm
    Kreutzer, Thomas C.
    Priglinger, Siegfried
    Hirneiss, Christoph
    AMERICAN JOURNAL OF OPHTHALMOLOGY, 2017, 178 : 84 - 93
  • [50] Solving patients with rare diseases within Telethon Undiagnosed Disease Program through reanalysis of exomephenome data
    Morleo, Manuela
    Torella, Annalaura
    Pinelli, Michele
    Spampanato, Carmine
    Banfi, Sandro
    Romano, Francesca
    Tirozzi, Alfonsina
    Nigro, Vincenzo
    EUROPEAN JOURNAL OF HUMAN GENETICS, 2023, 31 : 16 - 16