Predicting Pediatric Genetic Epilepsy Through Electronic Medical Records: A Data-Driven Biomarker Discovery Approach

被引:0
|
作者
Li, Yi [1 ]
机构
[1] Stanford Univ, Dept Neurol & Neurol Sci, Stanford, CA 94305 USA
关键词
D O I
10.1177/15357597241290322
中图分类号
R74 [神经病学与精神病学];
学科分类号
摘要
Purpose: An early genetic diagnosis can guide the time-sensitive treatment of individuals with genetic epilepsies. However, most genetic diagnoses occur long after disease onset. We aimed to identify early clinical features suggestive of genetic diagnoses in individuals with epilepsy through large-scale analysis of full-text electronic medical records (EMRs). Methods: We extracted 89 million time-stamped standardized clinical annotations using Natural Language Processing from 4,572,783 clinical notes from 32 112 individuals with childhood epilepsy, including 1925 individuals with known or presumed genetic epilepsies. We applied these features to train random forest models to predict SCN1A-related disorders and any genetic diagnosis. Results: We identified 47 774 age-dependent associations of clinical features with genetic etiologies a median of 3.6 years prior to molecular diagnosis. Across all 710 genetic etiologies identified in our cohort, neurodevelopmental differences between 6 and 9 months increased the likelihood of a later molecular diagnosis fivefold (P < .0001, 95% CI = 3.55-7.42). A later diagnosis of SCN1A-related disorders (AUC = 0.91) or an overall positive genetic diagnosis (AUC = 0.82) could be reliably predicted using random forest models. Conclusion: Clinical features predictive of genetic epilepsies precede molecular diagnoses by up to several years in conditions with known precision treatments. An earlier diagnosis facilitated by automated EMR analysis has the potential for earlier targeted therapeutic strategies in the genetic epilepsies.
引用
收藏
页码:42 / 44
页数:3
相关论文
共 50 条
  • [41] The data-driven discovery of partial differential equations by symbolic genetic algorithm
    Sun, Shifei
    Tian, Shifang
    Wang, Yuduo
    Li, Biao
    NONLINEAR DYNAMICS, 2024, 112 (22) : 19871 - 19885
  • [42] A Data-Driven Approach to Predicting Successes and Failures of Clinical Trials
    Gayvert, Kaitlyn M.
    Madhukar, Neel S.
    Elemento, Olivier
    CELL CHEMICAL BIOLOGY, 2016, 23 (10) : 1294 - 1301
  • [43] A data-driven approach for predicting failure scenarios in nuclear systems
    Zio, Enrico
    Di Maio, Francesco
    Stasi, Marco
    ANNALS OF NUCLEAR ENERGY, 2010, 37 (04) : 482 - 491
  • [44] Identifying data-driven subtypes of major depressive disorder with electronic health records
    Sharma, Abhishek
    Verhaak, Pilar F.
    McCoy, Thomas H.
    Perlis, Roy H.
    Doshi-Velez, Finale
    JOURNAL OF AFFECTIVE DISORDERS, 2024, 356 : 64 - 70
  • [45] A Comparison of Methods for Data-Driven Cancer Outlier Discovery, and An Application Scheme to Semisupervised Predictive Biomarker Discovery
    Karrila, Seppo
    Lee, Julian Hock Ean
    Tucker-Kellogg, Greg
    CANCER INFORMATICS, 2011, 10 : 109 - 120
  • [46] A BIOMARKER APPROACH TO DATA-DRIVEN IDENTIFICATION OF ENDOTYPES IN KNEE OA PATIENTS
    Lisowska-Petersen, Z.
    Hannani, M. Toft
    Karsdal, M.
    Bager, C.
    Bay-Jensen, A. C.
    Thudium, C.
    ANNALS OF THE RHEUMATIC DISEASES, 2023, 82 : 1026 - 1026
  • [47] Assessing seizure burden in pediatric epilepsy using an electronic medical record-based tool through a common data element approach
    Fitzgerald, Mark P.
    Kaufman, Michael C.
    Massey, Shavonne L.
    Fridinger, Sara
    Prelack, Marisa
    Ellis, Colin
    Ortiz-Gonzalez, Xilma
    Fried, Lawrence E.
    DiGiovine, Marissa P.
    Melamed, Susan
    Malcolm, Marissa
    Banwell, Brenda
    Stephenson, Donna
    Witzman, Stephanie M.
    Gonzalez, Alexander
    Dlugos, Dennis
    Kessler, Sudha Kilaru
    Goldberg, Ethan M.
    Abend, Nicholas S.
    Helbig, Ingo
    EPILEPSIA, 2021, 62 (07) : 1617 - 1628
  • [48] A new frontier in drug discovery for skin cancer through electronic medical records
    Dousset, Lea
    Khosrotehrani, Kiarash
    BRITISH JOURNAL OF DERMATOLOGY, 2025, 192 (04) : 570 - 571
  • [49] Predicting Urban Water Quality With Ubiquitous Data-A Data-Driven Approach
    Liu, Ye
    Liang, Yuxuan
    Ouyang, Kun
    Liu, Shuming
    Rosenblum, David S.
    Zheng, Yu
    IEEE TRANSACTIONS ON BIG DATA, 2022, 8 (02) : 564 - 578
  • [50] Ensemble feature selection with data-driven thresholding for Alzheimer's disease biomarker discovery
    Spooner, Annette
    Mohammadi, Gelareh
    Sachdev, Perminder S.
    Brodaty, Henry
    Sowmya, Arcot
    BMC BIOINFORMATICS, 2023, 24 (01)