Learning a Health Knowledge Graph from Electronic Medical Records

被引:258
|
作者
Rotmensch, Maya [1 ]
Halpern, Yoni [2 ]
Tlimat, Abdulhakim [3 ]
Horng, Steven [3 ,4 ]
Sontag, David [5 ,6 ]
机构
[1] NYU, Ctr Data Sci, New York, NY USA
[2] NYU, Dept Comp Sci, New York, NY USA
[3] Beth Israel Deaconess Med Ctr, Dept Emergency Med, Boston, MA 02215 USA
[4] Beth Israel Deaconess Med Ctr, Div Clin Informat, Boston, MA 02215 USA
[5] MIT, Dept Elect Engn & Comp Sci, Comp Sci & Artificial Intelligence Lab, 77 Massachusetts Ave, Cambridge, MA 02139 USA
[6] MIT, Inst Med Engn & Sci, 77 Massachusetts Ave, Cambridge, MA 02139 USA
来源
SCIENTIFIC REPORTS | 2017年 / 7卷
关键词
DIAGNOSIS; PROGRAM;
D O I
10.1038/s41598-017-05778-z
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Demand for clinical decision support systems in medicine and self-diagnostic symptom checkers has substantially increased in recent years. Existing platforms rely on knowledge bases manually compiled through a labor-intensive process or automatically derived using simple pairwise statistics. This study explored an automated process to learn high quality knowledge bases linking diseases and symptoms directly from electronic medical records. Medical concepts were extracted from 273,174 de-identified patient records and maximum likelihood estimation of three probabilistic models was used to automatically construct knowledge graphs: logistic regression, naive Bayes classifier and a Bayesian network using noisy OR gates. A graph of disease-symptom relationships was elicited from the learned parameters and the constructed knowledge graphs were evaluated and validated, with permission, against Google's manually-constructed knowledge graph and against expert physician opinions. Our study shows that direct and automated construction of high quality health knowledge graphs from medical records using rudimentary concept extraction is feasible. The noisy OR model produces a high quality knowledge graph reaching precision of 0.85 for a recall of 0.6 in the clinical evaluation. Noisy OR significantly outperforms all tested models across evaluation frameworks (p < 0.01).
引用
收藏
页数:11
相关论文
共 50 条
  • [21] Marrying Medical Domain Knowledge With Deep Learning on Electronic Health Records: A Deep Visual Analytics Approach
    Li, Rui
    Yin, Changchang
    Yang, Samuel
    Qian, Buyue
    Zhang, Ping
    JOURNAL OF MEDICAL INTERNET RESEARCH, 2020, 22 (09)
  • [22] Organization of Solution Knowledge Graph from Collaborative Learning Records
    Watanabe, Yuki
    Kojiri, Tomoko
    Watanabe, Toyohide
    KNOWLEDGE-BASED AND INTELLIGENT INFORMATION AND ENGINEERING SYSTEMS, PT II, PROCEEDINGS, 2009, 5712 : 564 - 571
  • [23] Learning Treatment Regimens from Electronic Medical Records
    Hoang, Khanh Hung
    Ho, Tu Bao
    ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PAKDD 2018, PT I, 2018, 10937 : 411 - 422
  • [24] Personalized Federated Graph Learning on Non-IID Electronic Health Records
    Tang, Tao
    Han, Zhuoyang
    Cai, Zhen
    Yu, Shuo
    Zhou, Xiaokang
    Oseni, Taiwo
    Das, Sajal K.
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, 35 (09) : 11843 - 11856
  • [25] Knowledge graph prediction of unknown adverse drug reactions and validation in electronic health records
    Daniel M. Bean
    Honghan Wu
    Ehtesham Iqbal
    Olubanke Dzahini
    Zina M. Ibrahim
    Matthew Broadbent
    Robert Stewart
    Richard J. B. Dobson
    Scientific Reports, 7
  • [26] Knowledge graph prediction of unknown adverse drug reactions and validation in electronic health records
    Bean, Daniel M.
    Wu, Honghan
    Dzahini, Olubanke
    Broadbent, Matthew
    Stewart, Robert
    Dobson, Richard J. B.
    SCIENTIFIC REPORTS, 2017, 7
  • [27] Using Knowledge Graph Structures for Semantic Interoperability in Electronic Health Records Data Exchanges
    Sachdeva, Shelly
    Bhalla, Subhash
    INFORMATION, 2022, 13 (02)
  • [28] Learning Electronic Health Records through Hyperbolic Embedding of Medical Ontologies
    Lu, Qiuhao
    de Silva, Nisansa
    Kafle, Sabin
    Cao, Jiazhen
    Dou, Dejing
    Thien Huu Nguyen
    Sen, Prithviraj
    Hailpern, Brent
    Reinwald, Berthold
    Li, Yunyao
    ACM-BCB'19: PROCEEDINGS OF THE 10TH ACM INTERNATIONAL CONFERENCE ON BIOINFORMATICS, COMPUTATIONAL BIOLOGY AND HEALTH INFORMATICS, 2019, : 338 - 346
  • [29] PDD Graph: Bridging Electronic Medical Records and Biomedical Knowledge Graphs via Entity Linking
    Wang, Meng
    Zhang, Jiaheng
    Liu, Jun
    Hu, Wei
    Wang, Sen
    Li, Xue
    Liu, Wenqiang
    SEMANTIC WEB - ISWC 2017, PT II, 2017, 10588 : 219 - 227
  • [30] Medical Scribes and Electronic Health Records
    Soudi, Abdesalam
    McCague, Anna-Binney
    JAMA-JOURNAL OF THE AMERICAN MEDICAL ASSOCIATION, 2015, 314 (05): : 518 - 519