Learning a Health Knowledge Graph from Electronic Medical Records

被引:258
|
作者
Rotmensch, Maya [1 ]
Halpern, Yoni [2 ]
Tlimat, Abdulhakim [3 ]
Horng, Steven [3 ,4 ]
Sontag, David [5 ,6 ]
机构
[1] NYU, Ctr Data Sci, New York, NY USA
[2] NYU, Dept Comp Sci, New York, NY USA
[3] Beth Israel Deaconess Med Ctr, Dept Emergency Med, Boston, MA 02215 USA
[4] Beth Israel Deaconess Med Ctr, Div Clin Informat, Boston, MA 02215 USA
[5] MIT, Dept Elect Engn & Comp Sci, Comp Sci & Artificial Intelligence Lab, 77 Massachusetts Ave, Cambridge, MA 02139 USA
[6] MIT, Inst Med Engn & Sci, 77 Massachusetts Ave, Cambridge, MA 02139 USA
来源
SCIENTIFIC REPORTS | 2017年 / 7卷
关键词
DIAGNOSIS; PROGRAM;
D O I
10.1038/s41598-017-05778-z
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Demand for clinical decision support systems in medicine and self-diagnostic symptom checkers has substantially increased in recent years. Existing platforms rely on knowledge bases manually compiled through a labor-intensive process or automatically derived using simple pairwise statistics. This study explored an automated process to learn high quality knowledge bases linking diseases and symptoms directly from electronic medical records. Medical concepts were extracted from 273,174 de-identified patient records and maximum likelihood estimation of three probabilistic models was used to automatically construct knowledge graphs: logistic regression, naive Bayes classifier and a Bayesian network using noisy OR gates. A graph of disease-symptom relationships was elicited from the learned parameters and the constructed knowledge graphs were evaluated and validated, with permission, against Google's manually-constructed knowledge graph and against expert physician opinions. Our study shows that direct and automated construction of high quality health knowledge graphs from medical records using rudimentary concept extraction is feasible. The noisy OR model produces a high quality knowledge graph reaching precision of 0.85 for a recall of 0.6 in the clinical evaluation. Noisy OR significantly outperforms all tested models across evaluation frameworks (p < 0.01).
引用
收藏
页数:11
相关论文
共 50 条
  • [31] Electronic Medical Records and Electronic Health Records: Overview for Nurse Practitioners
    McMullen, Patricia C.
    Howie, William O.
    Philipsen, Nayna
    Bryant, Virletta C.
    Setlow, Patricia D.
    Calhoun, Mona
    Green, Zakevia D.
    JNP-JOURNAL FOR NURSE PRACTITIONERS, 2014, 10 (09): : 56 - 61
  • [32] Learning bundled care opportunities from electronic medical records
    Chen, You
    Kho, Abel N.
    Liebovitz, David
    Ivory, Catherine
    Osmundson, Sarah
    Bian, Jiang
    Malin, Bradley A.
    JOURNAL OF BIOMEDICAL INFORMATICS, 2018, 77 : 1 - 10
  • [33] MKDS: A Medical Knowledge Discovery System Learned from Electronic Medical Records (Demonstration)
    Huang, Hen-Hsen
    Yen, An-Zi
    Chen, Hsin-Hsi
    INFORMATION RETRIEVAL TECHNOLOGY (AIRS 2018), 2018, 11292 : 196 - 202
  • [34] Self-Supervised Representation Learning on Electronic Health Records with Graph Kernel Infomax
    Yao, Hao-ren
    Cao, Nairen
    Russell, Katina
    Chang, Der-chen
    Frieder, Ophir
    Fineman, Jeremy t.
    ACM TRANSACTIONS ON COMPUTING FOR HEALTHCARE, 2024, 5 (02):
  • [35] Generating Accurate Electronic Health Assessment from Medical Graph
    Yang, Zhichao
    Yu, Hong
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, EMNLP 2020, 2020, : 3764 - 3773
  • [36] Federated Learning for Electronic Health Records
    Dang, Trung Kien
    Lan, Xiang
    Weng, Jianshu
    Feng, Mengling
    ACM TRANSACTIONS ON INTELLIGENT SYSTEMS AND TECHNOLOGY, 2022, 13 (05)
  • [37] Learning from heterogeneous temporal data in electronic health records
    Zhao, Jing
    Papapetrou, Panagiotis
    Asker, Lars
    Bostrom, Henrik
    JOURNAL OF BIOMEDICAL INFORMATICS, 2017, 65 : 105 - 119
  • [38] Knowledge Acquisition for Electronic Health Records on cloud
    Jagli, Dhanamma
    Purohit, Seema
    Chandra, Subhash
    KNOWLEDGE-BASED AND INTELLIGENT INFORMATION & ENGINEERING SYSTEMS, 2017, 112 : 1909 - 1915
  • [39] Temporal Phenotyping from Longitudinal Electronic Health Records: A Graph Based Framework
    Liu, Chuanren
    Wang, Fei
    Hu, Jianying
    Xiong, Hui
    KDD'15: PROCEEDINGS OF THE 21ST ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2015, : 705 - 714
  • [40] Knowledge Management for the Protection of Information in Electronic Medical Records
    Lea, Nathan
    Hailes, Stephen
    Austin, Tony
    Kalra, Dipak
    EHEALTH BEYOND THE HORIZON - GET IT THERE, 2008, 136 : 685 - +