Ontology-driven weak supervision for clinical entity classification in electronic health records

被引:0
|
作者
Jason A. Fries
Ethan Steinberg
Saelig Khattar
Scott L. Fleming
Jose Posada
Alison Callahan
Nigam H. Shah
机构
[1] Stanford University,Center for Biomedical Informatics Research
[2] Stanford University,Department of Computer Science
来源
Nature Communications | / 12卷
关键词
D O I
暂无
中图分类号
学科分类号
摘要
In the electronic health record, using clinical notes to identify entities such as disorders and their temporality (e.g. the order of an event relative to a time index) can inform many important analyses. However, creating training data for clinical entity tasks is time consuming and sharing labeled data is challenging due to privacy concerns. The information needs of the COVID-19 pandemic highlight the need for agile methods of training machine learning models for clinical notes. We present Trove, a framework for weakly supervised entity classification using medical ontologies and expert-generated rules. Our approach, unlike hand-labeled notes, is easy to share and modify, while offering performance comparable to learning from manually labeled training data. In this work, we validate our framework on six benchmark tasks and demonstrate Trove’s ability to analyze the records of patients visiting the emergency department at Stanford Health Care for COVID-19 presenting symptoms and risk factors.
引用
收藏
相关论文
共 50 条
  • [41] Ontology-based Framework for Electronic Health Records Interoperability
    Gonzalez, Carolina
    Blobel, Bernd G. M. E.
    Lopez, Diego M.
    USER CENTRED NETWORKED HEALTH CARE, 2011, 169 : 694 - 698
  • [42] Ontology-driven identification of inconsistencies in clinical data: A case study in lung cancer phenotyping
    Awuklu, Yvon K.
    Mougin, Fleur
    Griffier, Romain
    Bienvenu, Meghyn
    Jouhet, Vianney
    JOURNAL OF BIOMEDICAL INFORMATICS, 2025, 165
  • [44] Can Synthetic Text Help Clinical Named Entity Recognition? A Study of Electronic Health Records in French
    Hiebel, Nicolas
    Ferret, Olivier
    Fort, Karen
    Neveol, Aurelie
    17TH CONFERENCE OF THE EUROPEAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, EACL 2023, 2023, : 2320 - 2338
  • [45] GLOBAL CONTRASTIVE TRAINING FOR MULTIMODAL ELECTRONIC HEALTH RECORDS WITH LANGUAGE SUPERVISION
    Ma, Yingbo
    Kolla, Suraj
    Hu, Zhenhong
    Kaliraman, Dhruv
    Nolan, Victoria
    Guan, Ziyuan
    Ren, Yuanfang
    Armfield, Brooke
    Ozrazgat-Baslanti, Tezcan
    Balch, Jeremy A.
    Loftus, Tyler J.
    Rashidi, Parisa
    Bihorac, Azra
    Shickel, Benjamin
    arXiv,
  • [46] Clinical Named Entity Recognition From Chinese Electronic Health Records via Machine Learning Methods
    Zhang, Yu
    Wang, Xuwen
    Hou, Zhen
    Li, Jiao
    JMIR MEDICAL INFORMATICS, 2018, 6 (04) : 242 - 254
  • [47] SAREF4health: IoT Standard-Based Ontology-Driven Healthcare Systems
    Moreira, Joao
    Ferreira Pires, Luis
    van Sinderen, Marten
    Daniele, Laura
    FORMAL ONTOLOGY IN INFORMATION SYSTEMS (FOIS 2018), 2018, 306 : 239 - 252
  • [48] Weak Supervision and Clustering-Based Sample Selection for Clinical Named Entity Recognition
    Sun, Wei
    Ji, Shaoxiong
    Denti, Tuulia
    Moen, Hans
    Kerro, Oleg
    Rannikko, Antti
    Marttinen, Pekka
    Koskinen, Miika
    MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES: APPLIED DATA SCIENCE AND DEMO TRACK, ECML PKDD 2023, PT VI, 2023, 14174 : 444 - 459
  • [49] A Blockchain Based Decentralized Identifiers for Entity Authentication in Electronic Health Records
    Manoj, T.
    Makkithaya, Krishnamoorthi
    Narendra, V. G.
    COGENT ENGINEERING, 2022, 9 (01):
  • [50] A clinical text classification paradigm using weak supervision and deep representation
    Wang, Yanshan
    Sohn, Sunghwan
    Liu, Sijia
    Shen, Feichen
    Wang, Liwei
    Atkinson, Elizabeth J.
    Amin, Shreyasee
    Liu, Hongfang
    BMC MEDICAL INFORMATICS AND DECISION MAKING, 2019, 19 (1)