LLMs Accelerate Annotation for Medical Information Extraction

被引:0
|
作者
Goel, Akshay [1 ]
Gueta, Almog [1 ]
Gilon, Omry [1 ]
Liu, Chang [1 ]
Erell, Sofia [1 ]
Lan Huong Nguyen [1 ]
Hao, Xiaohong [1 ]
Jaber, Bolous [1 ]
Reddy, Shashir [1 ]
Kartha, Rupesh [1 ]
Steiner, Jean [1 ]
Laish, Itay [1 ]
Feder, Amir [1 ]
机构
[1] Google Res, Mountain View, CA 94035 USA
关键词
Medical NLP; Large Language Models; Data Annotation; ELECTRONIC HEALTH RECORDS; TEXT;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The unstructured nature of clinical notes within electronic health records often conceals vital patient-related information, making it challenging to access or interpret. To uncover this hidden information, specialized Natural Language Processing (NLP) models are required. However, training these models necessitates large amounts of labeled data, a process that is both time-consuming and costly when relying solely on human experts for annotation. In this paper, we propose an approach that combines Large Language Models (LLMs) with human expertise to create an efficient method for generating ground truth labels for medical text annotation. By utilizing LLMs in conjunction with human annotators, we significantly reduce the human annotation burden, enabling the rapid creation of labeled datasets. We rigorously evaluate our method on a medical information extraction task, demonstrating that our approach not only substantially cuts down on human intervention but also maintains high accuracy. The results highlight the potential of using LLMs to improve the utilization of unstructured clinical data, allowing for the swift deployment of tailored NLP solutions in healthcare.
引用
收藏
页码:82 / 100
页数:19
相关论文
共 50 条
  • [1] Leveraging LLMs for Information Extraction in Manufacturing
    Matthes, Marvin
    Guhr, Oliver
    Krockert, Martin
    Munkelt, Torsten
    ADVANCES IN PRODUCTION MANAGEMENT SYSTEMS-PRODUCTION MANAGEMENT SYSTEMS FOR VOLATILE, UNCERTAIN, COMPLEX, AND AMBIGUOUS ENVIRONMENTS, APMS 2024, PT V, 2024, 732 : 355 - 366
  • [2] Knowledge Extraction from LLMs for Scalable Historical Data Annotation
    Celli, Fabio
    Mingazov, Dmitry
    ELECTRONICS, 2024, 13 (24):
  • [3] A unified framework of medical information annotation and extraction for Chinese clinical text
    Zhu, Enwei
    Sheng, Qilin
    Yang, Huanwan
    Liu, Yiyang
    Cai, Ting
    Li, Jinpeng
    ARTIFICIAL INTELLIGENCE IN MEDICINE, 2023, 142
  • [4] WEIGHT ANNOTATION IN INFORMATION EXTRACTION
    Doleschal, Johannes
    Kimelfeld, Benny
    Martens, Wim
    Peterfreund, Liat
    LOGICAL METHODS IN COMPUTER SCIENCE, 2020, 18 (01)
  • [5] ITAKE: Interactive Unstructured Text Annotation and Knowledge Extraction System with LLMs and ModelOps
    Song, Jiahe
    Ding, Hongxin
    Wang, Zhiyuan
    Xu, Yongxin
    Zhao, Junfeng
    Wang, Yasha
    PROCEEDINGS OF THE 62ND ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, VOL 3: SYSTEM DEMONSTRATIONS, 2024, : 326 - 334
  • [6] JaMIE: A Pipeline Japanese Medical Information Extraction System with Novel Relation Annotation
    Cheng, Fei
    Yada, Shuntaro
    Tanaka, Ribeka
    Aramaki, Eiji
    Kurohashi, Sadao
    2022 Language Resources and Evaluation Conference, LREC 2022, 2022, : 3724 - 3731
  • [7] JaMIE: A Pipeline Japanese Medical Information Extraction System with Novel Relation Annotation
    Cheng, Fei
    Yada, Shuntaro
    Tanaka, Ribeka
    Aramaki, Eiji
    Kurohashi, Sadao
    LREC 2022: THIRTEEN INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2022, : 3724 - 3731
  • [8] Information extraction and semantic annotation of wikipedia
    Computer Science Department, Universidad Autonoma de Madrid, Spain
    不详
    Front. Artif. Intell. Appl., 2008, 1 (145-169):
  • [9] Annotation projection for temporal information extraction
    Giannella, Chris R.
    Winder, Ransom K.
    Jubinski, Joseph P.
    NATURAL LANGUAGE ENGINEERING, 2019, 25 (03) : 385 - 403
  • [10] KnowCoder: Coding Structured Knowledge into LLMs for Universal Information Extraction
    Li, Zixuan
    Zeng, Yutao
    Zuo, Yuxin
    Ren, Weicheng
    Liu, Wenxuan
    Su, Miao
    Guo, Yucan
    Liu, Yantao
    Li, Xiang
    Hu, Zhilei
    Bai, Long
    Li, Wei
    Liu, Yidan
    Yang, Pan
    Jin, Xiaolong
    Guo, Jiafeng
    Cheng, Xueqi
    PROCEEDINGS OF THE 62ND ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, VOL 1: LONG PAPERS, 2024, : 8758 - 8779