LLMs Accelerate Annotation for Medical Information Extraction

被引:0
|
作者
Goel, Akshay [1 ]
Gueta, Almog [1 ]
Gilon, Omry [1 ]
Liu, Chang [1 ]
Erell, Sofia [1 ]
Lan Huong Nguyen [1 ]
Hao, Xiaohong [1 ]
Jaber, Bolous [1 ]
Reddy, Shashir [1 ]
Kartha, Rupesh [1 ]
Steiner, Jean [1 ]
Laish, Itay [1 ]
Feder, Amir [1 ]
机构
[1] Google Res, Mountain View, CA 94035 USA
关键词
Medical NLP; Large Language Models; Data Annotation; ELECTRONIC HEALTH RECORDS; TEXT;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The unstructured nature of clinical notes within electronic health records often conceals vital patient-related information, making it challenging to access or interpret. To uncover this hidden information, specialized Natural Language Processing (NLP) models are required. However, training these models necessitates large amounts of labeled data, a process that is both time-consuming and costly when relying solely on human experts for annotation. In this paper, we propose an approach that combines Large Language Models (LLMs) with human expertise to create an efficient method for generating ground truth labels for medical text annotation. By utilizing LLMs in conjunction with human annotators, we significantly reduce the human annotation burden, enabling the rapid creation of labeled datasets. We rigorously evaluate our method on a medical information extraction task, demonstrating that our approach not only substantially cuts down on human intervention but also maintains high accuracy. The results highlight the potential of using LLMs to improve the utilization of unstructured clinical data, allowing for the swift deployment of tailored NLP solutions in healthcare.
引用
收藏
页码:82 / 100
页数:19
相关论文
共 50 条
  • [31] Learning information extraction rules for protein annotation from unannotated corpora
    Kim, JH
    Hilario, M
    COMPUTATIONAL LINGUISTICS AND INTELLIGENT TEXT PROCESSING, 2005, 3406 : 522 - 534
  • [32] User-system cooperation in document annotation based on information extraction
    Ciravegna, F
    Dingli, A
    Petrelli, D
    Wilks, Y
    KNOWLEDGE ENGINEERING AND KNOWLEDGE MANAGEMENT, PROCEEDINGS: ONTOLOGIES AND THE SEMANTIC WEB, 2002, 2473 : 122 - 137
  • [33] ANNO: A General Annotation Tool for Bilingual Clinical Note Information Extraction
    Lee, Kye Hwa
    Lee, Hyunsung
    Park, Jin-Hyeok
    Kim, Yi-Jun
    Lee, Youngho
    HEALTHCARE INFORMATICS RESEARCH, 2022, 28 (01) : 89 - 94
  • [34] AnnIE: An Annotation Platform for Constructing Complete Open Information Extraction Benchmark
    Friedrich, Niklas
    Gashteovski, Kiril
    Yu, Mingying
    Kotnis, Bhushan
    Lawrence, Carolin
    Niepert, Mathias
    Glavas, Goran
    PROCEEDINGS OF THE 60TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2022): PROCEEDINGS OF SYSTEM DEMONSTRATIONS, 2022, : 44 - 60
  • [35] Use of "off-the-shelf" information extraction algorithms in clinical informatics: A feasibility study of MetaMap annotation of Italian medical notes
    Chiaramello, Emma
    Pinciroli, Francesco
    Bonalumi, Alberico
    Caroli, Angelo
    Tognola, Gabriella
    JOURNAL OF BIOMEDICAL INFORMATICS, 2016, 63 : 22 - 32
  • [36] Information extraction and summarization from medical documents
    Spyropoulos, CD
    Karkatetsis, V
    ARTIFICIAL INTELLIGENCE IN MEDICINE, 2005, 33 (02) : 107 - 110
  • [37] Information extraction from sound for medical telemonitoring
    Istrate, D
    Castelli, E
    Vacher, M
    Besacier, L
    Serignat, JF
    IEEE TRANSACTIONS ON INFORMATION TECHNOLOGY IN BIOMEDICINE, 2006, 10 (02): : 264 - 274
  • [38] An Extraction of Medical Information Based on Human Handwritings
    Bhaskoro, Susetyo Bagas
    Supangkat, Suhono Harso
    2014 INTERNATIONAL CONFERENCE ON INFORMATION TECHNOLOGY SYSTEMS AND INNOVATION (ICITSI), 2014, : 253 - 258
  • [39] Synthetic data for annotation and extraction of family history information from clinical text
    Pål H. Brekke
    Taraka Rama
    Ildikó Pilán
    Øystein Nytrø
    Lilja Øvrelid
    Journal of Biomedical Semantics, 12
  • [40] Using adaptive information extraction for effective human-centred document annotation
    Ciravegna, F
    Dingli, A
    Wilks, Y
    Petrelli, D
    TEXT MINING: THEORETICAL ASPECTS AND APPLICATIONS, 2003, : 153 - 163