LLMs Accelerate Annotation for Medical Information Extraction

Authors
Goel, Akshay [1]
Gueta, Almog [1]
Gilon, Omry [1]
Liu, Chang [1]
Erell, Sofia [1]
Lan Huong Nguyen [1]
Hao, Xiaohong [1]
Jaber, Bolous [1]
Reddy, Shashir [1]
Kartha, Rupesh [1]
Steiner, Jean [1]
Laish, Itay [1]
Feder, Amir [1]
Affiliations
[1] Google Res, Mountain View, CA 94035 USA
Keywords
Medical NLP; Large Language Models; Data Annotation; Electronic Health Records; Text
DOI
Not available
Chinese Library Classification
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
The unstructured nature of clinical notes within electronic health records often conceals vital patient-related information, making it challenging to access or interpret. To uncover this hidden information, specialized Natural Language Processing (NLP) models are required. However, training these models necessitates large amounts of labeled data, a process that is both time-consuming and costly when relying solely on human experts for annotation. In this paper, we propose an approach that combines Large Language Models (LLMs) with human expertise to create an efficient method for generating ground truth labels for medical text annotation. By utilizing LLMs in conjunction with human annotators, we significantly reduce the human annotation burden, enabling the rapid creation of labeled datasets. We rigorously evaluate our method on a medical information extraction task, demonstrating that our approach not only substantially cuts down on human intervention but also maintains high accuracy. The results highlight the potential of using LLMs to improve the utilization of unstructured clinical data, allowing for the swift deployment of tailored NLP solutions in healthcare.
Pages: 82-100
Page count: 19