Comprehensive testing of large language models for extraction of structured data in pathology

被引:0
|
作者
Bastian Grothey [1 ]
Jan Odenkirchen [2 ]
Adnan Brkic [1 ]
Birgid Schömig-Markiefka [1 ]
Alexander Quaas [1 ]
Reinhard Büttner [1 ]
Yuri Tolkach [1 ]
机构
[1] University Hospital Cologne,Institute of Pathology
[2] University of Cologne,Medical Faculty
来源
关键词
D O I
10.1038/s43856-025-00808-8
中图分类号
学科分类号
摘要
Pathology departments produce many diagnostic reports as free text, which is hard to analyze or use in research and computer projects. Converting this free text into more standard organized information like test results or diagnoses, makes it easier to use. This task often requires human experts and takes time. Large language models (LLMs), which are advanced computer systems designed to understand and generate human-like text, might simplify this process. Here, we tested six LLMs, including freely available models and the commercial GPT-4 model, using 579 pathology reports in English and German. Our results show that freely available models can perform as well as commercial, providing a cheaper solution while avoiding privacy concerns. The shared dataset will support future research in pathology data processing.
引用
收藏
相关论文
共 50 条
  • [21] Collaborative large language models for automated data extraction in living systematic reviews
    Khan, Muhammad Ali
    Ayub, Umair
    Naqvi, Syed Arsalan Ahmed
    Khakwani, Kaneez Zahra Rubab
    Sipra, Zaryab bin Riaz
    Raina, Ammad
    Zhou, Sihan
    He, Huan
    Saeidi, Amir
    Hasan, Bashar
    Rumble, Robert Bryan
    Bitterman, Danielle S.
    Warner, Jeremy L.
    Zou, Jia
    Tevaarwerk, Amye J.
    Leventakos, Konstantinos
    Kehl, Kenneth L.
    Palmer, Jeanne M.
    Murad, Mohammad Hassan
    Baral, Chitta
    bin Riaz, Irbaz
    JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 2025,
  • [22] Event extraction based on self-data augmentation with large language models
    Yang, Lishan
    Fan, Xi
    Wang, Xiangyu
    Wang, Xin
    Chen, Qiuju
    MEMETIC COMPUTING, 2025, 17 (01)
  • [23] Large language models for data extraction from unstructured and semi-structured electronic health records: a multiple model performance evaluation
    Ntinopoulos, Vasileios
    Biefer, Hector Rodriguez Cetina
    Tudorache, Igor
    Papadopoulos, Nestoras
    Odavic, Dragan
    Risteski, Petar
    Haeussler, Achim
    Dzemali, Omer
    BMJ HEALTH & CARE INFORMATICS, 2025, 32 (01)
  • [24] Large language models for structured reporting in radiology: comment
    Amnuay Kleebayoon
    Viroj Wiwanitkit
    La radiologia medica, 2023, 128 : 1440 - 1440
  • [25] SKILL: Structured Knowledge Infusion for Large Language Models
    Moiseev, Fedor
    Dong, Zhe
    Alfonseca, Enrique
    Jaggi, Martin
    NAACL 2022: THE 2022 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES, 2022, : 1581 - 1588
  • [26] Large language models for structured reporting in radiology: comment
    Kleebayoon, Amnuay
    Wiwanitkit, Viroj
    RADIOLOGIA MEDICA, 2023, 128 (11): : 1440 - 1440
  • [27] The Potential Utility of Large Language Models in Molecular Pathology
    Gagan, Jeffrey
    JOURNAL OF APPLIED LABORATORY MEDICINE, 2024, 9 (01): : 159 - 161
  • [28] Data-driven building load prediction and large language models: Comprehensive overview
    Zhang, Yake
    Wang, Dijun
    Wang, Guansong
    Xu, Peng
    Zhu, Yihao
    ENERGY AND BUILDINGS, 2025, 326
  • [29] Large Language Model-Driven Structured Output: A Comprehensive Benchmark and Spatial Data Generation Framework
    Li, Diya
    Zhao, Yue
    Wang, Zhifang
    Jung, Calvin
    Zhang, Zhe
    ISPRS INTERNATIONAL JOURNAL OF GEO-INFORMATION, 2024, 13 (11)
  • [30] Evaluating large language models for software testing
    Li, Yihao
    Liu, Pan
    Wang, Haiyang
    Chu, Jie
    Wong, W. Eric
    COMPUTER STANDARDS & INTERFACES, 2025, 93