Comprehensive testing of large language models for extraction of structured data in pathology

Authors
Bastian Grothey [1]
Jan Odenkirchen [2]
Adnan Brkic [1]
Birgid Schömig-Markiefka [1]
Alexander Quaas [1]
Reinhard Büttner [1]
Yuri Tolkach [1]
Affiliations
[1] University Hospital Cologne, Institute of Pathology
[2] University of Cologne, Medical Faculty
DOI
10.1038/s43856-025-00808-8
Abstract
Pathology departments produce many diagnostic reports as free text, which is hard to analyze or reuse in research and computational projects. Converting this free text into standardized, structured information, such as test results or diagnoses, makes it easier to use. This task usually requires human experts and is time-consuming. Large language models (LLMs), which are advanced computer systems designed to understand and generate human-like text, might simplify this process. Here, we tested six LLMs, including freely available models and the commercial GPT-4 model, using 579 pathology reports in English and German. Our results show that freely available models can perform as well as the commercial model, providing a cheaper solution while avoiding privacy concerns. The shared dataset will support future research in pathology data processing.