Comprehensive testing of large language models for extraction of structured data in pathology

被引：0

作者：

Bastian Grothey ^{[1
]}

Jan Odenkirchen ^{[2
]}

Adnan Brkic ^{[1
]}

Birgid Schömig-Markiefka ^{[1
]}

Alexander Quaas ^{[1
]}

Reinhard Büttner ^{[1
]}

Yuri Tolkach ^{[1
]}

机构：

[1] University Hospital Cologne,Institute of Pathology

[2] University of Cologne,Medical Faculty

来源：

Communications Medicine | / 5卷 / 1期

关键词：

D O I：

10.1038/s43856-025-00808-8

中图分类号：

学科分类号：

摘要：

Pathology departments produce many diagnostic reports as free text, which is hard to analyze or use in research and computer projects. Converting this free text into more standard organized information like test results or diagnoses, makes it easier to use. This task often requires human experts and takes time. Large language models (LLMs), which are advanced computer systems designed to understand and generate human-like text, might simplify this process. Here, we tested six LLMs, including freely available models and the commercial GPT-4 model, using 579 pathology reports in English and German. Our results show that freely available models can perform as well as commercial, providing a cheaper solution while avoiding privacy concerns. The shared dataset will support future research in pathology data processing.

引用

共 50 条

[41] The interaction of structured data using openEHR and large Language models for clinical decision support in prostate cancer
Kaiser, Philippe
Yang, Shan
Bach, Michael
Breit, Christian
Mertz, Kirsten
Stieltjes, Bram
Ebbing, Jan
Wetterauer, Christian
Henkel, Maurice
WORLD JOURNAL OF UROLOGY, 2025, 43 (01)
[42] Leveraging Cognitive Science for Testing Large Language Models
Srinivasan, Ramya
Inakoshi, Hiroya
Uchino, Kanji
2023 IEEE INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE TESTING, AITEST, 2023, : 169 - 171
[43] Controlling the Extraction of Memorized Data from Large Language Models via Prompt-Tuning
Ozdayi, Mustafa Safa
Peris, Charith
Fitzgerald, Jack
Dupuy, Christophe
Majmudar, Jimit
Khan, Haidar
Parikh, Rahil
Gupta, Rahul
61ST CONFERENCE OF THE THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, ACL 2023, VOL 2, 2023, : 1512 - 1521
[44] Pipelines for Social Bias Testing of Large Language Models
Nozza, Debora
Bianchi, Federico
Hovy, Dirk
PROCEEDINGS OF WORKSHOP ON CHALLENGES & PERSPECTIVES IN CREATING LARGE LANGUAGE MODELS (BIGSCIENCE EPISODE #5), 2022, : 68 - 74
[45] Testing theory of mind in large language models and humans
Strachan, James W. A.
Albergo, Dalila
Borghini, Giulia
Pansardi, Oriana
Scaliti, Eugenio
Gupta, Saurabh
Saxena, Krati
Rufo, Alessandro
Panzeri, Stefano
Manzi, Guido
Graziano, Michael S. A.
Becchio, Cristina
NATURE HUMAN BEHAVIOUR, 2024, 8 (07): : 1285 - 1295
[46] A Survey of Testing Techniques Based on Large Language Models
Qi, Fei
Hou, Yingnan
Lin, Ning
Bao, Shanshan
Xu, Nuo
PROCEEDINGS OF 2024 INTERNATIONAL CONFERENCE ON COMPUTER AND MULTIMEDIA TECHNOLOGY, ICCMT 2024, 2024, : 280 - 284
[47] Demystifying Data Management for Large Language Models
Miao, Xupeng
Jia, Zhihao
Cui, Bin
COMPANION OF THE 2024 INTERNATIONAL CONFERENCE ON MANAGEMENT OF DATA, SIGMOD-COMPANION 2024, 2024, : 547 - 555
[48] A Comprehensive Evaluation of Large Language Models on Legal Judgment Prediction
Shui, Ruihao
Cao, Yixin
Xiang, Wang
Chua, Tat-Seng
FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS - EMNLP 2023, 2023, : 7337 - 7348
[49] A Comprehensive Analysis of Various Tokenizers for Arabic Large Language Models
Qarah, Faisal
Alsanoosy, Tawfeeq
APPLIED SCIENCES-BASEL, 2024, 14 (13):
[50] Effective Structured Information Extraction from Chest Radiography Reports Using Open-Weights Large Language Models
Gee, James C.
Yao, Michael S.
RADIOLOGY, 2025, 314 (01)

← 1 2 3 4 5 →