Performance of an Open-Source Large Language Model in Extracting Information from Free-Text Radiology Reports

被引:12
|
作者
Le Guellec, Bastien [1 ,2 ]
Lefevre, Alexandre [1 ]
Geay, Charlotte [3 ]
Shorten, Lucas [3 ]
Bruge, Cyril [1 ]
Hacein-Bey, Lotfi [4 ]
Amouyel, Philippe [2 ,5 ]
Pruvo, Jean-Pierre [1 ,6 ,7 ]
Kuchcinski, Gregory [1 ,6 ,7 ]
Hamroun, Aghiles [2 ,5 ]
机构
[1] Univ Lille, Dept Neuroradiol, CHU Lille, Rue Emile Laine, F-59000 Lille, France
[2] Univ Lille, Dept Publ Hlth, CHU Lille, Rue Emile Laine, F-59000 Lille, France
[3] Univ Lille, CHU Lille, INclude Hlth Data Warehouse, Rue Emile Laine, F-59000 Lille, France
[4] UC Davis Hlth, Dept Radiol, Sacramento, CA 95817 USA
[5] Univ Lille, CHU Lille, Inst Pasteur Lille,Inserm, RID AGE Facteurs Ris & Determinants Mol Malad Liee, Lille, France
[6] Univ Lille, INSERM, LilNCog Lille Neurosci & Cognit U1172, Lille, France
[7] Univ Lille, Plateformes Lilloises Biol & Sante, UAR 2014, US 41,PLBS, Lille, France
关键词
D O I
10.1148/ryai.230364
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Purpose: To assess the performance of a local open-source large language model (LLM) in various information extraction tasks from real-life emergency brain MRI reports. Materials and Methods: All consecutive emergency brain MRI reports written in 2022 from a French quaternary center were retrospectively reviewed. Two radiologists identified MRI scans that were performed in the emergency department for headaches. Four radiologists scored the reports' conclusions as either normal or abnormal. Abnormalities were labeled as either headache-causing or incidental. Vicuna (LMSYS Org), an open-source LLM, performed the same tasks. Vicuna's performance metrics were evaluated using the radiologists' consensus as the reference standard. Results: Among the 2398 reports during the study period, radiologists identified 595 that included headaches in the indication (median age of patients, 35 years [IQR, 26-51 years]; 68% [403 of 595] women). A positive finding was reported in 227 of 595 (38%) cases, 136 of which could explain the headache. The LLM had a sensitivity of 98.0% (95% CI: 96.5, 99.0) and specificity of 99.3% (95% CI: 98.8, 99.7) for detecting the presence of headache in the clinical context, a sensitivity of 99.4% (95% CI: 98.3, 99.9) and specificity of 98.6% (95% CI: 92.2, 100.0) for the use of contrast medium injection, a sensitivity of 96.0% (95% CI: 92.5, 98.2) and specificity of 98.9% (95% CI: 97.2, 99.7) for study categorization as either normal or abnormal, and a sensitivity of 88.2% (95% CI: 81.6, 93.1) and specificity of 73% (95% CI: 62, 81) for causal inference between MRI findings and headache. Conclusion: An open-source LLM was able to extract information from free-text radiology reports with excellent accuracy without requiring further training.
引用
收藏
页数:11
相关论文
共 50 条
  • [1] Extracting Information from Free-text Mammography Reports
    Esuli, Andrea
    Marcheggiani, Diego
    Sebastiani, Fabrizio
    ERCIM NEWS, 2010, (82): : 60 - 61
  • [2] Large language model-based information extraction from free-text radiology reports: a scoping review protocol
    Reichenpfader, Daniel
    Muller, Henning
    Denecke, Kerstin
    BMJ OPEN, 2023, 13 (12):
  • [3] Extracting information from free text radiology reports
    Johnson D.B.
    Taira R.K.
    Cardenas A.F.
    Aberle D.R.
    International Journal on Digital Libraries, 1997, 1 (3) : 297 - 308
  • [4] Automating Clinical Chart Review: An Open-Source Natural Language Processing Pipeline Developed on Free-Text Radiology Reports From Patients With Glioblastoma
    Senders, Joeky T.
    Cho, Logan D.
    Calvachi, Paola
    McNulty, John J.
    Ashby, Joanna L.
    Schulte, Isabelle S.
    Almekkawi, Ahmad Kareem
    Mehrtash, Alireza
    Gormley, William B.
    Smith, Timothy R.
    Broekman, Marike L. D.
    Arnaout, Omar
    JCO CLINICAL CANCER INFORMATICS, 2020, 4 : 25 - 34
  • [5] Large Language Model Ability to Translate CT and MRI Free-Text Radiology Reports Into Multiple Languages
    Meddeb, Aymen
    Lueken, Sophia
    Busch, Felix
    Adams, Lisa
    Ugga, Lorenzo
    Koltsakis, Emmanouil
    Tzortzakakis, Antonios
    Jelassi, Soumaya
    Dkhil, Insaf
    Klontzas, Michail E.
    Triantafyllou, Matthaios
    Kocak, Burak
    Yuezkan, Sabahattin
    Zhang, Longjiang
    Hu, Bin
    Andreychenko, Anna
    Yurievich, Efimtcev Alexander
    Logunova, Tatiana
    Morakote, Wipawee
    Angkurawaranon, Salita
    Makowski, Marcus R.
    Wattjes, Mike P.
    Cuocolo, Renato
    Bressem, Keno
    RADIOLOGY, 2024, 313 (03)
  • [6] Automatic structuring of radiology reports with on-premise open-source large language models
    Woznicki, Piotr
    Laqua, Caroline
    Fiku, Ina
    Hekalo, Amar
    Truhn, Daniel
    Engelhardt, Sandy
    Kather, Jakob
    Foersch, Sebastian
    D'Antonoli, Tugba Akinci
    dos Santos, Daniel Pinto
    Baessler, Bettina
    Laqua, Fabian Christopher
    EUROPEAN RADIOLOGY, 2025, 35 (04) : 2018 - 2029
  • [7] Re: Open-Source Large Language Models in Radiology
    Kooraki, Soheil
    Bedayat, Arash
    ACADEMIC RADIOLOGY, 2024, 31 (10) : 4293 - 4293
  • [8] Extracting information from free-text aircraft repair notes
    Farley, B
    AI EDAM-ARTIFICIAL INTELLIGENCE FOR ENGINEERING DESIGN ANALYSIS AND MANUFACTURING, 2001, 15 (04): : 295 - 305
  • [9] Upgrading Academic Radiology with Open-Source Large Language Models
    Ray, Partha Pratim
    ACADEMIC RADIOLOGY, 2024, 31 (10) : 4291 - 4292
  • [10] FinBERT: A Large Language Model for Extracting Information from Financial Text
    Huang, Allen H.
    Wang, Hui
    Yang, Yi
    CONTEMPORARY ACCOUNTING RESEARCH, 2023, 40 (02) : 806 - 841