Performance of an Open-Source Large Language Model in Extracting Information from Free-Text Radiology Reports

被引:12
|
作者
Le Guellec, Bastien [1 ,2 ]
Lefevre, Alexandre [1 ]
Geay, Charlotte [3 ]
Shorten, Lucas [3 ]
Bruge, Cyril [1 ]
Hacein-Bey, Lotfi [4 ]
Amouyel, Philippe [2 ,5 ]
Pruvo, Jean-Pierre [1 ,6 ,7 ]
Kuchcinski, Gregory [1 ,6 ,7 ]
Hamroun, Aghiles [2 ,5 ]
机构
[1] Univ Lille, Dept Neuroradiol, CHU Lille, Rue Emile Laine, F-59000 Lille, France
[2] Univ Lille, Dept Publ Hlth, CHU Lille, Rue Emile Laine, F-59000 Lille, France
[3] Univ Lille, CHU Lille, INclude Hlth Data Warehouse, Rue Emile Laine, F-59000 Lille, France
[4] UC Davis Hlth, Dept Radiol, Sacramento, CA 95817 USA
[5] Univ Lille, CHU Lille, Inst Pasteur Lille,Inserm, RID AGE Facteurs Ris & Determinants Mol Malad Liee, Lille, France
[6] Univ Lille, INSERM, LilNCog Lille Neurosci & Cognit U1172, Lille, France
[7] Univ Lille, Plateformes Lilloises Biol & Sante, UAR 2014, US 41,PLBS, Lille, France
关键词
D O I
10.1148/ryai.230364
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Purpose: To assess the performance of a local open-source large language model (LLM) in various information extraction tasks from real-life emergency brain MRI reports. Materials and Methods: All consecutive emergency brain MRI reports written in 2022 from a French quaternary center were retrospectively reviewed. Two radiologists identified MRI scans that were performed in the emergency department for headaches. Four radiologists scored the reports' conclusions as either normal or abnormal. Abnormalities were labeled as either headache-causing or incidental. Vicuna (LMSYS Org), an open-source LLM, performed the same tasks. Vicuna's performance metrics were evaluated using the radiologists' consensus as the reference standard. Results: Among the 2398 reports during the study period, radiologists identified 595 that included headaches in the indication (median age of patients, 35 years [IQR, 26-51 years]; 68% [403 of 595] women). A positive finding was reported in 227 of 595 (38%) cases, 136 of which could explain the headache. The LLM had a sensitivity of 98.0% (95% CI: 96.5, 99.0) and specificity of 99.3% (95% CI: 98.8, 99.7) for detecting the presence of headache in the clinical context, a sensitivity of 99.4% (95% CI: 98.3, 99.9) and specificity of 98.6% (95% CI: 92.2, 100.0) for the use of contrast medium injection, a sensitivity of 96.0% (95% CI: 92.5, 98.2) and specificity of 98.9% (95% CI: 97.2, 99.7) for study categorization as either normal or abnormal, and a sensitivity of 88.2% (95% CI: 81.6, 93.1) and specificity of 73% (95% CI: 62, 81) for causal inference between MRI findings and headache. Conclusion: An open-source LLM was able to extract information from free-text radiology reports with excellent accuracy without requiring further training.
引用
收藏
页数:11
相关论文
共 50 条
  • [31] The Fine-Tuned Large Language Model for Extracting the Progressive Bone Metastasis from Unstructured Radiology Reports
    Kanemaru, Noriko
    Yasaka, Koichiro
    Fujita, Nana
    Kanzawa, Jun
    Abe, Osamu
    JOURNAL OF IMAGING INFORMATICS IN MEDICINE, 2025, 38 (02): : 865 - 872
  • [32] Comparing Commercial and Open-Source Large Language Models for Labeling Chest Radiograph Reports
    Dorfner, Felix J.
    Juergensen, Liv
    Donle, Leonhard
    Al Mohamad, Fares
    Bodenmann, Tobias R.
    Cleveland, Mason C.
    Busch, Felix
    Adams, Lisa C.
    Sato, James
    Schultz, Thomas
    Kim, Albert E.
    Merkow, Jameson
    Bressem, Keno K.
    Bridge, Christopher P.
    RADIOLOGY, 2024, 313 (01)
  • [33] A scoping review of large language model based approaches for information extraction from radiology reports
    Reichenpfader, Daniel
    Muller, Henning
    Denecke, Kerstin
    NPJ DIGITAL MEDICINE, 2024, 7 (01):
  • [34] Need of Fine-Tuned Radiology Aware Open-Source Large Language Models for Neuroradiology
    Ray, Partha Pratim
    CLINICAL NEURORADIOLOGY, 2024,
  • [35] Large language models for error detection in radiology reports: a comparative analysis between closed-source and privacy-compliant open-source models
    Salam, Babak
    Stuewe, Claire
    Nowak, Sebastian
    Sprinkart, Alois M.
    Theis, Maike
    Kravchenko, Dmitrij
    Mesropyan, Narine
    Dell, Tatjana
    Endler, Christoph
    Pieper, Claus C.
    Kuetting, Daniel L.
    Luetkens, Julian A.
    Isaak, Alexander
    EUROPEAN RADIOLOGY, 2025,
  • [36] Diagnostic Performance Evaluation of Deep Learning-Based Medical Text Modelling to Predict Pulmonary Diseases from Unstructured Radiology Free-Text Reports
    Shetty, Shashank
    Ananthanarayana, V. S.
    Mahale, Ajit
    ACTA INFORMATICA PRAGENSIA, 2023, 12 (02) : 260 - 274
  • [37] Extracting information from free-text electronic patient records to identify practice-based evidence of the performance of coronary stents
    Kim, Yoon Seob
    Yoon, Dukyong
    Byun, JungHyun
    Park, Hojun
    Lee, Ahram
    Kim, Il Hyun
    Lee, Sukhoon
    Lim, Hong-Seok
    Park, Rae Woong
    PLOS ONE, 2017, 12 (08):
  • [38] AUTOMATED DATA CAPTURE FROM FREE-TEXT RADIOLOGY REPORTS TO ENHANCE ACCURACY OF HOSPITAL INPATIENT STROKE CODES
    Flynn, R.
    Macdonald, T.
    Murray, G.
    Doney, A.
    BASIC & CLINICAL PHARMACOLOGY & TOXICOLOGY, 2009, 105 : 49 - 49
  • [39] A Universal Prompting Strategy for Extracting Process Model Information from Natural Language Text Using Large Language Models
    Neuberger, Julian
    Ackermann, Lars
    van der Aa, Han
    Jablonski, Stefan
    CONCEPTUAL MODELING, ER 2024, 2025, 15238 : 38 - 55
  • [40] Automated data capture from free-text radiology reports to enhance accuracy of hospital inpatient stroke codes
    Flynn, Robert W. V.
    Macdonald, Thomas M.
    Schembri, Nicola
    Murray, Gordon D.
    Doney, Alexander S. F.
    PHARMACOEPIDEMIOLOGY AND DRUG SAFETY, 2010, 19 (08) : 843 - 847