Constructing synthetic datasets with generative artificial intelligence to train large language models to classify acute renal failure from clinical notes

Cited by: 1
Authors
Litake, Onkar [1]
Park, Brian H. [1]
Tully, Jeffrey L. [1]
Gabriel, Rodney A. [1, 2]
Affiliations
[1] Univ Calif San Diego, Dept Anesthesiol, Div Perioperat Informat, 9400 Campus Point Dr, La Jolla, CA 92037 USA
[2] Univ Calif San Diego Hlth, Dept Biomed Informat, La Jolla, CA 92037 USA
Keywords
large language models; artificial intelligence; generative AI; ChatGPT
DOI
10.1093/jamia/ocae081
Chinese Library Classification number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Objectives: To compare the performance of a classifier that leverages language models when trained on synthetic versus authentic clinical notes.
Materials and Methods: A classifier using language models was developed to identify acute renal failure. Four types of training data were compared: (1) notes from MIMIC-III; and (2, 3, and 4) synthetic notes generated by ChatGPT with text lengths of 15 (GPT-15 sentences), 30 (GPT-30 sentences), and 45 (GPT-45 sentences) sentences, respectively. The area under the receiver operating characteristic curve (AUC) was calculated on a test set from MIMIC-III.
Results: With RoBERTa, the AUCs were 0.84, 0.80, 0.84, and 0.76 for the MIMIC-III, GPT-15, GPT-30, and GPT-45 sentences training sets, respectively.
Discussion: Training language models to detect acute renal failure from clinical notes resulted in similar performance when using synthetic versus authentic training data.
Conclusion: The use of training data derived from protected health information may not be needed.
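Illustrative note (not part of the published abstract): the comparison described above amounts to scoring one shared test set with classifiers trained on different training sets and computing an AUC for each. The following is a minimal sketch of that evaluation step only, assuming binary labels and one predicted score per test note. The function name `roc_auc`, the model names, and all labels and scores are hypothetical and invented for illustration; they are not study data and this is not the authors' code.

```python
def roc_auc(labels, scores):
    """AUC as the probability that a randomly chosen positive case
    receives a higher score than a randomly chosen negative case.
    Tied scores count as half a win."""
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    if not pos or not neg:
        raise ValueError("need at least one positive and one negative label")
    wins = 0.0
    for p in pos:
        for n in neg:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(pos) * len(neg))


# Toy test-set labels (1 = condition present), invented for illustration.
test_labels = [1, 0, 1, 0, 1, 0]

# Hypothetical scores from two classifiers trained on different training sets
# and evaluated on the same test set. Values are made up.
scores_by_training_set = {
    "hypothetical_model_a": [0.9, 0.2, 0.7, 0.4, 0.6, 0.5],
    "hypothetical_model_b": [0.8, 0.6, 0.3, 0.4, 0.7, 0.1],
}

aucs = {
    name: roc_auc(test_labels, scores)
    for name, scores in scores_by_training_set.items()
}

for name, auc in aucs.items():
    print(f"{name}: AUC = {auc:.2f}")
```

The pairwise formulation is quadratic in the number of test notes, which is fine for a sketch; for larger test sets a rank-based implementation from an established statistics library would be the usual choice.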
Pages: 1404-1410
Number of pages: 7