Leveraging large language models for medical text classification: a hospital readmission prediction case

Authors
Nazyrova, Nodira [1]
Chahed, Salma [1]
Chausalet, Thierry [1]
Dwek, Miriam [2]
Affiliations
[1] Univ Westminster, Sch Comp Sci & Engn, London, England
[2] Univ Westminster, Sch Life Sci, London, England
Keywords
hospital readmission prediction; domain-specific transformer models; BERT; ClinicalBERT; SciBERT; BioBERT; large language models
DOI
10.1109/ICPRS62101.2024.10677826
Chinese Library Classification
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
In recent years, the intersection of natural language processing (NLP) and healthcare informatics has changed substantially. One of the most significant developments in this area is the advent of large language models (LLMs), which have demonstrated remarkable capabilities in analysing clinical data. This paper explores the potential of LLMs in medical text classification, in particular their ability to discern subtle patterns, grasp domain-specific terminology, and adapt to the dynamic nature of medical information. The research focuses on applying transformer-based models, such as Bidirectional Encoder Representations from Transformers (BERT), to hospital discharge summaries to predict 30-day readmissions among older adults. In particular, we explore the role of transfer learning in medical text classification and compare domain-specific transformer models, such as SciBERT, BioBERT and ClinicalBERT. We also analyse how data preprocessing techniques affect the performance of language models. Our comparative analysis shows that removing parts of the text with a large proportion of out-of-vocabulary words improves the classification results. We also investigate how input sequence length affects model performance, varying it from 128 to 512 tokens for BERT-based models and using a sequence length of 4096 for Longformer. The results show that, among the compared models, SciBERT yields the best performance in the medical domain, improving on current hospital readmission prediction from clinical notes on MIMIC data, from 0.714 to 0.735 AUROC. Our next step is to pretrain a model on a large corpus of clinical notes, which could improve the adaptability of a language model to the medical domain and yield better results in downstream tasks.
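The preprocessing step mentioned in the abstract, removing parts of the text with a large proportion of out-of-vocabulary (OOV) words, can be illustrated with a minimal sketch. The abstract does not say how text parts are delimited, which vocabulary is used, or what threshold counts as "large", so everything below is an assumption: parts are taken to be lines, the vocabulary is a toy set (`TOY_VOCAB`), and the cut-off (`max_ratio=0.5`) and function names are hypothetical.

```python
import re

# Hypothetical toy vocabulary; a real pipeline would more likely use the
# vocabulary of the chosen model's tokenizer (an assumption, not from the paper).
TOY_VOCAB = {
    "patient", "was", "admitted", "with", "chest", "pain",
    "and", "discharged", "home", "the", "on", "day",
}


def oov_ratio(segment, vocab):
    """Return the fraction of word tokens in `segment` that are not in `vocab`."""
    tokens = re.findall(r"[a-z0-9]+", segment.lower())
    if not tokens:
        return 1.0  # treat empty segments as fully out-of-vocabulary
    return sum(tok not in vocab for tok in tokens) / len(tokens)


def drop_high_oov_segments(text, vocab, max_ratio=0.5):
    """Keep only non-empty lines whose OOV ratio is at most `max_ratio`."""
    kept = [
        line
        for line in text.splitlines()
        if line.strip() and oov_ratio(line, vocab) <= max_ratio
    ]
    return "\n".join(kept)


# Made-up example note: the middle line mimics a lab-value row that is
# almost entirely out-of-vocabulary.
note = (
    "Patient was admitted with chest pain\n"
    "Na 138 K 4.1 Cl 101 HCO3 24\n"
    "Discharged home on day 3"
)
cleaned = drop_high_oov_segments(note, TOY_VOCAB)
print(cleaned)
```

In this toy case the lab-value line is dropped and the two narrative lines are kept. The cleaned text would then be tokenized and truncated to the chosen sequence length before being passed to a classifier.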
Pages: 7