Leveraging large language models for medical text classification: a hospital readmission prediction case

被引：0

作者：

Nazyrova, Nodira ^{[1
]}

Chahed, Salma ^{[1
]}

Chausalet, Thierry ^{[1
]}

Dwek, Miriam ^{[2
]}

机构：

[1] Univ Westminster, Sch Comp Sci & Engn, London, England

[2] Univ Westminster, Sch Life Sci, London, England

来源：

2024 14TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION SYSTEMS, ICPRS | 2024年

关键词：

hospital readmission prediction; domain-specific transformer models; BERT; ClinicalBERT; SciBERT; BioBERT; large language models;

D O I：

10.1109/ICPRS62101.2024.10677826

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

In recent years, the intersection of natural language processing (NLP) and healthcare informatics has witnessed a revolutionary transformation. One of the most groundbreaking developments in this realm is the advent of large language models (LLM), which have demonstrated remarkable capabilities in analysing clinical data. This paper aims to explore the potential of large language models in medical text classification, shedding light on their ability to discern subtle patterns, grasp domain-specific terminology, and adapt to the dynamic nature of medical information. This research focuses on the application of transformer-based models, such as Bidirectional Encoder Representations from Transformers (BERT), on hospital discharge summaries to predict 30-day readmissions among older adults. In particular, we explore the role of transfer learning in medical text classification and compare domain-specific transformer models, such as SciBERT, BioBERT and ClinicalBERT. We also analyse how data preprocessing techniques affect the performance of language models. Our comparative analysis shows that removing parts of text with a large proportion of out-of-vocabulary words improves the classification results. We also investigate how the input sequence length affects the model performance, varying sequence length from 128 to 512 for BERT-based models and 4096 sequence length for the Longformers. The results of the investigation showed that among compared models SciBERT yields the best performance when applied in the medical domain, improving current hospital readmission predictions using clinical notes on MIMIC data from 0.714 to 0.735 AUROC. Our next step is pretraining a model with a large corpus of clinical notes to potentially improve the adaptability of a language model in the medical domain and achieve better results in downstream tasks.

引用

页数：7

共 50 条

[31] Risk Prediction Models for Hospital Readmission A Systematic Review
Kansagara, Devan
Englander, Honora
Salanitro, Amanda
Kagen, David
Theobald, Cecelia
Freeman, Michele
Kripalani, Sunil
JAMA-JOURNAL OF THE AMERICAN MEDICAL ASSOCIATION, 2011, 306 (15): : 1688 - 1698
[32] Automatic Histograms: Leveraging Language Models for Text Dataset Exploration
Reif, Emily
Qian, Crystal
Wexler, James
Kahng, Minsuk
EXTENDED ABSTRACTS OF THE 2024 CHI CONFERENCE ON HUMAN FACTORS IN COMPUTING SYSTEMS, CHI 2024, 2024,
[33] Leveraging Open Large Language Models for Multilingual Policy Topic Classification: The Babel Machine Approach
Sebok, Miklos
Mate, Akos
Ring, Orsolya
Kovacs, Viktor
Lehoczki, Richard
SOCIAL SCIENCE COMPUTER REVIEW, 2024,
[34] Enhanced Discriminative Fine-Tuning of Large Language Models for Chinese Text Classification
Song, Jinwang
Zan, Hongying
Zhang, Kunli
2024 INTERNATIONAL CONFERENCE ON ASIAN LANGUAGE PROCESSING, IALP 2024, 2024, : 168 - 174
[35] Leveraging Medical Knowledge Graphs and Large Language Models for Enhanced Mental Disorder Information Extraction
Park, Chaelim
Lee, Hayoung
Jeong, Ok-ran
FUTURE INTERNET, 2024, 16 (08)
[36] Leveraging large language models to construct feedback from medical multiple-choice Questions
Tomova, Mihaela
Rosello Atanet, Ivan
Sehy, Victoria
Sieg, Miriam
Maerz, Maren
Maeder, Patrick
SCIENTIFIC REPORTS, 2024, 14 (01):
[37] Leveraging Large Language Models for Automated Dialogue Analysis
Finch, Sarah E.
Paek, Ellie S.
Choi, Jinho D.
24TH MEETING OF THE SPECIAL INTEREST GROUP ON DISCOURSE AND DIALOGUE, SIGDIAL 2023, 2023, : 202 - 215
[38] Leveraging Large Language Models for Sensor Data Retrieval
Berenguer, Alberto
Morejon, Adriana
Tomas, David
Mazon, Jose-Norberto
APPLIED SCIENCES-BASEL, 2024, 14 (06):
[39] Leveraging Cognitive Science for Testing Large Language Models
Srinivasan, Ramya
Inakoshi, Hiroya
Uchino, Kanji
2023 IEEE INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE TESTING, AITEST, 2023, : 169 - 171
[40] Leveraging large language models for data analysis automation
Jansen, Jacqueline A.
Manukyan, Artur
Al Khoury, Nour
Akalin, Altuna
PLOS ONE, 2025, 20 (02):

← 1 2 3 4 5 →