Leveraging pretrained language models for seizure frequency extraction from epilepsy evaluation reports

被引:0
|
作者
Rashmie Abeysinghe [1 ]
Shiqiang Tao [2 ]
Samden D. Lhatoo [1 ]
Guo-Qiang Zhang [2 ]
Licong Cui [1 ]
机构
[1] The University of Texas Health Science Center at Houston,Department of Neurology, McGovern Medical School
[2] The University of Texas Health Science Center at Houston,Texas Institute for Restorative Neurotechnologies
[3] The University of Texas Health Science Center at Houston,McWilliams School of Biomedical Informatics
关键词
D O I
10.1038/s41746-025-01592-4
中图分类号
学科分类号
摘要
Seizure frequency is essential for evaluating epilepsy treatment, ensuring patient safety, and reducing risk for Sudden Unexpected Death in Epilepsy. As this information is often described in clinical narratives, this study presents an approach to extracting structured seizure frequency details from such unstructured text. We investigated two tasks: (1) extracting phrases describing seizure frequency, and (2) extracting seizure frequency attributes. For both tasks, we fine-tuned three BERT-based models (bert-large-cased, biobert-large-cased, and Bio_ClinicalBERT), as well as three generative large language models (GPT-4, GPT-3.5 Turbo, and Llama-2-70b-hf). The final structured output integrated the results from both tasks. GPT-4 attained the best performance across all tasks with precision, recall, and F1-score of 86.61%, 85.04%, and 85.79% respectively for frequency phrase extraction; 90.23%, 93.51%, and 91.84% for seizure frequency attribute extraction; and 86.64%, 85.06%, and 85.82% for the final structured output. These findings highlight the potential of fine-tuned generative models in extractive tasks from limited text strings.
引用
收藏
相关论文
共 50 条
  • [1] Leveraging Frozen Pretrained Written Language Models for Neural Sign Language Translation
    De Coster, Mathieu
    Dambre, Joni
    INFORMATION, 2022, 13 (05)
  • [2] LEVERAGING ACOUSTIC AND LINGUISTIC EMBEDDINGS FROM PRETRAINED SPEECH AND LANGUAGE MODELS FOR INTENT CLASSIFICATION
    Sharma, Bidisha
    Madhavi, Maulik
    Li, Haizhou
    2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 7498 - 7502
  • [3] Pretrained Models and Evaluation Data for the Khmer Language
    Shengyi Jiang
    Sihui Fu
    Nankai Lin
    Yingwen Fu
    Tsinghua Science and Technology, 2022, 27 (04) : 709 - 718
  • [4] Pretrained models and evaluation data for the Khmer language
    Jiang, Shengyi
    Fu, Sihui
    Lin, Nankai
    Fu, Yingwen
    TSINGHUA SCIENCE AND TECHNOLOGY, 2022, 27 (04) : 709 - 718
  • [5] Leveraging Text-to-Text Pretrained Language Models for Question Answering in Chemistry
    Tran, Dan
    Pascazio, Laura
    Akroyd, Jethro
    Mosbach, Sebastian
    Kraft, Markus
    ACS OMEGA, 2024, 9 (12): : 13883 - 13896
  • [6] Constructing Taxonomies from Pretrained Language Models
    Chen, Catherine
    Lin, Kevin
    Klein, Dan
    2021 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL-HLT 2021), 2021, : 4687 - 4700
  • [7] Evaluation of Pretrained Large Language Models in Embodied Planning Tasks
    Sarkisyan, Christina
    Korchemnyi, Alexandr
    Kovalev, Alexey K.
    Panov, Aleksandr, I
    ARTIFICIAL GENERAL INTELLIGENCE, AGI 2023, 2023, 13921 : 222 - 232
  • [8] ElitePLM: An Empirical Study on General Language Ability Evaluation of Pretrained Language Models
    Li, Junyi
    Tang, Tianyi
    Gong, Zheng
    Yang, Lixin
    Yu, Zhuohao
    Chen, Zhipeng
    Wang, Jingyuan
    Zhao, Wayne Xin
    Wen, Ji-Rong
    NAACL 2022: THE 2022 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES, 2022, : 3519 - 3539
  • [9] IIITT at CASE 2021 Task 1: Leveraging Pretrained Language Models for Multilingual Protest Detection
    Jada, Pawan Kalyan
    Reddy, Duddukunta Sashidhar
    Hande, Adeep
    Priyadharshini, Ruba
    Sakuntharaj, Ratnasingam
    Chakravarthi, Bharathi Raja
    CASE 2021: THE 4TH WORKSHOP ON CHALLENGES AND APPLICATIONS OF AUTOMATED EXTRACTION OF SOCIO-POLITICAL EVENTS FROM TEXT (CASE), 2021, : 98 - 104
  • [10] LEVERAGING LARGE LANGUAGE MODELS FOR CARDIOVASCULAR MORTALITY PREDICTION FROM CT CHEST REPORTS
    James, Jose
    Brooks, Hunter
    Kullo, Iftikhar J.
    JOURNAL OF THE AMERICAN COLLEGE OF CARDIOLOGY, 2024, 83 (13) : 2441 - 2441