Leveraging pretrained language models for seizure frequency extraction from epilepsy evaluation reports

被引:0
|
作者
Rashmie Abeysinghe [1 ]
Shiqiang Tao [2 ]
Samden D. Lhatoo [1 ]
Guo-Qiang Zhang [2 ]
Licong Cui [1 ]
机构
[1] The University of Texas Health Science Center at Houston,Department of Neurology, McGovern Medical School
[2] The University of Texas Health Science Center at Houston,Texas Institute for Restorative Neurotechnologies
[3] The University of Texas Health Science Center at Houston,McWilliams School of Biomedical Informatics
关键词
D O I
10.1038/s41746-025-01592-4
中图分类号
学科分类号
摘要
Seizure frequency is essential for evaluating epilepsy treatment, ensuring patient safety, and reducing risk for Sudden Unexpected Death in Epilepsy. As this information is often described in clinical narratives, this study presents an approach to extracting structured seizure frequency details from such unstructured text. We investigated two tasks: (1) extracting phrases describing seizure frequency, and (2) extracting seizure frequency attributes. For both tasks, we fine-tuned three BERT-based models (bert-large-cased, biobert-large-cased, and Bio_ClinicalBERT), as well as three generative large language models (GPT-4, GPT-3.5 Turbo, and Llama-2-70b-hf). The final structured output integrated the results from both tasks. GPT-4 attained the best performance across all tasks with precision, recall, and F1-score of 86.61%, 85.04%, and 85.79% respectively for frequency phrase extraction; 90.23%, 93.51%, and 91.84% for seizure frequency attribute extraction; and 86.64%, 85.06%, and 85.82% for the final structured output. These findings highlight the potential of fine-tuned generative models in extractive tasks from limited text strings.
引用
收藏
相关论文
共 50 条
  • [31] Evaluation of Cannabidiol in Animal Seizure Models by the Epilepsy Therapy Screening Program (ETSP)
    Klein, Brian D.
    Jacobson, Catherine A.
    Metcalf, Cameron S.
    Smith, Misty D.
    Wilcox, Karen S.
    Hampson, Aidan J.
    Kehne, John H.
    NEUROCHEMICAL RESEARCH, 2017, 42 (07) : 1939 - 1948
  • [32] Leveraging natural language processing for efficient information extraction from breast cancer pathology reports: Single-institution study
    Park, Phillip
    Choi, Yeonho
    Han, Nayoung
    Park, Ye-Lin
    Hwang, Juyeon
    Chae, Heejung
    Yoo, Chong Woo
    Choi, Kui Son
    Kim, Hyun-Jin
    PLOS ONE, 2025, 20 (02):
  • [33] Bridging the Gap: A Hybrid Approach to Medical Relation Extraction Using Pretrained Language Models and Traditional Machine Learning
    Hassan, Nesma A.
    Seoud, Rania A. Abul
    Salem, Dina A.
    JOURNAL OF ADVANCES IN INFORMATION TECHNOLOGY, 2024, 15 (06) : 723 - 734
  • [34] Using Large Pretrained Language Models for Answering User Queries from Product Specifications
    Roy, Kalyani
    Shah, Smit
    Pai, Nithish
    Ramtej, Jaidam
    Nadkarn, Prajit Prashant
    Banerjee, Jyotirmoy
    Goyal, Pawan
    Kumar, Surender
    WORKSHOP ON E-COMMERCE AND NLP (ECNLP 3), 2020, : 35 - 39
  • [35] Unsupervised and Few-Shot Parsing from Pretrained Language Models (Extended Abstract)
    Zeng, Zhiyuan
    Xiong, Deyi
    PROCEEDINGS OF THE THIRTY-SECOND INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2023, 2023, : 6995 - 7000
  • [36] X-FACTR: Multilingual Factual Knowledge Retrieval from Pretrained Language Models
    Zhengbao, Jiang
    Anastasopoulos, Antonios
    Jun, Araki
    Haibo, Ding
    Neubig, Graham
    PROCEEDINGS OF THE 2020 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP), 2020, : 5943 - 5959
  • [37] Automating surgical procedure extraction for society of surgeons adult cardiac surgery registry using pretrained language models
    Lee, Jaehyun
    Sharma, Ishan
    Arcaro, Nichole
    Blackstone, Eugene H.
    Gillinov, A. Marc
    Svensson, Lars G.
    Karamlou, Tara
    Chen, David
    JAMIA OPEN, 2024, 7 (03)
  • [38] A Systematic Approach to Prompting Large Language Models for Automated Feature Extraction from Cardiovascular Imaging Reports
    Goldfinger, Shir
    Mackay, Emily
    Chan, Trevor
    Eswar, Vikram
    Grasfield, Rachel
    Yan, Vivian
    Barreto, David
    Pouch, Alison
    CIRCULATION, 2024, 150
  • [39] Evaluation of the Seizure Frequency and Severity in Patients with Epilepsy Who Had COVID-19
    Guclu Altun, Ilknur
    Koc, Guray
    Ozen Barut, Banu
    Gokcil, Zeki
    EPILEPSI, 2021, 27 (03): : 163 - 170
  • [40] bitsa_nlp@LT-EDI-ACL2022: Leveraging Pretrained Language Models for Detecting Homophobia and Transphobia in Social Media Comments
    Bhandari, Vitthal
    Goyal, Poonam
    PROCEEDINGS OF THE SECOND WORKSHOP ON LANGUAGE TECHNOLOGY FOR EQUALITY, DIVERSITY AND INCLUSION (LTEDI 2022), 2022, : 149 - 154