Assertion Detection in Clinical Natural Language Processing using Large Language Models

Authors
Ji, Yuelyu [1]
Yu, Zeshui [2]
Wang, Yanshan [3]
Affiliations
[1] Univ Pittsburgh, Dept Comp & Informat, Pittsburgh, PA 15260 USA
[2] Univ Pittsburgh, Dept Pharmaceut Sci, Pittsburgh, PA USA
[3] Univ Pittsburgh, Dept Hlth Informat Management, Pittsburgh, PA USA
Funding
National Institutes of Health (NIH), USA
Keywords
Assertion Detection; Large Language Model; In-context Learning; LoRA Fine-tuning
DOI
10.1109/ICHI61247.2024.00039
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
In this study, we address the task of assertion detection when extracting medical concepts from clinical notes, a key process in clinical natural language processing (NLP). Assertion detection in clinical NLP usually involves identifying assertion types for medical concepts in clinical text, namely certainty (whether the medical concept is positive, negated, possible, or hypothetical), temporality (whether the medical concept refers to the present or to past history), and experiencer (whether the medical concept is described for the patient or a family member). These assertion types are essential for healthcare professionals to quickly and clearly understand the context of medical conditions from unstructured clinical texts, directly influencing the quality and outcomes of patient care. Although widely used, traditional methods, particularly rule-based NLP systems and machine learning or deep learning models, demand intensive manual effort to create patterns and tend to overlook less common assertion types, leading to an incomplete understanding of the context. To address this challenge, our research introduces a novel methodology that utilizes Large Language Models (LLMs) pre-trained on a vast array of medical data for assertion detection. We enhanced the current method with advanced reasoning techniques, including Tree of Thought (ToT), Chain of Thought (CoT), and Self-Consistency (SC), and refined it further with Low-Rank Adaptation (LoRA) fine-tuning. We first evaluated the model on the i2b2 2010 assertion dataset. Our method achieved a micro-averaged F1 score of 0.89, an improvement of 0.11 over previous work. To further assess the generalizability of our approach, we extended our evaluation to a local dataset focused on sleep concept extraction. Our approach achieved an F1 score of 0.74, which is 0.31 higher than the previous method. The results show that using LLMs is a viable option for assertion detection in clinical NLP and that this approach can potentially be integrated with other LLM-based concept extraction models for clinical NLP tasks.
Pages: 242-247
Number of pages: 6