Mining heart disease risk factors in clinical text with named entity recognition and distributional semantic models

Cited by: 18
Author
Urbain, Jay [1, 2]
Affiliations
[1] Milwaukee Sch Engn, Milwaukee, WI 53202 USA
[2] Med Coll Wisconsin, CTSI SE Wisconsin, Milwaukee, WI 53226 USA
Funding
National Institutes of Health (USA)
Keywords
Biomedical text mining; Clinical informatics; Translational research; Natural language processing; Named entity recognition; Distributional semantic models; Heart disease risk factors; Diabetes
DOI
10.1016/j.jbi.2015.08.009
Chinese Library Classification
TP39 [Computer applications]
Discipline classification codes
081203; 0835
Abstract
We present the design and analyze the performance of a multi-stage natural language processing system that employs named entity recognition, Bayesian statistics, and rule logic to identify and characterize heart disease risk factor events in diabetic patients over time. The system was originally developed for the 2014 i2b2 Challenges in Natural Language in Clinical Data. The system's main strength was its high accuracy in identifying named entities associated with heart disease risk factor events. Its primary weakness was inaccuracy in characterizing the attributes of some events: for example, determining the relative time of an event with respect to the record date, deciding whether an event is attributable to the patient's history or the patient's family history, and differentiating between current and prior smoking status. We believe these inaccuracies were due in large part to the lack of an effective approach for integrating context into our event detection model. To address them, we explore the addition of a distributional semantic model for characterizing contextual evidence of heart disease risk factor events. Using this semantic model, we raised our initial 2014 i2b2 Challenges in Natural Language in Clinical Data F1 score from 0.838 to 0.890 and increased precision by 10.3% without using any lexicons that might bias our results. © 2015 Elsevier Inc. All rights reserved.
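The abstract does not spell out how the distributional semantic model represents context, so the following is only a toy illustration of the general idea, not the author's system. It assumes a simple bag-of-words context window around an entity mention and cosine similarity against per-label prototype vectors; the function names, the window size, and the two training sentences are all hypothetical.

```python
from collections import Counter
from math import sqrt


def context_vector(tokens, index, window=3):
    """Count the tokens within `window` positions of tokens[index] (mention excluded)."""
    lo = max(0, index - window)
    hi = min(len(tokens), index + window + 1)
    return Counter(t for i, t in enumerate(tokens[lo:hi], start=lo) if i != index)


def cosine(a, b):
    """Cosine similarity between two sparse count vectors."""
    dot = sum(a[k] * b[k] for k in a.keys() & b.keys())
    norm = sqrt(sum(v * v for v in a.values())) * sqrt(sum(v * v for v in b.values()))
    return dot / norm if norm else 0.0


def build_prototypes(examples, window=3):
    """Sum the context vectors of labeled mentions into one prototype per label."""
    prototypes = {}
    for label, tokens, index in examples:
        prototypes.setdefault(label, Counter()).update(
            context_vector(tokens, index, window)
        )
    return prototypes


def classify(tokens, index, prototypes, window=3):
    """Return the label whose prototype is most similar to the mention's context."""
    vec = context_vector(tokens, index, window)
    return max(prototypes, key=lambda label: cosine(vec, prototypes[label]))


# Hypothetical labeled mentions: (label, tokens, index of the risk factor mention).
TRAINING = [
    ("patient", "patient has a history of diabetes".split(), 5),
    ("family", "mother had diabetes and father had stroke".split(), 2),
]

prototypes = build_prototypes(TRAINING)
print(classify("her mother had diabetes".split(), 3, prototypes))
```

A real system would learn such vectors from a large clinical corpus and combine the similarity score with the named entity recognition and rule stages; this sketch only shows how surrounding words can separate a patient-history mention from a family-history one.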
Pages: S143-S149
Page count: 7