Improving the Named Entity Recognition of Chinese Electronic Medical Records by Combining Domain Dictionary and Rules

Cited by: 19
Authors
Chen, Xianglong [1]
Ouyang, Chunping [1]
Liu, Yongbin [1]
Bu, Yi [2]
Affiliations
[1] Univ South China, Sch Comp, Hengyang 421001, Peoples R China
[2] Indiana Univ, Luddy Sch Informat Comp & Engn, Ctr Complex Networks & Syst Res, Bloomington, IN 47408 USA
Funding
National Natural Science Foundation of China
Keywords
entity recognition; electronic medical records; Bi-LSTM-CRF; rules; domain dictionary;
DOI
10.3390/ijerph17082687
Chinese Library Classification number
X [Environmental Science, Safety Science]
Subject classification code
08; 0830
Abstract
Electronic medical records are an integral part of medical texts. Entity recognition in electronic medical records has prompted many studies and many proposed entity extraction methods. In this paper, we propose an entity extraction model for Chinese Electronic Medical Records (CEMR). In the input layer, the model takes word embeddings and dictionary feature embeddings as input vectors, where each word embedding consists of a character representation and a word representation. The input vectors are then fed to a bidirectional long short-term memory (Bi-LSTM) network to capture contextual features. Finally, a conditional random field (CRF) captures dependencies between neighboring tags. On the body classification task, the F1 value reached 90.65%; on the anatomic region recognition task, it reached 93.89%. On both tasks, our model outperformed state-of-the-art models such as Bi-LSTM-CRF, Bi-LSTM-Attention, and Vote. The experiments show that our model performs well on low-frequency entities and unknown entities; with a small training dataset, our method improved the F1 value by 2-4% over the basic Bi-LSTM-CRF models. Additionally, on the anatomic region recognition task, we combined the proposed entity extraction model with 12 rules we designed and a domain dictionary; in this task, the weighted F1 value for extracting the three specific entities reached 84.36%.
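To make the "dictionary features" idea in the abstract concrete, here is a minimal Python sketch of one common way such features are derived: each character is tagged by its position inside a dictionary match found with forward maximum matching. The abstract does not specify the authors' matching strategy or tag scheme, so the function name `dictionary_features`, the B/I/E/S/O tag set, the `max_len` parameter, and the toy lexicon are all assumptions made for illustration.

```python
from typing import List, Set


def dictionary_features(chars: str, lexicon: Set[str], max_len: int = 4) -> List[str]:
    """Tag each character by its position in a lexicon match (B/I/E/S), else O.

    Illustrative only: forward maximum matching is an assumed strategy,
    not necessarily the one used in the paper.
    """
    tags = ["O"] * len(chars)
    i = 0
    while i < len(chars):
        matched = 0
        # Try the longest candidate first (forward maximum matching).
        for n in range(min(max_len, len(chars) - i), 0, -1):
            if chars[i:i + n] in lexicon:
                matched = n
                break
        if matched == 0:
            i += 1  # no dictionary entry starts here; leave the tag as O
            continue
        if matched == 1:
            tags[i] = "S"  # single-character entry
        else:
            tags[i] = "B"  # beginning of a multi-character entry
            for j in range(i + 1, i + matched - 1):
                tags[j] = "I"  # inside
            tags[i + matched - 1] = "E"  # end
        i += matched
    return tags


# Hypothetical toy domain dictionary of anatomical terms.
toy_lexicon = {"胃", "十二指肠", "肝"}
print(dictionary_features("胃及十二指肠未见异常", toy_lexicon))
```

In a model like the one described, each tag would typically be mapped to a small trainable embedding and concatenated with the character and word representations before the Bi-LSTM layer; that wiring is likewise an assumption here rather than a detail taken from the abstract.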
Pages: 16