Improving the Named Entity Recognition of Chinese Electronic Medical Records by Combining Domain Dictionary and Rules

被引:19
|
作者
Chen, Xianglong [1 ]
Ouyang, Chunping [1 ]
Liu, Yongbin [1 ]
Bu, Yi [2 ]
机构
[1] Univ South China, Sch Comp, Hengyang 421001, Peoples R China
[2] Indiana Univ, Luddy Sch Informat Comp & Engn, Ctr Complex Networks & Syst Res, Bloomington, IN 47408 USA
基金
中国国家自然科学基金;
关键词
entity recognition; electronic medical records; Bi-LSTM-CRF; rules; domain dictionary;
D O I
10.3390/ijerph17082687
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
Electronic medical records are an integral part of medical texts. Entity recognition of electronic medical records has triggered many studies that propose many entity extraction methods. In this paper, an entity extraction model is proposed to extract entities from Chinese Electronic Medical Records (CEMR). In the input layer of the model, we use word embedding and dictionary features embedding as input vectors, where word embedding consists of a character representation and a word representation. Then, the input vectors are fed to the bidirectional long short-term memory to capture contextual features. Finally, a conditional random field is employed to capture dependencies between neighboring tags. We performed experiments on body classification task, and the F1 values reached 90.65%. We also performed experiments on anatomic region recognition task, and the F1 values reached 93.89%. On both tasks, our model had higher performance than state-of-the-art models, such as Bi-LSTM-CRF, Bi-LSTM-Attention, and Vote. Through experiments, our model has a good effect when dealing with small frequency entities and unknown entities; with a small training dataset, our method showed 2-4% improvement on F1 value compared to the basic Bi-LSTM-CRF models. Additionally, on anatomic region recognition task, besides using our proposed entity extraction model, 12 rules we designed and domain dictionary were adopted. Then, in this task, the weighted F1 value of the three specific entities extraction reached 84.36%.
引用
收藏
页数:16
相关论文
共 50 条
  • [41] Named Entity Recognition in Electronic Health Records: A Methodological Review
    Durango, Maria C.
    Torres-Silva, Ever A.
    Orozco-Duque, Andres
    HEALTHCARE INFORMATICS RESEARCH, 2023, 29 (04) : 286 - 300
  • [42] Advances in Named Entity Recognition in Electronic Medical Record
    Liu, Andong
    Peng, Lin
    Ye, Qing
    Du, Jianqiang
    Cheng, Chunlei
    Zha, Qinglin
    Computer Engineering and Applications, 2023, 59 (21) : 39 - 51
  • [43] An RG-FLAT-CRF Model for Named Entity Recognition of Chinese Electronic Clinical Records
    Li, Jiakang
    Liu, Ruixia
    Chen, Changfang
    Zhou, Shuwang
    Shang, Xiaoyi
    Wang, Yinglong
    ELECTRONICS, 2022, 11 (08)
  • [44] Named Entity Recognition for Chinese Electronic Medical Record by Fusing Semantic and Boundary Information
    Cui S.
    Chen J.
    Li X.
    Dianzi Keji Daxue Xuebao/Journal of the University of Electronic Science and Technology of China, 2022, 51 (04): : 565 - 571
  • [45] Chinese electronic medical record named entity recognition algorithm based on transfer learning
    Li, Yi
    Liu, Jianyi
    Zhang, Ru
    BASIC & CLINICAL PHARMACOLOGY & TOXICOLOGY, 2019, 125 : 19 - 20
  • [46] A BiLSTM-CRF Method to Chinese Electronic Medical Record Named Entity Recognition
    Ji, Bin
    Liu, Rui
    Li, ShaSha
    Tang, JinTao
    Yu, Jie
    Li, Qian
    Xu, WeiSang
    2018 INTERNATIONAL CONFERENCE ON ALGORITHMS, COMPUTING AND ARTIFICIAL INTELLIGENCE (ACAI 2018), 2018,
  • [47] A Named Entity Recognition Approach for Electronic Medical Records Using BERT Semantic Enhancement and BiLSTM
    Lai, Xuewei
    Jie, Qingqing
    INTERNATIONAL JOURNAL ON SEMANTIC WEB AND INFORMATION SYSTEMS, 2023, 19 (01)
  • [48] A Multiclass Classification Method Based on Deep Learning for Named Entity Recognition in Electronic Medical Records
    Dong, Xishuang
    Qian, Lijun
    Guan, Yi
    Huang, Lei
    Yu, Qiubin
    Yang, Jinfeng
    2016 NEW YORK SCIENTIFIC DATA SUMMIT (NYSDS), 2016,
  • [49] Improving dictionary-based named entity recognition with deep learning
    Nastou, Katerina
    Koutrouli, Mikaela
    Pyysalo, Sampo
    Jensen, Lars Juhl
    BIOINFORMATICS, 2024, 40 : ii45 - ii52
  • [50] The Study of Named Entity Identification in Chinese Electronic Medical Records Based on Multi-tasking
    Guo, Hong
    Yan, Jinfang
    KNOWLEDGE SCIENCE, ENGINEERING AND MANAGEMENT, PT IV, KSEM 2024, 2024, 14887 : 288 - 300