Identifying Medical Named Entities with Word Information

Authors
Ben Y. [1]
Pang X. [2]
Affiliations
[1] School of Mathematics and Statistics, Huazhong University of Science and Technology, Wuhan
[2] Archives of Wuhan University of Science and Technology, Wuhan
Funding
National Natural Science Foundation of China
Keywords
Chinese Named Entity Recognition; MacBERT; Online Medical Consultation; Weighted Cross Entropy; Word Information Embedding;
DOI
10.11925/infotech.2096-3467.2022.0547
Abstract
[Objective] This paper uses word information to identify and infer the key clinical features in online consultation records and to address the difficulty of recognizing named-entity boundaries. [Methods] First, we constructed a new model based on MacBERT and conditional random fields. Then, we embedded word position and part of speech as word information, and used speaker role embedding to represent the dialogue text information. Finally, we used weighted multi-class cross-entropy to address the imbalance among entity categories. [Results] We conducted an empirical study with online consultation records from Chunyu Doctor. The F1 value of the proposed model on the named entity recognition task was 74.35%, nearly 2% higher than that obtained by using the MacBERT model directly. [Limitations] We did not design a specific model for Chinese word segmentation. [Conclusions] Our new model, which incorporates features from more dimensions, effectively improves the recognition of key clinical-finding features. © 2023 Data Analysis and Knowledge Discovery. All rights reserved.
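The abstract does not say how the class weights in the weighted multi-class cross-entropy were chosen. As a minimal sketch of the general idea only, the Python below assumes inverse-frequency weights and a loss normalized by the sum of weights; the function names and the tag labels ("O", "B-SYM") are hypothetical and are not taken from the paper.

```python
import math
from collections import Counter


def inverse_frequency_weights(labels):
    """Assumed weighting scheme: total / (num_classes * count), so rarer classes weigh more."""
    counts = Counter(labels)
    total = len(labels)
    return {cls: total / (len(counts) * n) for cls, n in counts.items()}


def weighted_cross_entropy(probs, labels, weights):
    """Weighted mean of -log p(true class), normalized by the sum of the weights.

    probs  : list of dicts mapping class -> predicted probability for one token
    labels : list of true classes, one per token
    weights: dict mapping class -> weight
    """
    weighted_sum = 0.0
    weight_total = 0.0
    for p, y in zip(probs, labels):
        w = weights[y]
        weighted_sum += -w * math.log(p[y])
        weight_total += w
    return weighted_sum / weight_total


# Hypothetical toy batch: three "O" tokens and one rare entity token.
labels = ["O", "O", "O", "B-SYM"]
probs = [
    {"O": 0.9, "B-SYM": 0.1},
    {"O": 0.9, "B-SYM": 0.1},
    {"O": 0.9, "B-SYM": 0.1},
    {"O": 0.9, "B-SYM": 0.1},  # the rare entity token is misclassified
]

weights = inverse_frequency_weights(labels)
uniform = {cls: 1.0 for cls in weights}

weighted_loss = weighted_cross_entropy(probs, labels, weights)
unweighted_loss = weighted_cross_entropy(probs, labels, uniform)
```

In this toy batch the missed rare-class token contributes more to the weighted loss than to the unweighted one, which is the effect such a weighting is meant to have on imbalanced entity categories.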
Pages: 123-132
Page count: 9