An imConvNet-based deep learning model for Chinese medical named entity recognition

被引：4

作者：

Zheng, Yuchen ^{[1
]}

Han, Zhenggong ^{[2
]}

Cai, Yimin ^{[1
]}

Duan, Xubo ^{[1
]}

Sun, Jiangling ^{[3
]}

Yang, Wei ^{[1
]}

Huang, Haisong ^{[2
]}

机构：

[1] Guizhou Univ, Med Coll, Guiyang 550025, Guizhou, Peoples R China

[2] Guizhou Univ, Key Lab Adv Mfg Technol, Minist Educ, Guiyang 550025, Guizhou, Peoples R China

[3] Guiyang Hosp Stomatol, Guiyang 550002, Guizhou, Peoples R China

来源：

BMC MEDICAL INFORMATICS AND DECISION MAKING | 2022年 / 22卷 / 01期

关键词：

Named entity recognition; Convolutional neural network; Chinese electronic medical records; BiLSTM-CRF; BERT; BIG DATA; HEALTH; CARE;

D O I：

10.1186/s12911-022-02049-4

中图分类号：

R-058 [];

学科分类号：

摘要：

Background With the development of current medical technology, information management becomes perfect in the medical field. Medical big data analysis is based on a large amount of medical and health data stored in the electronic medical system, such as electronic medical records and medical reports. How to fully exploit the resources of information included in these medical data has always been the subject of research by many scholars. The basis for text mining is named entity recognition (NER), which has its particularities in the medical field, where issues such as inadequate text resources and a large number of professional domain terms continue to face significant challenges in medical NER. Methods We improved the convolutional neural network model (imConvNet) to obtain additional text features. Concurrently, we continue to use the classical Bert pre-training model and BiLSTM model for named entity recognition. We use imConvNet model to extract additional word vector features and improve named entity recognition accuracy. The proposed model, named BERT-imConvNet-BiLSTM-CRF, is composed of four layers: BERT embedding layer-getting word embedding vector; imConvNet layer-capturing the context feature of each character; BiLSTM (Bidirectional Long Short-Term Memory) layer-capturing the long-distance dependencies; CRF (Conditional Random Field) layer-labeling characters based on their features and transfer rules. Results The average F1 score on the public medical data set yidu-s4k reached 91.38% when combined with the classical model; when real electronic medical record text in impacted wisdom teeth is used as the experimental object, the model's F1 score is 93.89%. They all show better results than classical models. Conclusions The suggested novel model (imConvNet) significantly improves the recognition accuracy of Chinese medical named entities and applies to various medical corpora.

引用

页数：12

共 50 条

[41] Named Entity Recognition in Chinese Electronic Medical Records Based on CRF
Liu, Kaixin
Hu, Qingcheng
Liu, Jianwei
Xing, Chunxiao
2017 14TH WEB INFORMATION SYSTEMS AND APPLICATIONS CONFERENCE (WISA 2017), 2017, : 105 - 110
[42] A Chinese Medical Named Entity Recognition Method Based on Glyph Features
Meng, Wei-Lun
Guo, Jing-Feng
Xing, Ke-Xuan
Wei, Ning
Wang, Qiao-Suo
Liu, Bin
Tien Tzu Hsueh Pao/Acta Electronica Sinica, 2024, 52 (06): : 1945 - 1954
[43] Deep adaptation of CNN in Chinese named entity recognition
Lv, Yana
Qin, Xutong
Du, Xiuli
Qiu, Shaoming
ENGINEERING REPORTS, 2023, 5 (06)
[44] A hybrid model for Chinese named entity recognition
Sun, Xiao
Huang, Degen
RECENT ADVANCE OF CHINESE COMPUTING TECHNOLOGIES, 2007, : 232 - 237
[45] A Research Toward Chinese Named Entity Recognition Based on Transfer Learning
Hui Kang
Jingwu Xiao
Yunpeng Zhang
Lei Zhang
Xu Zhao
Tie Feng
International Journal of Computational Intelligence Systems, 16
[46] CRF-based Active Learning for Chinese Named Entity Recognition
Yao, Lin
Sun, Chengjie
Li, Shaofeng
Wang, Xiaolong
Wang, Xuan
2009 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN AND CYBERNETICS (SMC 2009), VOLS 1-9, 2009, : 1557 - +
[47] A Research Toward Chinese Named Entity Recognition Based on Transfer Learning
Kang, Hui
Xiao, Jingwu
Zhang, Yunpeng
Zhang, Lei
Zhao, Xu
Feng, Tie
INTERNATIONAL JOURNAL OF COMPUTATIONAL INTELLIGENCE SYSTEMS, 2023, 16 (01)
[48] Improving dictionary-based named entity recognition with deep learning
Nastou, Katerina
Koutrouli, Mikaela
Pyysalo, Sampo
Jensen, Lars Juhl
BIOINFORMATICS, 2024, 40 : ii45 - ii52
[49] A deep learning model incorporating part of speech and self-matching attention for named entity recognition of Chinese electronic medical records
Cai, Xiaoling
Dong, Shoubin
Hu, Jinlong
BMC MEDICAL INFORMATICS AND DECISION MAKING, 2019, 19 (Suppl 2)
[50] A deep learning model incorporating part of speech and self-matching attention for named entity recognition of Chinese electronic medical records
Xiaoling Cai
Shoubin Dong
Jinlong Hu
BMC Medical Informatics and Decision Making, 19

← 1 2 3 4 5 →