An imConvNet-based deep learning model for Chinese medical named entity recognition

Cited by: 4
Authors
Zheng, Yuchen [1]
Han, Zhenggong [2]
Cai, Yimin [1]
Duan, Xubo [1]
Sun, Jiangling [3]
Yang, Wei [1]
Huang, Haisong [2]
Affiliations
[1] Guizhou Univ, Med Coll, Guiyang 550025, Guizhou, Peoples R China
[2] Guizhou Univ, Key Lab Adv Mfg Technol, Minist Educ, Guiyang 550025, Guizhou, Peoples R China
[3] Guiyang Hosp Stomatol, Guiyang 550002, Guizhou, Peoples R China
Keywords
Named entity recognition; Convolutional neural network; Chinese electronic medical records; BiLSTM-CRF; BERT; BIG DATA; HEALTH; CARE
DOI
10.1186/s12911-022-02049-4
Chinese Library Classification (CLC) number
R-058
Abstract
Background: Advances in medical technology have been accompanied by increasingly mature information management in the medical field. Medical big data analysis relies on the large volume of medical and health data stored in electronic medical systems, such as electronic medical records and medical reports. How to fully exploit the information contained in these data has long been a subject of research. Named entity recognition (NER) is the basis for text mining, and it has particular difficulties in the medical field: scarce text resources and a large number of specialized domain terms continue to pose significant challenges for medical NER.
Methods: We improved the convolutional neural network model (imConvNet) to obtain additional text features, while retaining the classical BERT pre-training model and BiLSTM model for named entity recognition. The imConvNet model extracts additional word vector features and improves named entity recognition accuracy. The proposed model, named BERT-imConvNet-BiLSTM-CRF, is composed of four layers: a BERT embedding layer, which produces the word embedding vectors; an imConvNet layer, which captures the context features of each character; a BiLSTM (Bidirectional Long Short-Term Memory) layer, which captures long-distance dependencies; and a CRF (Conditional Random Field) layer, which labels characters based on their features and transition rules.
Results: The average F1 score on the public medical data set yidu-s4k reached 91.38% when imConvNet was combined with the classical model; on real electronic medical record text for impacted wisdom teeth, the model's F1 score was 93.89%. Both results are better than those of the classical models.
Conclusions: The proposed model (imConvNet) significantly improves the recognition accuracy of Chinese medical named entities and is applicable to various medical corpora.
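The abstract describes the CRF layer as labeling characters from their features together with transition rules between labels. The sketch below illustrates that idea with a plain Viterbi decoder over per-character scores. It is not the authors' implementation: the function name `viterbi_decode`, the three-tag B/I/O scheme, and all score values are assumptions chosen only for illustration, and a trained model would learn the emission and transition scores.

```python
from typing import Dict, List

def viterbi_decode(
    emissions: List[Dict[str, float]],
    transitions: Dict[str, Dict[str, float]],
    tags: List[str],
) -> List[str]:
    """Return the highest-scoring tag sequence for one sentence.

    emissions[i][t]   : score of tag t for character i (e.g. BiLSTM output)
    transitions[a][b] : score of moving from tag a to tag b
    """
    if not emissions:
        return []
    # best[t] = best total score of any path that ends in tag t so far
    best = {t: emissions[0][t] for t in tags}
    backpointers = []
    for scores in emissions[1:]:
        new_best, pointers = {}, {}
        for cur in tags:
            # Pick the previous tag that maximizes path score into `cur`
            prev = max(tags, key=lambda p: best[p] + transitions[p][cur])
            new_best[cur] = best[prev] + transitions[prev][cur] + scores[cur]
            pointers[cur] = prev
        best = new_best
        backpointers.append(pointers)
    # Trace back from the best final tag to recover the full path
    path = [max(tags, key=lambda t: best[t])]
    for pointers in reversed(backpointers):
        path.append(pointers[path[-1]])
    path.reverse()
    return path

# Hypothetical toy scores (assumed values, not taken from the paper)
TAGS = ["B", "I", "O"]
FORBIDDEN = -1e4  # an "I" tag may not directly follow "O"
TRANSITIONS = {
    "B": {"B": 0.0, "I": 1.0, "O": 0.0},
    "I": {"B": 0.0, "I": 0.5, "O": 0.0},
    "O": {"B": 0.0, "I": FORBIDDEN, "O": 0.0},
}
EMISSIONS = [
    {"B": 0.2, "I": 0.1, "O": 1.0},
    {"B": 0.4, "I": 0.9, "O": 0.3},
    {"B": 0.1, "I": 0.8, "O": 0.2},
]

print(viterbi_decode(EMISSIONS, TRANSITIONS, TAGS))  # ['B', 'I', 'I']
```

Picking the best tag for each character independently would give O, I, I here, which breaks the rule that an entity must open with B. Decoding over the whole sequence with transition scores yields B, I, I instead, which is the kind of constraint a CRF layer is meant to enforce. The sketch does not model start or end transitions.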
Pages: 12