A CRFs-Based Approach Empowered with Word Representation Features to Learning Biomedical Named Entities from Medical Text

被引:1
|
作者
Xie, Wenxiu [1 ]
Fu, Sihui [1 ]
Jiang, Shengyi [1 ]
Hao, Tianyong [1 ,2 ]
机构
[1] Guangdong Univ Foreign Studies, Sch Informat Sci & Technol, Guangzhou, Guangdong, Peoples R China
[2] Guangdong Univ Foreign Studies, Collaborat Innovat Ctr 21st Century Maritime Silk, Guangzhou, Guangdong, Peoples R China
来源
基金
中国国家自然科学基金;
关键词
Biomedical Named Entity Recognition; CRFs; Word representation features; RECOGNITION;
D O I
10.1007/978-3-319-71084-6_61
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Targeting at identifying specific types of entities, biomedical named entity recognition is a fundamental task of biomedical text processing. This paper presents a CRFs-based approach to learning disease entities by identifying their boundaries in texts. Two types of word representation features are proposed and used including word embedding features and cluster-based features. In addition, an external disease dictionary feature is also explored in the learning process. Based on a publically available NCBI disease corpus, we evaluate the performance of the CRFs-based model with the combination of these word representation features. The results show that using these features can significantly improve BNER performance with an increase of 24.7% on F1 measure, demonstrating the effectiveness of the proposed features and the feature-empowered approach.
引用
收藏
页码:518 / 527
页数:10
相关论文
共 3 条
  • [1] CWI: A multimodal deep learning approach for named entity recognition from social media using character, word and image features
    Meysam Asgari-Chenaghlu
    M. Reza Feizi-Derakhshi
    Leili Farzinvash
    M. A. Balafar
    Cina Motamed
    Neural Computing and Applications, 2022, 34 : 1905 - 1922
  • [2] CWI: A multimodal deep learning approach for named entity recognition from social media using character, word and image features
    Asgari-Chenaghlu, Meysam
    Feizi-Derakhshi, M. Reza
    Farzinvash, Leili
    Balafar, M. A.
    Motamed, Cina
    NEURAL COMPUTING & APPLICATIONS, 2022, 34 (03): : 1905 - 1922
  • [3] Chinese Clinical Named Entity Recognition From Electronic Medical Records Based on Multisemantic Features by Using Robustly Optimized Bidirectional Encoder Representation From Transformers Pretraining Approach Whole Word Masking and Convolutional Neural Networks: Model Development and Validation
    Wang, Weijie
    Li, Xiaoying
    Ren, Huiling
    Gao, Dongping
    Fang, An
    JMIR MEDICAL INFORMATICS, 2023, 11