ANeTCM: A Novel MRC Framework for Traditional Chinese Medicine Named Entity Recognition

Authors
Feng, Yuanyu [1]
Zhou, Yan [2]
Affiliations
[1] Guizhou Med Univ, Affiliated Hosp, Guiyang 550004, Peoples R China
[2] Fudan Univ, Anhui Prov Childrens Hosp, Dept Childrens Hlth Care, Childrens Hosp,Anhui Hosp, Hefei 230022, Peoples R China
Source
IEEE ACCESS | 2024 / Vol. 12
Keywords
Task analysis; Data models; Drugs; Predictive models; Named entity recognition; Solid modeling; Machine learning; Traditional Chinese medicine; named entity recognition; machine reading comprehension; gated linear units; normal distribution;
DOI
10.1109/ACCESS.2024.3444772
Chinese Library Classification
TP [Automation technology, computer technology];
Discipline classification code
0812 ;
Abstract
Traditional Chinese medicine (TCM) named entity recognition, which supports downstream tasks, is receiving increasing attention. However, mainstream named entity recognition models applied to the TCM domain still face two challenges: a lack of domain knowledge and an imbalance between entity classes. We therefore propose ANeTCM, a model that enhances domain knowledge and improves the balance between entity classes. Specifically, we first use a large corpus of TCM medical case data to continue pretraining RoBERTa and enhance its domain knowledge. Second, the sequence labeling task is converted into a machine reading comprehension (MRC) task, and gated linear units are incorporated to further enhance the model's feature learning capability. Finally, the sample weights are adjusted using a normal distribution to address the imbalance between entity classes. We conducted extensive experiments on two TCM named entity recognition datasets against several competitive models. The experimental results show the effectiveness of our model.
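The conversion of sequence labeling into an MRC task mentioned in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the entity types, the query wording, the BIO tagging scheme, and the function names `bio_to_spans` and `to_mrc_examples` are all assumptions made for illustration. The idea shown is only the general one, that each entity type becomes a natural-language query and the tagged entities become answer spans over the same context.

```python
from typing import Dict, List, Tuple

# Hypothetical queries, one per entity type. The abstract does not give
# the entity types or query wording, so these are illustrative only.
QUERIES: Dict[str, str] = {
    "HERB": "Find all herbs mentioned in the text.",
    "SYMPTOM": "Find all symptoms mentioned in the text.",
}


def bio_to_spans(tags: List[str]) -> List[Tuple[str, int, int]]:
    """Collect (entity_type, start, end) spans from BIO tags; end is inclusive."""
    spans: List[Tuple[str, int, int]] = []
    start, label = None, None
    # A trailing "O" sentinel flushes an entity that ends at the last token.
    for i, tag in enumerate(tags + ["O"]):
        if tag.startswith("I-") and label == tag[2:]:
            continue  # still inside the current entity
        if label is not None:
            spans.append((label, start, i - 1))
            start, label = None, None
        if tag.startswith("B-"):
            start, label = i, tag[2:]
    return spans


def to_mrc_examples(tokens: List[str], tags: List[str]) -> List[dict]:
    """Build one (query, context, answers) example per entity type."""
    spans = bio_to_spans(tags)
    examples = []
    for label, query in QUERIES.items():
        answers = [(s, e) for (l, s, e) in spans if l == label]
        examples.append(
            {"label": label, "query": query, "context": tokens, "answers": answers}
        )
    return examples


# Toy sentence, assumed for illustration (not from the paper's datasets).
tokens = ["astragalus", "root", "relieves", "fatigue"]
tags = ["B-HERB", "I-HERB", "O", "B-SYMPTOM"]
examples = to_mrc_examples(tokens, tags)
```

In this sketch an entity type with no mention in the sentence still yields an example, with an empty answer list, so a span-extraction model would also see negative queries.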
Pages: 113235 - 113243
Page count: 9
Related papers
50 in total
  • [31] An integrative approach to Chinese Named Entity recognition
    Huang, Degen
    Sun, Xiao
    ALPIT 2007: PROCEEDINGS OF THE 6TH INTERNATIONAL CONFERENCE ON ADVANCED LANGUAGE PROCESSING AND WEB INFORMATION TECHNOLOGY, 2007, : 171 - +
  • [32] Rembrandt - a named-entity recognition framework
    Cardoso, Nuno
    LREC 2012 - EIGHTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2012, : 1240 - 1243
  • [33] A hybrid model for Chinese named entity recognition
    Sun, Xiao
    Huang, Degen
    RECENT ADVANCE OF CHINESE COMPUTING TECHNOLOGIES, 2007, : 232 - 237
  • [34] Chinese Data Extraction and Named Entity Recognition
    Yang, Tingwei
    Jiang, Daguang
    Shi, Shenghui
    Than, Siyan
    Zhuo, Lin
    Yin, Yukang
    Liang, Zheng
    2020 5TH IEEE INTERNATIONAL CONFERENCE ON BIG DATA ANALYTICS (IEEE ICBDA 2020), 2020, : 105 - 109
  • [35] Bag of Tricks for Chinese Named Entity Recognition
    Xiao, Yao
    Peng, Jingbo
    Fu, Luoyi
    Zhang, Haisong
    2021 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2021,
  • [36] Named Entity Recognition Datasets: A Classification Framework
    Ying Zhang
    Gang Xiao
    International Journal of Computational Intelligence Systems, 17
  • [37] Multitask Learning for Chinese Named Entity Recognition
    Zhang, Qun
    Li, Zhenzhen
    Feng, Dawei
    Li, Dongsheng
    Huang, Zhen
    Peng, Yuxing
    ADVANCES IN MULTIMEDIA INFORMATION PROCESSING - PCM 2018, PT II, 2018, 11165 : 653 - 662
  • [38] Chinese named entity recognition: The state of the art
    Liu, Pan
    Guo, Yanming
    Wang, Fenglei
    Li, Guohui
    Neurocomputing, 2022, 473 : 37 - 53
  • [39] Product named entity recognition in Chinese text
    Zhao, Jun
    Liu, Feifan
    LANGUAGE RESOURCES AND EVALUATION, 2008, 42 (02) : 197 - 217
  • [40] CLASSIFICATION ATTENTION FOR CHINESE NAMED ENTITY RECOGNITION
    Cong, Kai
    Wang, Yunpeng
    Li, Tao
    Xu, Yanbin
    JOURNAL OF NONLINEAR AND CONVEX ANALYSIS, 2021, 22 (09) : 1675 - 1686