A Chinese Nested Named Entity Recognition Model for Chicken Disease Based on Multiple Fine-Grained Feature Fusion and Efficient Global Pointer

被引:0
|
作者
Wang, Xiajun [1 ,2 ]
Peng, Cheng [1 ,3 ,4 ]
Li, Qifeng [1 ,3 ,4 ]
Yu, Qinyang [1 ,3 ,4 ]
Lin, Liqun [2 ]
Li, Pingping [1 ,2 ]
Gao, Ronghua [1 ,3 ,4 ]
Wu, Wenbiao [1 ,3 ,4 ]
Jiang, Ruixiang [1 ,3 ,4 ]
Yu, Ligen [1 ,3 ,4 ]
Ding, Luyu [1 ,3 ,4 ]
Zhu, Lei [1 ,3 ,4 ]
机构
[1] Beijing Acad Agr & Forestry Sci, Informat Technol Res Ctr, Beijing 100097, Peoples R China
[2] Hubei Univ, Fac Resources & Environm Sci, Wuhan 430061, Peoples R China
[3] Natl Innovat Ctr Digital Technol Anim Husb, Beijing 100097, Peoples R China
[4] Natl Engn Res Ctr Informat Technol Agr, Beijing 100097, Peoples R China
来源
APPLIED SCIENCES-BASEL | 2024年 / 14卷 / 18期
关键词
nested named entity recognition; chicken disease; multiple fine-grained feature fusion; RoBERTa; efficient global pointer; NETWORK;
D O I
10.3390/app14188495
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
Featured Application This study proposes a multiple fine-grained nested named entity recognition model, which provides a solution for other specialized fields and lays the foundation for subsequent knowledge graph construction and intelligent inquiry system construction.Abstract Extracting entities from large volumes of chicken epidemic texts is crucial for knowledge sharing, integration, and application. However, named entity recognition (NER) encounters significant challenges in this domain, particularly due to the prevalence of nested entities and domain-specific named entities, coupled with a scarcity of labeled data. To address these challenges, we compiled a corpus from 50 books on chicken diseases, covering 28 different disease types. Utilizing this corpus, we constructed the CDNER dataset and developed a nested NER model, MFGFF-BiLSTM-EGP. This model integrates the multiple fine-grained feature fusion (MFGFF) module with a BiLSTM neural network and employs an efficient global pointer (EGP) to predict the entity location encoding. In the MFGFF module, we designed three encoders: the character encoder, word encoder, and sentence encoder. This design effectively captured fine-grained features and improved the recognition accuracy of nested entities. Experimental results showed that the model performed robustly, with F1 scores of 91.98%, 73.32%, and 82.54% on the CDNER, CMeEE V2, and CLUENER datasets, respectively, outperforming other commonly used NER models. Specifically, on the CDNER dataset, the model achieved an F1 score of 79.68% for nested entity recognition. This research not only advances the development of a knowledge graph and intelligent question-answering system for chicken diseases, but also provides a viable solution for extracting disease information that can be applied to other livestock species.
引用
收藏
页数:23
相关论文
共 50 条
  • [21] Fine-grained pornographic image recognition with multiple feature fusion transfer learning
    Lin, Xinnan
    Qin, Feiwei
    Peng, Yong
    Shao, Yanli
    INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2021, 12 (01) : 73 - 86
  • [22] Research on Chinese Fine-grained Geographic Entity Recognition Model based on Joint Lexicon Enhancement
    Li F.
    Wang H.
    Kong H.
    Liu F.
    Wang Z.
    Wang Q.
    Xu J.
    Shan Y.
    Zhou X.
    Yan F.
    Journal of Geo-Information Science, 2023, 25 (06) : 1106 - 1120
  • [23] Enhanced Chinese named entity recognition with multi-granularity BERT adapter and efficient global pointer
    Zhang, Lei
    Xia, Pengfei
    Ma, Xiaoxuan
    Yang, Chengwei
    Ding, Xin
    COMPLEX & INTELLIGENT SYSTEMS, 2024, 10 (03) : 4473 - 4491
  • [24] Enhanced Chinese named entity recognition with multi-granularity BERT adapter and efficient global pointer
    Lei Zhang
    Pengfei Xia
    Xiaoxuan Ma
    Chengwei Yang
    Xin Ding
    Complex & Intelligent Systems, 2024, 10 : 4473 - 4491
  • [25] Chinese Fine-grained Name Entity Recognition Based on Associated Memory Networks
    Ju S.-G.
    Li T.-N.
    Sun J.-P.
    Ruan Jian Xue Bao/Journal of Software, 2021, 32 (08): : 2545 - 2556
  • [26] Chinese Named Entity Recognition for IC Patent Domain Based on RoBERTa-wwm-ext, GCN and Efficient Global Pointer
    Lin, Yunxiao
    Tang, Jiahao
    Huang, Wenjun
    Ding, Yanyu
    Hu, Jianguo
    2024 5TH INTERNATIONAL CONFERENCE ON COMPUTING, NETWORKS AND INTERNET OF THINGS, CNIOT 2024, 2024, : 234 - 240
  • [27] Target Detection Optimization Model Based On Fine-grained Feature Fusion
    Bao, Xianfu
    Qiang, Zanxia
    Bai, Guangyao
    Yang, Rui
    INTERNATIONAL CONFERENCE ON IMAGE PROCESSING AND INTELLIGENT CONTROL (IPIC 2021), 2021, 11928
  • [28] Chinese medical named entity recognition based on feature fusion and multihead biaffine transformations
    Wang, Zhixiang
    Yolwas, Nurmemet
    Proceedings of SPIE - The International Society for Optical Engineering, 2024, 13210
  • [29] Chinese Named Entity Recognition method based on multi-feature fusion and biaffine
    Ke, Xiaohua
    Wu, Xiaobo
    Ou, Zexian
    Li, Binglong
    COMPLEX & INTELLIGENT SYSTEMS, 2024, 10 (05) : 6305 - 6318
  • [30] A probabilistic feature based Maximum Entropy model for Chinese named entity recognition
    Zhang, Suxiang
    Wang, Xiaojie
    Wen, Juan
    Qin, Ying
    Zhong, Yixin
    COMPUTER PROCESSING OF ORIENTAL LANGUAGES, PROCEEDINGS: BEYOND THE ORIENT: THE RESEARCH CHALLENGES AHEAD, 2006, 4285 : 189 - +