Named-entity recognition for the diagnosis and treatment of aquatic animal diseases using knowledge graph construction

被引:0
|
作者
Jusheng L. [1 ,2 ,3 ]
Huining Y. [1 ,2 ,3 ]
Zhetao S. [1 ,2 ,3 ]
He Y. [1 ,2 ,3 ]
Liming S. [1 ,2 ,3 ]
Hong Y. [1 ,2 ,3 ]
Sijia Z. [1 ,2 ,3 ]
Shigen Y. [4 ]
机构
[1] College of Information Engineering, Dalian Ocean University, Dalian
[2] Key Laboratory of Environment Controlled Aquaculture, Ministry of Education, Dalian Ocean University, Dalian
[3] Key Laboratory of Marine Information Technology of Liaoning Province, Dalian
[4] College of Fisheries and Life, Dalian Ocean University, Dalian
关键词
aquaculture; BERT; BiLSTM; knowledge graph; models; named entity recognition; nested entity;
D O I
10.11975/j.issn.1002-6819.2022.07.023
中图分类号
学科分类号
摘要
Disease diagnosis and treatment have been an important support for aquatic animal health in aquaculture. A knowledge graph can be an effective way to express and apply the knowledge on the aquatic animal disease diagnosis and treatment. Among them, the named entity recognition has been the key component to construct the knowledge graph of aquatic animal diseases, particularly on the polysemy and entity nesting. However, the low recognition accuracy of named entities has posed a great challenge to the diagnosis and treatment of aquatic animal diseases. In this study, a diagnosis and treatment of aquatic animal diseases named entity recognition was proposed using BERT+CaBiLSTM+CRF (Bidirectional Encoder Representations from Transformers+Cascade-Bi-directional Long Short-Term Memory+Conditional Random Field). Firstly, the feature of the BERT model contained the position vector information. The polysemy was effectively improved to distinguish the different meanings that were expressed by entities in different contexts. Secondly, the CaBiLSTM model was designed for the nested named entity recognition using “hierarchical thinking”. The reason was that the inner entity in the nested entity of aquatic medicine greatly contributed to the recognition of the outer entity. First of all, the BiLSTM+CRF model was used to identify the inner entities that appeared frequently, and then the dimension reduction of the identified inner entity feature matrix was connected outer entity feature matrix to retain the complete inner entity feature information. After that, the BiLSTM+CRF model was used for the outer entity recognition to improve the discrimination of outer entities for the accurate recognition of outer entities. Finally, a comparative experiment was designed to verify the effectiveness of the proposed recognition. The test results show that the accuracy, recall, and F1 value of the named entity recognition task in the aquatic medicine using the BERT+CaBiLSTM+CRF model reached 93.07%, 92.85%, and 92.96%, respectively. The entity structure features were outstanding in terms of specific entity categories, due to the five types of non-nested entities, such as aquatic animal names, drug names, disease names, disease sites, and pathogens. For example, most aquatic animal names contained the radicals, such as “worm” and “fish”. The radicalsand drug names were mostly composed of chemical elements, while the disease names were mostly ended with the word “disease”, indicating a higher recognition accuracy than that in the nested entities. But in view of the outstanding nested structure of entities, the model performed better to identify the nested named entities, such as the clinical symptoms using the named entity recognition model integrating the BERT and CaBiLSTM designed by the “hierarchical idea”. Higher recognition was achieved than before. The recognition accuracy, recall, and F1 value increased by 12.31, 12.76, and 12.53 percentage points, respectively. Therefore, the model can be expected to effectively improve the accuracy of entity recognition caused by ambiguity and entity nesting in the task of diagnosis and treatment of aquatic animal diseases named entity recognition. The finding can provide the potential support to construct the fisheries field knowledge graph, further promote the healthy aquaculture projects. © 2022 Chinese Society of Agricultural Engineering. All rights reserved.
引用
收藏
页码:210 / 217
页数:7
相关论文
共 28 条
  • [1] Zhong Meiying, Guo Ya, Hu Kai, Et al., Advances and trends in the research of Eriocheir sinensis based on bibliometrics and knowledge graph, Transactions of the Chinese Society of Agricultural Engineering (Transactions of the CSAE), 38, 3, pp. 311-322, (2022)
  • [2] Zheng Sining, Liu Qiang, Zheng Yifang, Et al., Analysis of scale aquaculture production efficiency and analysis of its determinants, Transactions of the Chinese Society of Agricultural Engineering (Transactions of the CSAE), 32, 20, pp. 229-235, (2016)
  • [3] Zhang Shanwen, Wang Zhen, Wang Zuliang, Prediction of wheat stripe rust disease by combining knowledge graph and bidirectional long short term memory network, Transactions of the Chinese Society of Agricultural Engineering (Transactions of the CSAE), 36, 12, pp. 172-178, (2020)
  • [4] Hou Mengwei, Wei Rong, Lu Liang, Et al., Research review of knowledge graph and its application in medical domain, Journal of Computer Research and Development, 55, 12, pp. 2587-2599, (2018)
  • [5] Yang Jinfeng, Yu Qiubin, Guan Yi, Et al., An overview of research on electronic medical record oriented named entity recognition and entity relation extraction, Acta Automatica Sinica, 40, 8, pp. 1537-1562, (2014)
  • [6] Zhao Ming, Du Yaru, Du Huifang, Et al., Research on ontology non-taxonomic relations extraction in plant domain knowledge graph construction, Transactions of the Chinese Society for Agricultural Machinery, 47, 9, pp. 278-284, (2016)
  • [7] Hu X, Duan J, Dang D., Natural language question answering over knowledge graph: The marriage of SPARQL query and keyword search, Knowledge and Information Systems, 63, 4, pp. 819-844, (2021)
  • [8] Weng H, Liu Z, Yan S, Et al., A framework for automated knowledge graph construction towards traditional Chinese medicine, International Conference on Health Information Science (HIS), pp. 170-181, (2017)
  • [9] Ai-khatib K, Hou Y, Wachsmuth H, Et al., End-to-end argumentation knowledge graph construction, The Thirty-Fourth AAAI Conference on Artificial Intelligence (AAAI), pp. 7367-7374, (2020)
  • [10] Yang Yuji, Xu Bin, Hu Jiawei, Et al., Accurate and efficient method for constructing domain knowledge graph, Journal of Software, 29, 10, pp. 2931-2947, (2018)