Named Entity Recognition in Aviation Products Domain Based on BERT

Authors
Yang, Mingye [1]
Namoano, Bernadin [1]
Farsi, Maryam [1]
Erkoyuncu, John Ahmet [1]
Affiliation
[1] Cranfield Univ, Ctr Digital Engn & Mfg, Cranfield MK43 0AL, England
Source
IEEE Access | 2024 / Vol. 12
Funding
UK Engineering and Physical Sciences Research Council (EPSRC)
Keywords
Hidden Markov models; Data models; Named entity recognition; Knowledge graphs; Atmospheric modeling; Feature extraction; Data mining; Ontologies; Encoding; Biological system modeling; Aviation; named entity recognition (NER); knowledge graph; bidirectional encoder representations from transformers (BERT); bidirectional long short-term memory network (Bi-LSTM);
DOI
10.1109/ACCESS.2024.3516390
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
The aviation products' manufacturing industry is undergoing a profound transformation towards intelligence, in which the construction of a knowledge graph specifically for the aviation field has become the core link in achieving cognitive intelligence. In the process of knowledge graph construction, named entity recognition (NER) is a key step and one of the main tasks of knowledge extraction. Given the high degree of specialisation of aviation product text data and the wide span of contextual information, existing models often perform poorly in entity extraction. This paper proposes a new NER method specifically tailored for the aviation product field (BBC-Ap), introducing an approach that leverages domain-specific ontologies and deep learning algorithms to enhance the accuracy and efficiency of entity extraction from complex technical documents. The first step of this method is to establish an ontology model of aviation products and annotate the relevant text data to form a dataset for training the named entity model. Next, it adopts a multi-level model structure based on BERT, in which BERT is used to generate word vector representations, a bidirectional long short-term memory network (BiLSTM) is used as an encoder to extract semantic features, and a conditional random field (CRF) is used as a decoder to achieve optimal label assignment. In experiments on the constructed aviation product dataset, the model achieved a Precision of 91.74%, a Recall of 92.46%, and an F1 score of 92.1%. Compared with other baseline models, the F1 score is improved by 0.9% to 1.5%. The model also performs well on standard datasets such as CoNLLpp, with a Precision of 92.87%, a Recall of 92.54%, and an F1 score of 92.70%.
Finally, the model was used to successfully construct a knowledge graph reflecting the relationships between aviation products in Neo4j, further demonstrating the effectiveness and practicality of the method.
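The precision, recall and F1 figures quoted in the abstract can be cross-checked against each other. The sketch below assumes that the reported F1 is the standard harmonic mean of the reported precision and recall; the record does not say how the scores were averaged across entity types, so this is only a consistency check, not a reproduction of the authors' evaluation. The function name `f1_score` and the `reported` dictionary are illustrative, not taken from the paper.

```python
def f1_score(precision: float, recall: float) -> float:
    """Harmonic mean of precision and recall (inputs and output in percent)."""
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)


# Figures quoted in the abstract, in percent: (precision, recall, reported F1).
# Assumption: F1 is computed from these two aggregate values directly.
reported = {
    "aviation product dataset": (91.74, 92.46, 92.1),
    "CoNLLpp": (92.87, 92.54, 92.70),
}

for name, (p, r, f1_reported) in reported.items():
    f1 = f1_score(p, r)
    print(f"{name}: computed F1 = {f1:.2f}%, reported F1 = {f1_reported}%")
```

Under that assumption, the computed values agree with the reported ones to the precision given in the abstract.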
Pages: 189710-189721
Number of pages: 12