Deep Learning-Based Named Entity Recognition and Knowledge Graph Construction for Geological Hazards

被引:56
|
作者
Fan, Runyu [1 ,2 ]
Wang, Lizhe [1 ,2 ]
Yan, Jining [1 ,2 ]
Song, Weijing [1 ,2 ]
Zhu, Yingqian [1 ,2 ]
Chen, Xiaodao [1 ,2 ]
机构
[1] China Univ Geosci, Sch Comp Sci, Wuhan 430074, Peoples R China
[2] China Univ Geosci, Hubei Key Lab Intelligent Geoinformat Proc, Wuhan 430074, Peoples R China
基金
中国国家自然科学基金;
关键词
named entity recognition; knowledge graph; deep learning; geological hazards; NEURAL-NETWORKS;
D O I
10.3390/ijgi9010015
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Constructing a knowledge graph of geological hazards literature can facilitate the reuse of geological hazards literature and provide a reference for geological hazard governance. Named entity recognition (NER), as a core technology for constructing a geological hazard knowledge graph, has to face the challenges that named entities in geological hazard literature are diverse in form, ambiguous in semantics, and uncertain in context. This can introduce difficulties in designing practical features during the NER classification. To address the above problem, this paper proposes a deep learning-based NER model; namely, the deep, multi-branch BiGRU-CRF model, which combines a multi-branch bidirectional gated recurrent unit (BiGRU) layer and a conditional random field (CRF) model. In an end-to-end and supervised process, the proposed model automatically learns and transforms features by a multi-branch bidirectional GRU layer and enhances the output with a CRF layer. Besides the deep, multi-branch BiGRU-CRF model, we also proposed a pattern-based corpus construction method to construct the corpus needed for the deep, multi-branch BiGRU-CRF model. Experimental results indicated the proposed deep, multi-branch BiGRU-CRF model outperformed state-of-the-art models. The proposed deep, multi-branch BiGRU-CRF model constructed a large-scale geological hazard literature knowledge graph containing 34,457 entities nodes and 84,561 relations.
引用
收藏
页数:22
相关论文
共 50 条
  • [31] DTranNER: biomedical named entity recognition with deep learning-based label-label transition model
    Hong, S. K.
    Lee, Jae-Gil
    BMC BIOINFORMATICS, 2020, 21 (01)
  • [32] Named Entity Recognition using Knowledge Graph Embeddings and DistilBERT
    Mehta, Shreyansh
    Radke, Mansi A.
    Sunkle, Sagar
    2021 5TH INTERNATIONAL CONFERENCE ON NATURAL LANGUAGE PROCESSING AND INFORMATION RETRIEVAL, NLPIR 2021, 2021, : 146 - 150
  • [33] Multi-feature fusion named entity recognition method for grape knowledge graph construction
    Nie X.
    Zhang L.
    Niu D.
    Wu H.
    Zhu H.
    Zhang H.
    Nongye Gongcheng Xuebao/Transactions of the Chinese Society of Agricultural Engineering, 2024, 40 (03): : 201 - 210
  • [34] A lattice-transformer-graph deep learning model for Chinese named entity recognition
    Lin, Min
    Xu, Yanyan
    Cai, Chenghao
    Ke, Dengfeng
    Su, Kaile
    JOURNAL OF INTELLIGENT SYSTEMS, 2023, 32 (01)
  • [35] Named entity recognition in medical domain combined with knowledge graph
    Jin Z.
    He X.
    Yue S.
    Xiong Y.
    Luo J.
    Harbin Gongye Daxue Xuebao/Journal of Harbin Institute of Technology, 2023, 55 (05): : 50 - 58
  • [36] Improving dictionary-based named entity recognition with deep learning
    Nastou, Katerina
    Koutrouli, Mikaela
    Pyysalo, Sampo
    Jensen, Lars Juhl
    BIOINFORMATICS, 2024, 40 : ii45 - ii52
  • [37] Named Entity Recognition in Threat Intelligence Domain Based on Deep Learning
    Wang Y.
    Wang Z.-H.
    Li H.
    Huang W.-J.
    Dongbei Daxue Xuebao/Journal of Northeastern University, 2023, 44 (01): : 33 - 39
  • [38] Named Entity Recognition for Amharic Using Deep Learning
    Gamback, Bjorn
    Sikdar, Utpal Kumar
    2017 IST-AFRICA WEEK CONFERENCE (IST-AFRICA), 2017,
  • [39] Deep Learning Architectures for Named Entity Recognition: A Survey
    Thomas, Anu
    Sangeetha, S.
    ADVANCED COMPUTING AND INTELLIGENT ENGINEERING, 2020, 1082 : 215 - 225
  • [40] Deep Learning Approach for Arabic Named Entity Recognition
    Gridach, Mourad
    COMPUTATIONAL LINGUISTICS AND INTELLIGENT TEXT PROCESSING, (CICLING 2016), PT I, 2018, 9623 : 439 - 451