Deep Entity Linking via Eliminating Semantic Ambiguity With BERT

Cited by: 18
Authors
Yin, Xiaoyao [1]
Huang, Yangchen [1]
Zhou, Bin [1]
Li, Aiping [1]
Lan, Long [1,2]
Jia, Yan [1]
Affiliations
[1] Natl Univ Def Technol, Coll Comp, Changsha 410073, Peoples R China
[2] Natl Univ Def Technol, State Key Lab High Performance Comp, Changsha 410073, Peoples R China
Source
IEEE ACCESS | 2019 / Volume 7
Funding
National Natural Science Foundation of China;
Keywords
Task analysis; Knowledge based systems; Semantics; Joining processes; Bit error rate; Natural languages; Computational modeling; Entity linking; natural language processing (NLP); bidirectional encoder representations from transformers (BERT); deep neural network (DNN);
DOI
10.1109/ACCESS.2019.2955498
Chinese Library Classification number
TP [Automation technology, computer technology];
Discipline classification code
0812;
Abstract
Entity linking refers to the task of aligning mentions of entities in text to their corresponding entries in a specific knowledge base, which is of great significance for many natural language processing applications such as semantic text understanding and knowledge fusion. The crux of this problem is how to make effective use of contextual information to disambiguate mentions. Moreover, it has been observed that, in most cases, a mention has a string similar or even identical to that of the entity it refers to. To prevent the model from linking mentions to entities with similar strings rather than to the semantically similar ones, in this paper we introduce the advanced language representation model called BERT (Bidirectional Encoder Representations from Transformers) and design a hard negative sample mining strategy to fine-tune it accordingly. Based on the learned features, we obtain the valid entity by computing the similarity between the textual clues of mentions and the entity candidates in the knowledge base. The proposed hard negative sample mining strategy lets entity linking benefit from the larger, more expressive pre-trained representations of BERT with limited training time and computing resources. To the best of our knowledge, we are the first to equip the entity linking task with a powerful pre-trained general language model while deliberately tackling its potential shortcoming of learning literally, and the experiments on standard benchmark datasets show that the proposed model yields state-of-the-art results.
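The abstract does not spell out how hard negatives are selected, so the following is only an illustrative sketch of the general idea: for a given mention, pick the incorrect candidates whose names look most like the mention string, so that a model fine-tuned on them cannot succeed by surface matching alone. The function names, the use of `difflib.SequenceMatcher` as the string-similarity measure, and the toy candidate list are all assumptions made for illustration and are not taken from the paper.

```python
from difflib import SequenceMatcher


def string_similarity(a: str, b: str) -> float:
    """Case-insensitive surface similarity in [0, 1] (assumed measure)."""
    return SequenceMatcher(None, a.lower(), b.lower()).ratio()


def mine_hard_negatives(mention: str, gold_entity: str,
                        candidates: list[str], k: int = 2) -> list[str]:
    """Return the k wrong candidates whose names most resemble the mention.

    Hypothetical helper: the gold entity is excluded, and the remaining
    candidates are ranked by surface similarity to the mention string.
    """
    negatives = [c for c in candidates if c != gold_entity]
    negatives.sort(key=lambda c: string_similarity(mention, c), reverse=True)
    return negatives[:k]


# Toy example: the mention "Jordan" in a basketball context refers to
# "Michael Jordan", but several other entries share the surface string.
candidates = [
    "Jordan (country)",
    "Michael Jordan",
    "Michael I. Jordan",
    "Jordan River",
]
hard = mine_hard_negatives("Jordan", "Michael Jordan", candidates, k=2)
print(hard)
```

In a fine-tuning setup, each mention context would then be paired with its gold entity as a positive example and with these look-alike entities as negatives; how the paper scores the pairs with BERT is not described in this record.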
Pages: 169434-169445
Number of pages: 12