Using graph distances for named-entity linking

被引:3
|
作者
Blanco, Roi [1 ]
Boldi, Paolo [2 ]
Marino, Andrea [2 ]
机构
[1] Yahoo Labs, London, England
[2] Univ Milan, Dipartimento Informat, I-20122 Milan, Italy
关键词
Entity linking; Maximum Capacity Representative Set; Minimum Distance Representative;
D O I
10.1016/j.scico.2015.10.013
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Entity-linking is a natural-language-processing task that consists in identifying strings of text that refer to a particular item in some reference knowledge base. When the knowledge base is Wikipedia, the problem is also referred to as wikification (in this case, items are Wikipedia articles). Entity-linking consists conceptually of many different phases: identifying the portions of text that may refer to an entity (sometimes called "entity detection"), determining a set of concepts (candidates) from the knowledge base that may match each such portion, and choosing one candidate for each set; the latter step, known as candidate selection, is the phase on which this paper focuses. One instance of candidate selection can be formalized as an optimization problem on the underlying concept graph, where the quantity to be optimized is the average distance between the selected items. Inspired by this application, we define a new graph problem which is a natural variant of the Maximum Capacity Representative Set. We prove that our problem is NP-hard for general graphs; we propose several heuristics trying to optimize similar easier objective functions; we show experimentally how these approaches perform with respect to some baselines on a real-world dataset. Finally, in the appendix, we show an exact linear time algorithm that works under some more restrictive assumptions. (C) 2015 Elsevier B.V. All rights reserved.
引用
收藏
页码:24 / 36
页数:13
相关论文
共 50 条
  • [1] Re-ranking for Joint Named-Entity Recognition and Linking
    Sil, Avirup
    Yates, Alexander
    PROCEEDINGS OF THE 22ND ACM INTERNATIONAL CONFERENCE ON INFORMATION & KNOWLEDGE MANAGEMENT (CIKM'13), 2013, : 2369 - 2374
  • [2] Deep Learning for Named-Entity Linking with Transfer Learning for Legal Documents
    Elnaggar, Ahmed
    Otto, Robin
    Matthes, Florian
    PROCEEDINGS OF 2018 ARTIFICIAL INTELLIGENCE AND CLOUD COMPUTING CONFERENCE (AICCC 2018), 2018, : 23 - 28
  • [3] Document Theme Extraction Using Named-Entity Recognition
    Nagrale, Deepali
    Khatavkar, Vaibhav
    Kulkarni, Parag
    COMPUTING, COMMUNICATION AND SIGNAL PROCESSING, ICCASP 2018, 2019, 810 : 499 - 509
  • [4] Named-entity recognition for the diagnosis and treatment of aquatic animal diseases using knowledge graph construction
    Jusheng L.
    Huining Y.
    Zhetao S.
    He Y.
    Liming S.
    Hong Y.
    Sijia Z.
    Shigen Y.
    Nongye Gongcheng Xuebao/Transactions of the Chinese Society of Agricultural Engineering, 2022, 38 (07): : 210 - 217
  • [5] Curatable Named-Entity Recognition Using Semantic Relations
    Hsu, Yi-Yu
    Kao, Hung-Yu
    IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2015, 12 (04) : 785 - 792
  • [6] Named-entity recognition for Polish with SProUT
    Piskorski, J
    INTELLIGENT MEDIA TECHNOLOGY FOR COMMUNICATIVE INTELLIGENCE, 2005, 3490 : 122 - 133
  • [7] Related Work in Named-Entity Recognition
    不详
    IEEE INTELLIGENT SYSTEMS, 2015, 30 (06) : 52 - 52
  • [8] Rembrandt - a named-entity recognition framework
    Cardoso, Nuno
    LREC 2012 - EIGHTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2012, : 1240 - 1243
  • [9] Zero-shot evaluation of ChatGPT for food named-entity recognition and linking
    Ogrinc, Matevz
    Korousic Seljak, Barbara
    Eftimov, Tome
    FRONTIERS IN NUTRITION, 2024, 11
  • [10] Graph-Based Named Entity Linking with Wikipedia
    Hachey, Ben
    Radford, Will
    Curran, James R.
    WEB INFORMATION SYSTEMS ENGINEERING - WISE 2011, 2011, 6997 : 213 - +