Learning Entity Linking Features for Emerging Entities

被引:0
|
作者
Ran, Chenwei [1 ]
Shen, Wei [2 ]
Gao, Jianbo [2 ]
Li, Yuhan [2 ]
Wang, Jianyong [1 ,3 ]
Jia, Yantao [4 ]
机构
[1] Tsinghua Univ, Dept Comp Sci & Technol, Beijing 100084, Peoples R China
[2] Nankai Univ, TMCC, TKLNDST, Coll Comp Sci, Tianjin 300350, Peoples R China
[3] Jiangsu Normal Univ, Jiangsu Collaborat Innovat Ctr Language Abil, Xuzhou 221008, Jiangsu, Peoples R China
[4] Huawei Technol Co Ltd, Beijing 100077, Peoples R China
基金
中国国家自然科学基金;
关键词
Encyclopedias; Optimization; Online services; Internet; Task analysis; Numerical models; Data models; Entity linking; entity linking feature; emerging entity; self-training;
D O I
10.1109/TKDE.2022.3197707
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Entity linking (EL) is the process of linking entity mentions appearing in text with their corresponding entities in a knowledge base. EL features of entities (e.g., prior probability, relatedness score, and entity embedding) are usually estimated based on Wikipedia. However, for newly emerging entities (EEs) which have just been discovered in news, they may still not be included in Wikipedia yet. As a consequence, it is unable to obtain required EL features for those EEs from Wikipedia and EL models will always fail to link ambiguous mentions with those EEs correctly as the absence of their EL features. To deal with this problem, in this paper we focus on a new task of learning EL features for emerging entities in a general way. We propose a novel approach called STAMO to learn high-quality EL features for EEs automatically, which needs just a small number of labeled documents for each EE collected from the Web, as it could further leverage the knowledge hidden in the unlabeled data. STAMO is mainly based on self-training, which makes it flexibly integrated with any EL feature or EL model, but also makes it easily suffer from the error reinforcement problem caused by the mislabeled data. Instead of some common self-training strategies that try to throw the mislabeled data away explicitly, we regard self-training as a multiple optimization process with respect to the EL features of EEs, and propose both intra-slot and inter-slot optimizations to alleviate the error reinforcement problem implicitly. We construct two EL datasets involving selected EEs to evaluate the quality of obtained EL features for EEs, and the experimental results show that our approach significantly outperforms other baseline methods of learning EL features.
引用
收藏
页码:7088 / 7102
页数:15
相关论文
共 50 条
  • [1] Predicting Guiding Entities for Entity Aspect Linking
    Chatterjee, Shubham
    Dietz, Laura
    PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, CIKM 2022, 2022, : 3848 - 3852
  • [2] Entity Linking over Nested Named Entities for Russian
    Loukachevitch, Natalia
    Braslavski, Pavel
    Ivanov, Vladimir
    Batura, Tatiana
    Manandhar, Suresh
    Shelmanov, Artem
    Tutubalina, Elena
    LREC 2022: THIRTEEN INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2022, : 4458 - 4466
  • [3] Semantically linking molecular entities in literature through entity relationships
    Sofie Van Landeghem
    Jari Björne
    Thomas Abeel
    Bernard De Baets
    Tapio Salakoski
    Yves Van de Peer
    BMC Bioinformatics, 13
  • [4] Semantically linking molecular entities in literature through entity relationships
    Van Landeghem, Sofie
    Bjorne, Jari
    Abeel, Thomas
    De Baets, Bernard
    Salakoski, Tapio
    Van de Peer, Yves
    BMC BIOINFORMATICS, 2012, 13
  • [5] TempEL: Linking Dynamically Evolving and Newly Emerging Entities
    Zaporojets, Klim
    Kaffee, Lucie-Aimee
    Deleu, Johannes
    Demeester, Thomas
    Develder, Chris
    Augenstein, Isabelle
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,
  • [6] Joint learning of local and global features for entity linking via neural networks
    1600, Association for Computational Linguistics, ACL Anthology
  • [7] Learning representations of Web entities for entity resolution
    Barbosa, Luciano
    INTERNATIONAL JOURNAL OF WEB INFORMATION SYSTEMS, 2019, 15 (03) : 346 - 358
  • [8] Joint Learning of Named Entity Recognition and Entity Linking
    Martins, Pedro Henrique
    Marinho, Zita
    Martins, Andre F. T.
    57TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2019:): STUDENT RESEARCH WORKSHOP, 2019, : 190 - 196
  • [9] Entity Linking at the Tail: Sparse Signals, Unknown Entities, and Phrase Models
    Jin, Yuzhe
    Kiciman, Emre
    Wang, Kuansan
    Loynd, Ricky
    WSDM'14: PROCEEDINGS OF THE 7TH ACM INTERNATIONAL CONFERENCE ON WEB SEARCH AND DATA MINING, 2014, : 453 - 462
  • [10] An entity linking model based on candidate features
    Dun Li
    Zijian Fu
    Zhiyun Zheng
    Social Network Analysis and Mining, 2021, 11