LargeEA: Aligning Entities for Large-scale Knowledge Graphs

Cited by: 12
Authors
Ge, Congcong [1]
Liu, Xiaoze [1]
Chen, Lu [1]
Gao, Yunjun [1]
Zheng, Baihua [2]
Affiliations
[1] Zhejiang Univ, Coll Comp Sci, Hangzhou, Peoples R China
[2] Singapore Management Univ, Sch Comp & Informat Syst, Singapore, Singapore
Source
PROCEEDINGS OF THE VLDB ENDOWMENT | 2021 / Vol. 15 / Issue 02
Keywords
ALIGNMENT;
DOI
10.14778/3489496.3489504
Chinese Library Classification number
TP [Automation technology, computer technology];
Discipline classification code
0812;
Abstract
Entity alignment (EA) aims to find equivalent entities in different knowledge graphs (KGs). Current EA approaches suffer from scalability issues, limiting their usage in real-world EA scenarios. To tackle this challenge, we propose LargeEA to align entities between large-scale KGs. LargeEA consists of two channels: a structure channel and a name channel. For the structure channel, we present METIS-CPS, a memory-saving mini-batch generation strategy, to partition large KGs into smaller mini-batches. LargeEA, designed as a general tool, can adopt any existing EA approach to learn entities' structural features within each mini-batch independently. For the name channel, we first introduce NFF, a name feature fusion method, to capture rich name features of entities without involving any complex training process; we then exploit a name-based data augmentation to generate seed alignment without any human intervention. This design fits common real-world scenarios much better, as seed alignment is not always available. Finally, LargeEA derives the EA results by fusing the structural features and name features of entities. Since no widely acknowledged benchmark is available for large-scale EA evaluation, we also develop a large-scale EA benchmark called DBP1M, extracted from real-world KGs. Extensive experiments confirm the superiority of LargeEA against state-of-the-art competitors.
Pages: 237 - 245
Page count: 9