LargeEA: Aligning Entities for Large-scale Knowledge Graphs

被引:12
|
作者
Ge, Congcong [1 ]
Liu, Xiaoze [1 ]
Chen, Lu [1 ]
Gao, Yunjun [1 ]
Zheng, Baihua [2 ]
机构
[1] Zhejiang Univ, Coll Comp Sci, Hangzhou, Peoples R China
[2] Singapore Management Univ, Sch Comp & Informat Syst, Singapore, Singapore
来源
PROCEEDINGS OF THE VLDB ENDOWMENT | 2021年 / 15卷 / 02期
关键词
ALIGNMENT;
D O I
10.14778/3489496.3489504
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Entity alignment (EA) aims to find equivalent entities in different knowledge graphs (KGs). Current EA approaches suffer from scalability issues, limiting their usage in real-world EA scenarios. To tackle this challenge, we propose LargeEA to align entities between large-scale KGs. LargeEA consists of two channels, i.e., structure channel and name channel. For the structure channel, we present METIS-CPS, a memory-saving mini-batch generation strategy, to partition large KGs into smaller mini-batches. LargeEA, designed as a general tool, can adopt any existing EA approach to learn entities' structural features within each mini-batch independently. For the name channel, we first introduce NFF, a name feature fusion method, to capture rich name features of entities without involving any complex training process; we then exploit a name-based data augmentation to generate seed alignment without any human intervention. Such design fits common real-world scenarios much better, as seed alignment is not always available. Finally, LargeEA derives the EA results by fusing the structural features and name features of entities. Since no widely-acknowledged benchmark is available for large-scale EA evaluation, we also develop a large-scale EA benchmark called DBP1M extracted from real-world KGs. Extensive experiments confirm the superiority of LargeEA against state-of-the-art competitors.
引用
收藏
页码:237 / 245
页数:9
相关论文
共 50 条
  • [21] Research and Practice on the Framework for the Construction, Sharing, and Application of Large-scale Geoscience Knowledge Graphs
    Zhu Y.
    Sun K.
    Hu X.
    Lv H.
    Wang X.
    Yang J.
    Wang S.
    Li W.
    Song J.
    Su N.
    Mu X.
    Journal of Geo-Information Science, 2023, 25 (06) : 1215 - 1227
  • [22] Enhancing KBQA Performance in Large-Scale Chinese Knowledge Graphs Using Apache Spark
    Su, Yi-Jen
    Wu, Cheng-Wei
    Chen, Yi-Ju
    2024 6TH INTERNATIONAL CONFERENCE ON COMPUTER COMMUNICATION AND THE INTERNET, ICCCI 2024, 2024, : 181 - 186
  • [23] Explainable AI for Estimating Pathogenicity of Genetic Variants Using Large-Scale Knowledge Graphs
    Abe, Shuya
    Tago, Shinichiro
    Yokoyama, Kazuaki
    Ogawa, Miho
    Takei, Tomomi
    Imoto, Seiya
    Fuji, Masaru
    CANCERS, 2023, 15 (04)
  • [24] Group Centrality Maximization for Large-scale Graphs
    Angriman, Eugenio
    van der Grinten, Alexander
    Bojchevski, Aleksandar
    Zuegner, Daniel
    Guennemann, Stephan
    Meyerhenke, Henning
    2020 PROCEEDINGS OF THE SYMPOSIUM ON ALGORITHM ENGINEERING AND EXPERIMENTS, ALENEX, 2020, : 56 - 69
  • [25] EFFICIENCY OF THE LARGE-SCALE AGRI-INDUSTRIAL ENTITIES IN UKRAINE
    Demianenko, S.
    Sahaidak, M.
    Sas, O.
    Avramenko, T.
    Levkivskyi, Ye
    FINANCIAL AND CREDIT ACTIVITY-PROBLEMS OF THEORY AND PRACTICE, 2021, 1 (36): : 179 - 189
  • [26] Readable representations for large-scale bipartite graphs
    Sato, Shuji
    Misue, Kazuo
    Tanaka, Jiro
    KNOWLEDGE-BASED INTELLIGENT INFORMATION AND ENGINEERING SYSTEMS, PT 2, PROCEEDINGS, 2008, 5178 : 831 - 838
  • [27] Efficient Machine Learning On Large-Scale Graphs
    Erickson, Parker
    Lee, Victor E.
    Shi, Feng
    Tang, Jiliang
    PROCEEDINGS OF THE 28TH ACM SIGKDD CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, KDD 2022, 2022, : 4788 - 4789
  • [28] Understanding Coarsening for Embedding Large-Scale Graphs
    Akyildiz, Taha Atahan
    Aljundi, Amro Alabsi
    Kaya, Kamer
    2020 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2020, : 2937 - 2946
  • [29] Generating Large-Scale Heterogeneous Graphs for Benchmarking
    Gupta, Amarnath
    SPECIFYING BIG DATA BENCHMARKS, 2014, 8163 : 113 - 128
  • [30] Efficient mining algorithms for large-scale graphs
    Kishimoto, Yasunari
    Shiokawa, Hiroaki
    Fujiwara, Yasuhiro
    Onizuka, Makoto
    NTT Technical Review, 2013, 11 (12):