SLHCat: Mapping Wikipedia Categories and Lists to DBpedia by Leveraging Semantic, Lexical, and Hierarchical Features

被引:0
|
作者
Wang, Zhaoyi [1 ]
Zhang, Zhenyang [1 ]
Qin, Jiaxin [1 ,2 ]
Iwaihara, Mizuho [1 ]
机构
[1] Waseda Univ, Grad Sch Informat Prod & Syst, Kitakyushu 8080135, Japan
[2] United Automot Elect Syst Co Ltd, Beijing, Peoples R China
关键词
Knowledge graph; Ontology alignment; Wikipedia categories and lists; DBpedia; CaLiGraph; Distant supervision;
D O I
10.1007/978-981-99-8085-7_12
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Wikipedia articles are hierarchically organized through categories and lists, providing one of the most comprehensive and universal taxonomy, but its open creation is causing redundancies and inconsistencies. Assigning DBPedia classes toWikipedia categories and lists can alleviate the problem, realizing a large knowledge graphwhich is essential for categorizing digital contents through entity linking and typing. However, the existing approach of CaLiGraph is producing incomplete and non-fine grained mappings. In this paper, we tackle the problem as ontology alignment, where structural information of knowledge graphs and lexical and semantic features of ontology class names are utilized to discover confident mappings, which are in turn utilized for finetuing pretrained language models in a distant supervision fashion. Our method SLHCat consists of two main parts: 1) Automatically generating training data by leveraging knowledge graph structure, semantic similarities, and named entity typing. 2) Finetuning and prompt-tuning of the pre-trained language model BERT are carried out over the training data, to capture semantic and syntactic properties of class names. Our model SLHCat is evaluated over a benchmark dataset constructed by annotating 3000 fine-grained CaLiGraph-DBpediamapping pairs. SLHCat is outperforming the baseline model by a large margin of 25% in accuracy, offering a practical solution for large-scale ontology mapping.
引用
收藏
页码:133 / 148
页数:16
相关论文
共 3 条
  • [1] SEMANTIC FEATURES OF NOUNS REFERRED TO VARIOUS LEXICAL-AND-GRAMMATICAL CATEGORIES
    Belov, Vadim A.
    VESTNIK VOLGOGRADSKOGO GOSUDARSTVENNOGO UNIVERSITETA-SERIYA 2-YAZYKOZNANIE, 2020, 19 (03): : 38 - 48
  • [2] Mapping innate lexical features to grammatical categories:: Acquisition of English -ing and -ed
    Olsen, MB
    Weinberg, A
    Lilly, JP
    Drury, JE
    PROCEEDINGS OF THE TWENTIETH ANNUAL CONFERENCE OF THE COGNITIVE SCIENCE SOCIETY, 1998, : 794 - 799
  • [3] Semantic Segmentation of Aerial Imagery: A Novel Approach Leveraging Hierarchical Multi-scale Features and Channel-based Attention for Drone Applications
    Sahragard, E.
    Farsi, H.
    Mohamadzadeh, S.
    INTERNATIONAL JOURNAL OF ENGINEERING, 2024, 37 (05): : 1022 - 1035