Explainable Similarity of Datasets Using Knowledge Graph

被引:3
|
作者
Skoda, Petr [1 ]
Klimek, Jakub [1 ]
Necasky, Martin [1 ]
Skopal, Tomas [1 ]
机构
[1] Charles Univ Prague, Dept Software Engn, Fac Math & Phys, Malostranske Namesti 25, Prague 11800 1, Czech Republic
关键词
Similarity; Datasets; Search; Graph;
D O I
10.1007/978-3-030-32047-8_10
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
There is a large quantity of datasets available as Open Data on the Web. However, it is challenging for users to find datasets relevant to their needs, even though the datasets are registered in catalogs such as the European Data Portal. This is because the available metadata such as keywords or textual description is not descriptive enough. At the same time, datasets exist in various types of contexts not expressed in the metadata. These may include information about the dataset publisher, the legislation related to dataset publication, language and cultural specifics, etc. In this paper we introduce a similarity model for matching datasets. The model assumes an ontology/knowledge graph, such as Wikidata.org, that serves as a graph-based context to which individual datasets are mapped based on their metadata. A similarity of the datasets is then computed as an aggregation over paths among nodes in the graph. The proposed similarity aims at addressing the problem of explainability of similarity, i.e., providing the user a structured explanation of the match which, in a broader sense, is nowadays a hot topic in the field of artificial intelligence.
引用
收藏
页码:103 / 110
页数:8
相关论文
共 50 条
  • [31] Deriving similarity for Semantic Web using similarity graph
    Kwon, JuHum
    Choi, O-Hoon
    Moon, Chang-Joo
    Park, Soo-Hyun
    Baik, Doo-Kwon
    JOURNAL OF INTELLIGENT INFORMATION SYSTEMS, 2006, 26 (02) : 149 - 166
  • [32] Knowledge Graph-Based Integration of Autonomous Driving Datasets
    Halilaj, Lavdim
    Luettin, Juergen
    Monka, Sebastian
    Henson, Cory
    Schmid, Stefan
    INTERNATIONAL JOURNAL OF SEMANTIC COMPUTING, 2023, 17 (02) : 249 - 271
  • [33] Knowledge-driven graph similarity for text classification
    Niloofer Shanavas
    Hui Wang
    Zhiwei Lin
    Glenn Hawe
    International Journal of Machine Learning and Cybernetics, 2021, 12 : 1067 - 1081
  • [34] Integrating Knowledge Graph Data with Large Language Models for Explainable Inference
    Efrain Quintero-Narvaez, Carlos
    Monroy, Raul
    PROCEEDINGS OF THE 17TH ACM INTERNATIONAL CONFERENCE ON WEB SEARCH AND DATA MINING, WSDM 2024, 2024, : 1198 - 1199
  • [35] A Human-Centric Evaluation Platform for Explainable Knowledge Graph Completion
    Xu, Zhao
    Ben Rim, Wiem
    Gashteovski, Kiril
    Sztyler, Timo
    Lawrence, Carolin
    PROCEEDINGS OF THE 18TH CONFERENCE OF THE EUROPEAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: SYSTEM DEMONSTRATIONS, 2024, : 18 - 26
  • [36] XCoS: Explainable Code Search Based on Query Scoping and Knowledge Graph
    Wang, Chong
    Peng, Xin
    Xing, Zhenchang
    Zhang, Yue
    Liu, Mingwei
    Luo, Rong
    Meng, Xiujie
    ACM TRANSACTIONS ON SOFTWARE ENGINEERING AND METHODOLOGY, 2023, 32 (06)
  • [37] Explainable recommendation based on knowledge graph and multi-objective optimization
    Xie, Lijie
    Hu, Zhaoming
    Cai, Xingjuan
    Zhang, Wensheng
    Chen, Jinjun
    COMPLEX & INTELLIGENT SYSTEMS, 2021, 7 (03) : 1241 - 1252
  • [38] Knowledge-driven graph similarity for text classification
    Shanavas, Niloofer
    Wang, Hui
    Lin, Zhiwei
    Hawe, Glenn
    INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2021, 12 (04) : 1067 - 1081
  • [39] Contextualized Knowledge Graph Embedding for Explainable Talent Training Course Recommendation
    Yang, Yang
    Zhang, Chubing
    Song, Xin
    Dong, Zheng
    Zhu, Hengshu
    Li, Wenjie
    ACM TRANSACTIONS ON INFORMATION SYSTEMS, 2024, 42 (02)
  • [40] Explainable recommendation based on knowledge graph and multi-objective optimization
    Lijie Xie
    Zhaoming Hu
    Xingjuan Cai
    Wensheng Zhang
    Jinjun Chen
    Complex & Intelligent Systems, 2021, 7 : 1241 - 1252