Unsupervised framework for evaluating and explaining structural node embeddings of graphs

被引：0

作者：

Dehghan, Ashkan ^{[1
]}

Siuta, Kinga ^{[1
,2
]}

Skorupka, Agata ^{[1
,2
]}

Betlen, Andrei ^{[3
]}

Miller, David ^{[3
]}

Kaminski, Bogumil ^{[2
]}

Pralat, Pawel ^{[1
]}

机构：

[1] Toronto Metropolitan Univ, Dept Math, 350 Victoria St, Toronto, ON M5B 2K3, Canada

[2] SGH Warsaw Sch Econ, Al Niepodleglosci 162, PL-02554 Warsaw, Poland

[3] Patagona Technol, Pickering, ON, Canada

来源：

JOURNAL OF COMPLEX NETWORKS | 2024年 / 12卷 / 02期

基金：

瑞典研究理事会; 加拿大自然科学与工程研究理事会;

关键词：

graph embeddings; structural similarity; machine learning on graphs; explainable AI; node structural embeddings; role-based embeddings; network representation learning;

D O I：

10.1093/comnet/cnae003

中图分类号：

O1 [数学];

学科分类号：

0701 ; 070101 ;

摘要：

An embedding is a mapping from a set of nodes of a network into a real vector space. Embeddings can have various aims like capturing the underlying graph topology and structure, node-to-node relationship, or other relevant information about the graph, its subgraphs or nodes themselves. A practical challenge with using embeddings is that there are many available variants to choose from. Selecting a small set of most promising embeddings from the long list of possible options for a given task is challenging and often requires domain expertise. Embeddings can be categorized into two main types: classical embeddings and structural embeddings. Classical embeddings focus on learning both local and global proximity of nodes, while structural embeddings learn information specifically about the local structure of nodes' neighbourhood. For classical node embeddings, there exists a framework which helps data scientists to identify (in an unsupervised way) a few embeddings that are worth further investigation. Unfortunately, no such framework exists for structural embeddings. In this article, we propose a framework for unsupervised ranking of structural graph embeddings. The proposed framework, apart from assigning an aggregate quality score for a structural embedding, additionally gives a data scientist insights into properties of this embedding. It produces information which predefined node features the embedding learns, how well it learns them, and which dimensions in the embedded space represent the predefined node features. Using this information, the user gets a level of explainability to an otherwise complex black-box embedding algorithm.

引用

页数：24

共 50 条

[21] Semantic Node Embeddings of Distributed Graphs Using Apache Spark
Narayanan, V. Suriya
Vijayakumar, Vijeth Bidare
Venkatraman, Sai Raam
Baruah, Pallav Kumar
2016 FOURTH INTERNATIONAL CONFERENCE ON PARALLEL, DISTRIBUTED AND GRID COMPUTING (PDGC), 2016, : 709 - 713
[22] FONDUE: Framework for Node Disambiguation Using Network Embeddings
Mel, Ahmad
Kang, Bo
Lijffijt, Jefrey
De Bie, Tijl
2020 IEEE 7TH INTERNATIONAL CONFERENCE ON DATA SCIENCE AND ADVANCED ANALYTICS (DSAA 2020), 2020, : 158 - 167
[23] Learning Structural Node Embeddings via Diffusion Wavelets
Donnat, Claire
Zitnik, Marinka
Hallac, David
Leskovec, Jure
KDD'18: PROCEEDINGS OF THE 24TH ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY & DATA MINING, 2018, : 1320 - 1329
[24] Retrofitting Structural Graph Embeddings with Node Attribute Information
Bielak, Piotr
Puchalska, Daria
Kajdanowicz, Tomasz
COMPUTATIONAL SCIENCE - ICCS 2022, PT I, 2022, : 178 - 191
[25] Learning structured embeddings of knowledge graphs with generative adversarial framework
Liu, Lu
Zeng, Jiehang
Zheng, Xiaoqing
EXPERT SYSTEMS WITH APPLICATIONS, 2022, 204
[26] FONDUE: A Framework for Node Disambiguation and Deduplication Using Network Embeddings
Mel, Ahmad
Kang, Bo
Lijffijt, Jefrey
De Bie, Tijl
APPLIED SCIENCES-BASEL, 2021, 11 (21):
[27] Node rewriting in graphs and hypergraphs: a categorical framework
Bauderon, M
Jacquet, H
THEORETICAL COMPUTER SCIENCE, 2001, 266 (1-2) : 463 - 487
[28] A Transparent Framework for Evaluating Unintended Demographic Bias in Word Embeddings
Sweeney, Chris
Najafian, Maryam
57TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2019), 2019, : 1662 - 1667
[29] Evaluating the hyponym attachments in an unsupervised taxonomy enrichment framework
Chifu, Emil St.
Chifu, Viorica R.
WEBIST 2008: PROCEEDINGS OF THE FOURTH INTERNATIONAL CONFERENCE ON WEB INFORMATION SYSTEMS AND TECHNOLOGIES, VOL 2, 2008, : 247 - 252
[30] ExFaKT: A Framework for Explaining Facts over Knowledge Graphs and Text
Gad-Elrab, Mohamed H.
Stepanova, Dania
Urbani, Jacopo
Weikum, Gerhard
PROCEEDINGS OF THE TWELFTH ACM INTERNATIONAL CONFERENCE ON WEB SEARCH AND DATA MINING (WSDM'19), 2019, : 87 - 95

← 1 2 3 4 5 →