Unsupervised framework for evaluating and explaining structural node embeddings of graphs

Cited by: 0

Authors:

Dehghan, Ashkan ^{[1]}

Siuta, Kinga ^{[1,2]}

Skorupka, Agata ^{[1,2]}

Betlen, Andrei ^{[3]}

Miller, David ^{[3]}

Kaminski, Bogumil ^{[2]}

Pralat, Pawel ^{[1]}

Affiliations:

[1] Toronto Metropolitan Univ, Dept Math, 350 Victoria St, Toronto, ON M5B 2K3, Canada

[2] SGH Warsaw Sch Econ, Al Niepodleglosci 162, PL-02554 Warsaw, Poland

[3] Patagona Technol, Pickering, ON, Canada

Source:

JOURNAL OF COMPLEX NETWORKS | 2024 / Vol. 12 / Issue 02

Funding:

Swedish Research Council; Natural Sciences and Engineering Research Council of Canada;

Keywords:

graph embeddings; structural similarity; machine learning on graphs; explainable AI; node structural embeddings; role-based embeddings; network representation learning;

DOI:

10.1093/comnet/cnae003

Chinese Library Classification number:

O1 [Mathematics];

Discipline classification code:

0701 ; 070101 ;

Abstract:

An embedding is a mapping from a set of nodes of a network into a real vector space. Embeddings can have various aims like capturing the underlying graph topology and structure, node-to-node relationship, or other relevant information about the graph, its subgraphs or nodes themselves. A practical challenge with using embeddings is that there are many available variants to choose from. Selecting a small set of most promising embeddings from the long list of possible options for a given task is challenging and often requires domain expertise. Embeddings can be categorized into two main types: classical embeddings and structural embeddings. Classical embeddings focus on learning both local and global proximity of nodes, while structural embeddings learn information specifically about the local structure of nodes' neighbourhood. For classical node embeddings, there exists a framework which helps data scientists to identify (in an unsupervised way) a few embeddings that are worth further investigation. Unfortunately, no such framework exists for structural embeddings. In this article, we propose a framework for unsupervised ranking of structural graph embeddings. The proposed framework, apart from assigning an aggregate quality score for a structural embedding, additionally gives a data scientist insights into properties of this embedding. It produces information which predefined node features the embedding learns, how well it learns them, and which dimensions in the embedded space represent the predefined node features. Using this information, the user gets a level of explainability to an otherwise complex black-box embedding algorithm.


Pages: 24
