Unsupervised framework for evaluating and explaining structural node embeddings of graphs

被引:0
|
作者
Dehghan, Ashkan [1 ]
Siuta, Kinga [1 ,2 ]
Skorupka, Agata [1 ,2 ]
Betlen, Andrei [3 ]
Miller, David [3 ]
Kaminski, Bogumil [2 ]
Pralat, Pawel [1 ]
机构
[1] Toronto Metropolitan Univ, Dept Math, 350 Victoria St, Toronto, ON M5B 2K3, Canada
[2] SGH Warsaw Sch Econ, Al Niepodleglosci 162, PL-02554 Warsaw, Poland
[3] Patagona Technol, Pickering, ON, Canada
基金
瑞典研究理事会; 加拿大自然科学与工程研究理事会;
关键词
graph embeddings; structural similarity; machine learning on graphs; explainable AI; node structural embeddings; role-based embeddings; network representation learning;
D O I
10.1093/comnet/cnae003
中图分类号
O1 [数学];
学科分类号
0701 ; 070101 ;
摘要
An embedding is a mapping from a set of nodes of a network into a real vector space. Embeddings can have various aims like capturing the underlying graph topology and structure, node-to-node relationship, or other relevant information about the graph, its subgraphs or nodes themselves. A practical challenge with using embeddings is that there are many available variants to choose from. Selecting a small set of most promising embeddings from the long list of possible options for a given task is challenging and often requires domain expertise. Embeddings can be categorized into two main types: classical embeddings and structural embeddings. Classical embeddings focus on learning both local and global proximity of nodes, while structural embeddings learn information specifically about the local structure of nodes' neighbourhood. For classical node embeddings, there exists a framework which helps data scientists to identify (in an unsupervised way) a few embeddings that are worth further investigation. Unfortunately, no such framework exists for structural embeddings. In this article, we propose a framework for unsupervised ranking of structural graph embeddings. The proposed framework, apart from assigning an aggregate quality score for a structural embedding, additionally gives a data scientist insights into properties of this embedding. It produces information which predefined node features the embedding learns, how well it learns them, and which dimensions in the embedded space represent the predefined node features. Using this information, the user gets a level of explainability to an otherwise complex black-box embedding algorithm.
引用
收藏
页数:24
相关论文
共 50 条
  • [31] BOND: Benchmarking Unsupervised Outlier Node Detection on Static Attributed Graphs
    Liu, Kay
    Dou, Yingtong
    Zhao, Yue
    Ding, Xueying
    Hu, Xiyang
    Zhang, Ruitong
    Ding, Kaize
    Chen, Canyu
    Peng, Hao
    Shu, Kai
    Sun, Lichao
    Li, Jundong
    Chen, George H.
    Jia, Zhihao
    Yu, Philip S.
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,
  • [32] Subexponential-time framework for optimal embeddings of graphs in integer lattices
    Dessmark, A
    Lingas, A
    Lundell, EM
    ALGORITHM THEORY- SWAT 2004, 2004, 3111 : 248 - 259
  • [33] Learning Structural Node Representations on Directed Graphs
    Steenfatt, Niklas
    Nikolentzos, Giannis
    Vazirgiannis, Michalis
    Zhao, Qiang
    COMPLEX NETWORKS AND THEIR APPLICATIONS VII, VOL 2, 2019, 813 : 132 - 144
  • [34] A framework for demonstrating, forecasting, and explaining topic evolution by analyzing geometrical motion of topic embeddings
    Huang, Shengzhi
    Lu, Wei
    Cheng, Qikai
    Huang, Yong
    Yi, Fan
    Zhu, Liang
    QUANTITATIVE SCIENCE STUDIES, 2025, 6 : 171 - 193
  • [35] A Framework for Developing and Evaluating Word Embeddings of Drug-named Entity
    Zhao, Mengnan
    Masino, Aaron J.
    Yang, Christopher C.
    SIGBIOMED WORKSHOP ON BIOMEDICAL NATURAL LANGUAGE PROCESSING (BIONLP 2018), 2018, : 156 - 160
  • [36] Structural Intervention Distance for Evaluating Causal Graphs
    Peters, Jonas
    Buehlmann, Peter
    NEURAL COMPUTATION, 2015, 27 (03) : 771 - 799
  • [37] Detecting bots in social-networks using node and structural embeddings
    Dehghan, Ashkan
    Siuta, Kinga
    Skorupka, Agata
    Dubey, Akshat
    Betlen, Andrei
    Miller, David
    Xu, Wei
    Kaminski, Bogumil
    Pralat, Pawel
    JOURNAL OF BIG DATA, 2023, 10 (01)
  • [38] Walk Extraction Strategies for Node Embeddings with RDF2Vec in Knowledge Graphs
    Steenwinckel, Bram
    Vandewiele, Gilles
    Bonte, Pieter
    Weyns, Michael
    Paulheim, Heiko
    Ristoski, Petar
    De Turck, Filip
    Ongenae, Femke
    DATABASE AND EXPERT SYSTEMS APPLICATIONS - DEXA 2021 WORKSHOPS, 2021, 1479 : 70 - 80
  • [39] An Unsupervised Neural Prediction Framework for Learning Speaker Embeddings using Recurrent Neural Networks
    Jati, Arindam
    Georgiou, Panayiotis
    19TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2018), VOLS 1-6: SPEECH RESEARCH FOR EMERGING MARKETS IN MULTILINGUAL SOCIETIES, 2018, : 1131 - 1135
  • [40] Detecting bots in social-networks using node and structural embeddings
    Ashkan Dehghan
    Kinga Siuta
    Agata Skorupka
    Akshat Dubey
    Andrei Betlen
    David Miller
    Wei Xu
    Bogumił Kamiński
    Paweł Prałat
    Journal of Big Data, 10