Unsupervised framework for evaluating and explaining structural node embeddings of graphs

Cited by: 0
Authors
Dehghan, Ashkan [1]
Siuta, Kinga [1,2]
Skorupka, Agata [1,2]
Betlen, Andrei [3]
Miller, David [3]
Kaminski, Bogumil [2]
Pralat, Pawel [1]
Affiliations
[1] Toronto Metropolitan Univ, Dept Math, 350 Victoria St, Toronto, ON M5B 2K3, Canada
[2] SGH Warsaw Sch Econ, Al Niepodleglosci 162, PL-02554 Warsaw, Poland
[3] Patagona Technol, Pickering, ON, Canada
Funding
Swedish Research Council; Natural Sciences and Engineering Research Council of Canada;
Keywords
graph embeddings; structural similarity; machine learning on graphs; explainable AI; node structural embeddings; role-based embeddings; network representation learning;
DOI
10.1093/comnet/cnae003
Chinese Library Classification number
O1 [Mathematics];
Discipline classification code
0701; 070101;
Abstract
An embedding is a mapping from a set of nodes of a network into a real vector space. Embeddings can have various aims like capturing the underlying graph topology and structure, node-to-node relationship, or other relevant information about the graph, its subgraphs or nodes themselves. A practical challenge with using embeddings is that there are many available variants to choose from. Selecting a small set of most promising embeddings from the long list of possible options for a given task is challenging and often requires domain expertise. Embeddings can be categorized into two main types: classical embeddings and structural embeddings. Classical embeddings focus on learning both local and global proximity of nodes, while structural embeddings learn information specifically about the local structure of nodes' neighbourhood. For classical node embeddings, there exists a framework which helps data scientists to identify (in an unsupervised way) a few embeddings that are worth further investigation. Unfortunately, no such framework exists for structural embeddings. In this article, we propose a framework for unsupervised ranking of structural graph embeddings. The proposed framework, apart from assigning an aggregate quality score for a structural embedding, additionally gives a data scientist insights into properties of this embedding. It produces information which predefined node features the embedding learns, how well it learns them, and which dimensions in the embedded space represent the predefined node features. Using this information, the user gets a level of explainability to an otherwise complex black-box embedding algorithm.
Pages: 24
Related papers
50 in total
  • [21] Semantic Node Embeddings of Distributed Graphs Using Apache Spark
    Narayanan, V. Suriya
    Vijayakumar, Vijeth Bidare
    Venkatraman, Sai Raam
    Baruah, Pallav Kumar
    2016 FOURTH INTERNATIONAL CONFERENCE ON PARALLEL, DISTRIBUTED AND GRID COMPUTING (PDGC), 2016, : 709 - 713
  • [22] FONDUE: Framework for Node Disambiguation Using Network Embeddings
    Mel, Ahmad
    Kang, Bo
    Lijffijt, Jefrey
    De Bie, Tijl
    2020 IEEE 7TH INTERNATIONAL CONFERENCE ON DATA SCIENCE AND ADVANCED ANALYTICS (DSAA 2020), 2020, : 158 - 167
  • [23] Learning Structural Node Embeddings via Diffusion Wavelets
    Donnat, Claire
    Zitnik, Marinka
    Hallac, David
    Leskovec, Jure
    KDD'18: PROCEEDINGS OF THE 24TH ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY & DATA MINING, 2018, : 1320 - 1329
  • [24] Retrofitting Structural Graph Embeddings with Node Attribute Information
    Bielak, Piotr
    Puchalska, Daria
    Kajdanowicz, Tomasz
    COMPUTATIONAL SCIENCE - ICCS 2022, PT I, 2022, : 178 - 191
  • [25] Learning structured embeddings of knowledge graphs with generative adversarial framework
    Liu, Lu
    Zeng, Jiehang
    Zheng, Xiaoqing
    EXPERT SYSTEMS WITH APPLICATIONS, 2022, 204
  • [26] FONDUE: A Framework for Node Disambiguation and Deduplication Using Network Embeddings
    Mel, Ahmad
    Kang, Bo
    Lijffijt, Jefrey
    De Bie, Tijl
    APPLIED SCIENCES-BASEL, 2021, 11 (21):
  • [27] Node rewriting in graphs and hypergraphs: a categorical framework
    Bauderon, M
    Jacquet, H
    THEORETICAL COMPUTER SCIENCE, 2001, 266 (1-2) : 463 - 487
  • [28] A Transparent Framework for Evaluating Unintended Demographic Bias in Word Embeddings
    Sweeney, Chris
    Najafian, Maryam
    57TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2019), 2019, : 1662 - 1667
  • [29] Evaluating the hyponym attachments in an unsupervised taxonomy enrichment framework
    Chifu, Emil St.
    Chifu, Viorica R.
    WEBIST 2008: PROCEEDINGS OF THE FOURTH INTERNATIONAL CONFERENCE ON WEB INFORMATION SYSTEMS AND TECHNOLOGIES, VOL 2, 2008, : 247 - 252
  • [30] ExFaKT: A Framework for Explaining Facts over Knowledge Graphs and Text
    Gad-Elrab, Mohamed H.
    Stepanova, Dania
    Urbani, Jacopo
    Weikum, Gerhard
    PROCEEDINGS OF THE TWELFTH ACM INTERNATIONAL CONFERENCE ON WEB SEARCH AND DATA MINING (WSDM'19), 2019, : 87 - 95