Understanding Coarsening for Embedding Large-Scale Graphs

被引:2
|
作者
Akyildiz, Taha Atahan [1 ]
Aljundi, Amro Alabsi [1 ]
Kaya, Kamer [1 ]
机构
[1] Sabanci Univ, Fac Engn & Nat Sci, Istanbul, Turkey
关键词
Graph coarsening; graph embedding; multi-level approach; SCHEME;
D O I
10.1109/BigData50022.2020.9377898
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
A significant portion of the data today, e.g, social networks, web connections, etc., can be modeled by graphs. A proper analysis of graphs with Machine Learning (ML) algorithms has the potential to yield far-reaching insights into many areas of research and industry. However, the irregular structure of graph data constitutes an obstacle for running ML tasks on graphs such as link prediction, node classification, and anomaly detection. Graph embedding is a compute-intensive process of representing graphs as a set of vectors in a d-dimensional space, which in turn makes it amenable to ML tasks. Many approaches have been proposed in the literature to improve the performance of graph embedding, e.g., using distributed algorithms, accelerators, and pre-processing techniques. Graph coarsening, which can be considered a pre-processing step, is a structural approximation of a given, large graph with a smaller one. As the literature suggests, the cost of embedding significantly decreases when coarsening is employed. In this work, we thoroughly analyze the impact of the coarsening quality on the embedding performance both in terms of speed and accuracy. Our experiments with a state-of-the-art, fast graph embedding tool show that there is an interplay between the coarsening decisions taken and the embedding quality.
引用
收藏
页码:2937 / 2946
页数:10
相关论文
共 50 条
  • [1] Gaussian Embedding of Large-Scale Attributed Graphs
    Hettige, Bhagya
    Li, Yuan-Fang
    Wang, Weiqing
    Buntine, Wray
    DATABASES THEORY AND APPLICATIONS, ADC 2020, 2020, 12008 : 134 - 146
  • [2] Approximate Deep Network Embedding for Mining Large-scale Graphs
    Zhou, Yang
    Liu, Ling
    2019 IEEE FIRST INTERNATIONAL CONFERENCE ON COGNITIVE MACHINE INTELLIGENCE (COGMI 2019), 2019, : 53 - 60
  • [3] Large-Scale Heterogeneous Feature Embedding
    Huang, Xiao
    Song, Qingquan
    Yang, Fan
    Hu, Xia
    THIRTY-THIRD AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FIRST INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / NINTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2019, : 3878 - 3885
  • [4] Finding Structures in Large-scale Graphs
    Chin, Sang Peter
    Reilly, Elizabeth
    Lu, Linyuan
    CYBER SENSING 2012, 2012, 8408
  • [5] Large-scale structures in random graphs
    Bottcher, Julia
    SURVEYS IN COMBINATORICS 2017, 2017, 440 : 87 - 140
  • [6] Visualizing large-scale high-dimensional data via hierarchical embedding of KNN graphs
    Zhu, Haiyang
    Zhu, Minfeng
    Feng, Yingchaojie
    Cai, Deng
    Hu, Yuanzhe
    Wu, Shilong
    Wu, Xiangyang
    Chen, Wei
    VISUAL INFORMATICS, 2021, 5 (02) : 51 - 59
  • [7] Large-Scale Network Embedding in Apache Spark
    Lin, Wenqing
    KDD '21: PROCEEDINGS OF THE 27TH ACM SIGKDD CONFERENCE ON KNOWLEDGE DISCOVERY & DATA MINING, 2021, : 3271 - 3279
  • [8] LINE: Large-scale Information Network Embedding
    Tang, Jian
    Qu, Meng
    Wang, Mingzhe
    Zhang, Ming
    Yan, Jun
    Mei, Qiaozhu
    PROCEEDINGS OF THE 24TH INTERNATIONAL CONFERENCE ON WORLD WIDE WEB (WWW 2015), 2015, : 1067 - 1077
  • [9] Decentralized Embedding Framework for Large-Scale Networks
    Imran, Mubashir
    Yin, Hongzhi
    Chen, Tong
    Shao, Yingxia
    Zhang, Xiangliang
    Zhou, Xiaofang
    DATABASE SYSTEMS FOR ADVANCED APPLICATIONS (DASFAA 2020), PT III, 2020, 12114 : 425 - 441
  • [10] Large-Scale Clustering through Functional Embedding
    Ratle, Frederic
    Weston, Jason
    Miller, Matthew L.
    MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES, PART II, PROCEEDINGS, 2008, 5212 : 266 - +