Understanding Coarsening for Embedding Large-Scale Graphs

被引:2
|
作者
Akyildiz, Taha Atahan [1 ]
Aljundi, Amro Alabsi [1 ]
Kaya, Kamer [1 ]
机构
[1] Sabanci Univ, Fac Engn & Nat Sci, Istanbul, Turkey
关键词
Graph coarsening; graph embedding; multi-level approach; SCHEME;
D O I
10.1109/BigData50022.2020.9377898
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
A significant portion of the data today, e.g, social networks, web connections, etc., can be modeled by graphs. A proper analysis of graphs with Machine Learning (ML) algorithms has the potential to yield far-reaching insights into many areas of research and industry. However, the irregular structure of graph data constitutes an obstacle for running ML tasks on graphs such as link prediction, node classification, and anomaly detection. Graph embedding is a compute-intensive process of representing graphs as a set of vectors in a d-dimensional space, which in turn makes it amenable to ML tasks. Many approaches have been proposed in the literature to improve the performance of graph embedding, e.g., using distributed algorithms, accelerators, and pre-processing techniques. Graph coarsening, which can be considered a pre-processing step, is a structural approximation of a given, large graph with a smaller one. As the literature suggests, the cost of embedding significantly decreases when coarsening is employed. In this work, we thoroughly analyze the impact of the coarsening quality on the embedding performance both in terms of speed and accuracy. Our experiments with a state-of-the-art, fast graph embedding tool show that there is an interplay between the coarsening decisions taken and the embedding quality.
引用
收藏
页码:2937 / 2946
页数:10
相关论文
共 50 条
  • [21] Generating Large-Scale Heterogeneous Graphs for Benchmarking
    Gupta, Amarnath
    SPECIFYING BIG DATA BENCHMARKS, 2014, 8163 : 113 - 128
  • [22] Efficient mining algorithms for large-scale graphs
    Kishimoto, Yasunari
    Shiokawa, Hiroaki
    Fujiwara, Yasuhiro
    Onizuka, Makoto
    NTT Technical Review, 2013, 11 (12):
  • [23] Parallel generation of large-scale random graphs
    Vullikanti, Anil
    2018 IEEE INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM WORKSHOPS (IPDPSW 2018), 2018, : 278 - 278
  • [24] Large-scale Machine Learning over Graphs
    Yang, Yiming
    PROCEEDINGS OF THE 2018 ACM SIGIR INTERNATIONAL CONFERENCE ON THEORY OF INFORMATION RETRIEVAL (ICTIR'18), 2018, : 9 - 9
  • [25] Large-scale quantum networks based on graphs
    Epping, Michael
    Kampermann, Hermann
    Bruss, Dagmar
    NEW JOURNAL OF PHYSICS, 2016, 18
  • [26] Adaptive Partitioning of Large-Scale Dynamic Graphs
    Vaquero, Luis M.
    Cuadrado, Felix
    Logothetis, Dionysios
    Martella, Claudio
    2014 IEEE 34TH INTERNATIONAL CONFERENCE ON DISTRIBUTED COMPUTING SYSTEMS (ICDCS 2014), 2014, : 144 - 153
  • [28] Multilevel Parallelism for the Exploration of Large-Scale Graphs
    Bernaschi, Massimo
    Bisson, Mauro
    Mastrostefano, Enrico
    Vella, Flavio
    IEEE TRANSACTIONS ON MULTI-SCALE COMPUTING SYSTEMS, 2018, 4 (03): : 204 - 216
  • [29] Large-Scale Image Categorization with Explicit Data Embedding
    Perronnin, Florent
    Sanchez, Jorge
    Liu, Yan
    2010 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2010, : 2297 - 2304
  • [30] Evolutionary compact embedding for large-scale image classification
    Liu, Li
    Shao, Ling
    Li, Xuelong
    INFORMATION SCIENCES, 2015, 316 : 567 - 581