Query Processing on Large Graphs: Scalability Through Partitioning

被引:3
|
作者
Bodra, Jay
Das, Soumyava
Santra, Abhishek [1 ]
Chakravarthy, Sharma
机构
[1] UT Arlington, IT Lab, Arlington, TX 76019 USA
关键词
D O I
10.1007/978-3-319-98539-8_21
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Graphs, as an expressive data structure, have become increasingly important for modeling real-world applications (collaboration, different kinds of transactions, social networks, to name a few.) With the advent of social networks and the web, the graphs have grown too large to fit in main memory. This calls for alternative approaches, algorithms, and their analysis to develop an efficient, scalable evaluation of queries on graphs of any size. In this paper, we use the time-tested "divide and conquer" approach by partitioning a graph into desired number of partitions and process queries over those partitions to obtain all or specified number of answers. This entails correctly computing answers that span multiple partitions or need the same partition more than once. A query evaluation approach along with the necessary minimal book keeping is proposed and its correctness established. Query answering on partitioned graphs also requires analyzing partitioning schemes for their impact on query processing and determining the number as well as the sequence in which partitions are loaded to reduce the response time for processing one or a batch of queries. We correlate query properties and partition characteristics to reduce query processing time in terms of the number of partitions loaded. We identify a set of quantitative metrics and use them for formulating heuristics to determine the order of loading partitions for efficient query processing. Extensive experiments on large graphs (synthetic and real-world) using different partitioning schemes analyze the proposed heuristics on a variety of query types. An existing graph querying system has been extended to evaluate queries on partitioned graphs.
引用
收藏
页码:271 / 288
页数:18
相关论文
共 50 条
  • [41] Hybrid query execution engine for large attributed graphs
    Sakr, Sherif
    Elnikety, Sameh
    He, Yuxiong
    INFORMATION SYSTEMS, 2014, 41 : 45 - 73
  • [42] Most similar maximal clique query on large graphs
    Peng, Yun
    Xu, Yitong
    Zhao, Huawei
    Zhou, Zhizheng
    Han, Huimin
    FRONTIERS OF COMPUTER SCIENCE, 2020, 14 (03)
  • [43] Fast and Accurate Optimizer for Query Processing over Knowledge Graphs
    Wu, Jingqi
    Chen, Rong
    Xia, Yubin
    PROCEEDINGS OF THE 2021 ACM SYMPOSIUM ON CLOUD COMPUTING (SOCC '21), 2021, : 503 - 517
  • [44] Large Scale Hamming Distance Query Processing
    Liu, Alex X.
    Shen, Ke
    Torng, Eric
    IEEE 27TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE 2011), 2011, : 553 - 564
  • [45] Efficient query processing in large traffic networks
    Kriegel, Hans-Peter
    Kroeger, Peer
    Kunath, Peter
    Renz, Matthias
    Schmidt, Tim
    2008 IEEE 24TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING, VOLS 1-3, 2008, : 1451 - +
  • [46] Exploiting early sorting and early partitioning for decision support query processing
    J. Claussen
    A. Kemper
    D. Kossmann
    C. Wiesner
    The VLDB Journal, 2000, 9 : 190 - 213
  • [47] Inverted file partitioning for distributed query processing in information retrieval systems
    Srisawat, J
    Alexandridis, N
    OConnell, M
    PARALLEL AND DISTRIBUTED COMPUTING SYSTEMS - PROCEEDINGS OF THE ISCA 9TH INTERNATIONAL CONFERENCE, VOLS I AND II, 1996, : 738 - 743
  • [48] THE EFFECT OF INDEX PARTITIONING SCHEMES ON THE PERFORMANCE OF DISTRIBUTED QUERY-PROCESSING
    LIEBEHERR, J
    OMIECINSKI, ER
    AKYILDIZ, IF
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 1993, 5 (03) : 510 - 522
  • [49] Jigsaw: A Data Storage and Query Processing Engine for Irregular Table Partitioning
    Kang, Donghe
    Jiang, Ruochen
    Blanas, Spyros
    SIGMOD '21: PROCEEDINGS OF THE 2021 INTERNATIONAL CONFERENCE ON MANAGEMENT OF DATA, 2021, : 898 - 911
  • [50] Exploiting early sorting and early partitioning for decision support query processing
    Claussen, J
    Kemper, A
    Kossmann, D
    Wiesner, C
    VLDB JOURNAL, 2000, 9 (03): : 190 - 213