Query Processing on Large Graphs: Scalability Through Partitioning

Cited by: 3
Authors
Bodra, Jay
Das, Soumyava
Santra, Abhishek [1]
Chakravarthy, Sharma
Affiliation
[1] UT Arlington, IT Lab, Arlington, TX 76019 USA
DOI
10.1007/978-3-319-98539-8_21
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification code
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Graphs, as an expressive data structure, have become increasingly important for modeling real-world applications (collaboration, different kinds of transactions, and social networks, to name a few). With the advent of social networks and the web, graphs have grown too large to fit in main memory. This calls for alternative approaches, algorithms, and their analysis to develop efficient, scalable evaluation of queries on graphs of any size. In this paper, we use the time-tested "divide and conquer" approach: we partition a graph into a desired number of partitions and process queries over those partitions to obtain all, or a specified number of, answers. This entails correctly computing answers that span multiple partitions or that need the same partition more than once. A query evaluation approach, along with the minimal bookkeeping it requires, is proposed and its correctness established. Query answering on partitioned graphs also requires analyzing partitioning schemes for their impact on query processing, and determining both the number of partitions loaded and the sequence in which they are loaded, to reduce the response time for processing a single query or a batch of queries. We correlate query properties and partition characteristics to reduce query processing time, measured in terms of the number of partitions loaded. We identify a set of quantitative metrics and use them to formulate heuristics that determine the order of loading partitions for efficient query processing. Extensive experiments on large graphs (synthetic and real-world) using different partitioning schemes analyze the proposed heuristics on a variety of query types. An existing graph querying system has been extended to evaluate queries on partitioned graphs.
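The partition-at-a-time idea in the abstract can be illustrated with a small sketch. It is not the paper's algorithm: it assumes a simple reachability query, an in-memory dictionary standing in for partitions on disk, and a made-up loading heuristic (load the partition with the most pending vertices). The names `reachable`, `partitions`, `owner`, and `pending` are hypothetical. The `pending` map plays the role of the bookkeeping that lets an answer continue across partitions and lets the same partition be loaded more than once.

```python
from collections import defaultdict, deque


def reachable(partitions, owner, source):
    """Return (set of vertices reachable from source, list of partition loads).

    partitions: partition id (int) -> adjacency dict {vertex: [neighbors]}
    owner:      vertex -> id of the partition that stores its out-edges
    """
    visited = {source}
    # Bookkeeping: vertices discovered but not yet expanded, grouped by
    # the partition that must be loaded to expand them.
    pending = defaultdict(set)
    pending[owner[source]].add(source)
    loads = []

    while pending:
        # Assumed heuristic: pick the partition with the most pending
        # vertices, breaking ties in favor of the smallest id.
        pid = max(pending, key=lambda p: (len(pending[p]), -p))
        queue = deque(pending.pop(pid))
        adjacency = partitions[pid]  # stands in for loading from disk
        loads.append(pid)

        while queue:
            v = queue.popleft()
            for w in adjacency.get(v, ()):
                if w in visited:
                    continue
                visited.add(w)
                if owner[w] == pid:
                    queue.append(w)  # keep expanding inside this partition
                else:
                    pending[owner[w]].add(w)  # defer to another partition


    return visited, loads


# Toy graph: the path 1 -> 2 -> 3 -> 4 -> 5 -> 6 leaves partition 0,
# crosses partition 1, and returns to partition 0. Vertex 7 is unreachable.
partitions = {
    0: {1: [2], 2: [3], 5: [6]},
    1: {3: [4], 4: [5], 7: [1]},
}
owner = {1: 0, 2: 0, 5: 0, 6: 0, 3: 1, 4: 1, 7: 1}

visited, loads = reachable(partitions, owner, source=1)
print(sorted(visited), loads)
```

In this toy run partition 0 is loaded twice, which is the situation the abstract describes as needing the same partition more than once. The length of `loads` corresponds to the cost measure the abstract uses, the number of partitions loaded.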
Pages: 271-288
Page count: 18