Query Processing on Large Graphs: Scalability Through Partitioning

Cited by: 3
Authors
Bodra, Jay
Das, Soumyava
Santra, Abhishek [1]
Chakravarthy, Sharma
Affiliation
[1] UT Arlington, IT Lab, Arlington, TX 76019 USA
DOI
10.1007/978-3-319-98539-8_21
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Graphs, as an expressive data structure, have become increasingly important for modeling real-world applications (collaboration, different kinds of transactions, and social networks, to name a few). With the advent of social networks and the web, graphs have grown too large to fit in main memory. This calls for alternative approaches and algorithms, along with their analysis, to develop efficient, scalable evaluation of queries on graphs of any size. In this paper, we use the time-tested "divide and conquer" approach by partitioning a graph into a desired number of partitions and processing queries over those partitions to obtain all or a specified number of answers. This entails correctly computing answers that span multiple partitions or need the same partition more than once. A query evaluation approach, along with the necessary minimal bookkeeping, is proposed and its correctness established. Query answering on partitioned graphs also requires analyzing partitioning schemes for their impact on query processing and determining the number of partitions loaded, as well as the sequence in which they are loaded, to reduce the response time for processing one query or a batch of queries. We correlate query properties and partition characteristics to reduce query processing time in terms of the number of partitions loaded. We identify a set of quantitative metrics and use them to formulate heuristics that determine the order of loading partitions for efficient query processing. Extensive experiments on large graphs (synthetic and real-world) using different partitioning schemes analyze the proposed heuristics on a variety of query types. An existing graph querying system has been extended to evaluate queries on partitioned graphs.
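The partition-at-a-time idea in the abstract can be illustrated with a small sketch. This is not the authors' algorithm: the function name `partitioned_reachable`, the toy graph, and the "most pending nodes first" load-order heuristic are all assumptions made for illustration, and a simple reachability query stands in for the general queries the paper evaluates. The sketch keeps only one partition's adjacency list "in memory" at a time, records per-partition bookkeeping for nodes whose expansion must wait, and counts how often each partition is loaded.

```python
from collections import defaultdict, deque


def partitioned_reachable(partitions, owner, source):
    """Reachability from `source`, expanding one partition at a time.

    partitions: partition id (int) -> adjacency dict for nodes owned by it
    owner:      node -> id of the partition that stores its outgoing edges
    Returns (set of reachable nodes, list of partition ids in load order).
    """
    visited = set()
    # Bookkeeping: nodes that were reached but whose partition is not loaded.
    pending = defaultdict(set)
    pending[owner[source]].add(source)
    loads = []

    while pending:
        # Hypothetical heuristic: load the partition with the most pending
        # nodes, breaking ties by the smallest partition id.
        pid = max(pending, key=lambda p: (len(pending[p]), -p))
        frontier = deque(pending.pop(pid))
        adjacency = partitions[pid]  # stands in for reading a partition from disk
        loads.append(pid)

        while frontier:
            node = frontier.popleft()
            if node in visited:
                continue
            visited.add(node)
            for neighbor in adjacency.get(node, ()):
                if neighbor in visited:
                    continue
                if owner[neighbor] == pid:
                    frontier.append(neighbor)  # continue inside this partition
                else:
                    pending[owner[neighbor]].add(neighbor)  # defer to a later load

    return visited, loads


# Toy graph: the path a -> b -> c -> d -> e leaves partition 0 and returns to it.
partitions = {
    0: {"a": ["b"], "b": ["c"], "e": []},
    1: {"c": ["d"], "d": ["e"], "f": ["a"]},
}
owner = {"a": 0, "b": 0, "e": 0, "c": 1, "d": 1, "f": 1}

reachable, loads = partitioned_reachable(partitions, owner, "a")
```

In this toy run partition 0 is loaded twice, which is the "need the same partition more than once" case the abstract mentions; a different load-order heuristic would change `loads` but not the set of reachable nodes.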
Pages: 271-288
Page count: 18