Query Processing on Large Graphs: Scalability Through Partitioning

被引:3
|
作者
Bodra, Jay
Das, Soumyava
Santra, Abhishek [1 ]
Chakravarthy, Sharma
机构
[1] UT Arlington, IT Lab, Arlington, TX 76019 USA
关键词
D O I
10.1007/978-3-319-98539-8_21
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Graphs, as an expressive data structure, have become increasingly important for modeling real-world applications (collaboration, different kinds of transactions, social networks, to name a few.) With the advent of social networks and the web, the graphs have grown too large to fit in main memory. This calls for alternative approaches, algorithms, and their analysis to develop an efficient, scalable evaluation of queries on graphs of any size. In this paper, we use the time-tested "divide and conquer" approach by partitioning a graph into desired number of partitions and process queries over those partitions to obtain all or specified number of answers. This entails correctly computing answers that span multiple partitions or need the same partition more than once. A query evaluation approach along with the necessary minimal book keeping is proposed and its correctness established. Query answering on partitioned graphs also requires analyzing partitioning schemes for their impact on query processing and determining the number as well as the sequence in which partitions are loaded to reduce the response time for processing one or a batch of queries. We correlate query properties and partition characteristics to reduce query processing time in terms of the number of partitions loaded. We identify a set of quantitative metrics and use them for formulating heuristics to determine the order of loading partitions for efficient query processing. Extensive experiments on large graphs (synthetic and real-world) using different partitioning schemes analyze the proposed heuristics on a variety of query types. An existing graph querying system has been extended to evaluate queries on partitioned graphs.
引用
收藏
页码:271 / 288
页数:18
相关论文
共 50 条
  • [21] Partitioning approach to visualization of large graphs
    Batagelj, V
    Mrvar, A
    Zaversnik, M
    GRAPH DRAWING, 1999, 1731 : 90 - 97
  • [22] Vertical Partitioning for Query Processing over Raw Data
    Zhao, Weijie
    Cheng, Yu
    Rusu, Florin
    PROCEEDINGS OF THE 27TH INTERNATIONAL CONFERENCE ON SCIENTIFIC AND STATISTICAL DATABASE MANAGEMENT, 2015,
  • [23] DIAERESIS: RDF data partitioning and query processing on SPARK
    Troullinou, Georgia
    Agathangelos, Giannis
    Kondylakis, Haridimos
    Stefanidis, Kostas
    Plexousakis, Dimitris
    SEMANTIC WEB, 2024, 15 (05) : 1763 - 1789
  • [24] Mining Large Query Induced Graphs towards a Hierarchical Query Folksonomy
    Francisco, Alexandre P.
    Baeza-Yates, Ricardo
    Oliveira, Arlindo L.
    STRING PROCESSING AND INFORMATION RETRIEVAL, 2010, 6393 : 237 - +
  • [26] Scalability and Parallel Execution of Warp Processing: Dynamic Hardware/Software Partitioning
    Roman Lysecky
    International Journal of Parallel Programming, 2008, 36 : 478 - 492
  • [27] CoUPE: Continuous Query Processing Engine for Evolving Graphs
    Mullangi, Phani Rohit
    Ramaswamy, Lakshmish
    2015 IEEE INTERNATIONAL CONGRESS ON BIG DATA - BIGDATA CONGRESS 2015, 2015, : 142 - 149
  • [28] General-purpose query processing on summary graphs
    Anagnostopoulos, Aris
    Arrigoni, Valentina
    Gullo, Francesco
    Salvatori, Giorgia
    Severini, Lorenzo
    SOCIAL NETWORK ANALYSIS AND MINING, 2024, 14 (01)
  • [29] Acyclic Partitioning of Large Directed Acyclic Graphs
    Herrmann, Julien
    Kho, Jonathan
    Ucar, Bora
    Kaya, Kamer
    Catalyurek, Umit V.
    2017 17TH IEEE/ACM INTERNATIONAL SYMPOSIUM ON CLUSTER, CLOUD AND GRID COMPUTING (CCGRID), 2017, : 371 - 380
  • [30] Adaptive hybrid partitioning for OLAP query processing in a database cluster
    Computer Science Department, COPPE, Federal University of Rio de Janeiro , P.O. Box 68511, 21941-972 Rio de Janeiro, Brazil
    不详
    不详
    Int. J. High Perform. Comput. Networking, 2008, 4 (251-262):