Efficient Query Processing on Graph Databases

Cited by: 32
Authors
Cheng, James [1]
Ke, Yiping [2]
Ng, Wilfred [3]
Affiliations
[1] Nanyang Technol Univ, Sch Comp Engn, Singapore, Singapore
[2] Chinese Univ Hong Kong, Dept Syst Engn & Management, Hong Kong, Hong Kong, Peoples R China
[3] Hong Kong Univ Sci & Technol, Clear Water Bay, Hong Kong, Peoples R China
Source
ACM TRANSACTIONS ON DATABASE SYSTEMS | 2009 / Vol. 34 / Issue 01
Keywords
Algorithms; Experimentation; Performance; Graph databases; graph indexing; graph query processing; frequent subgraphs
DOI
10.1145/1508857.1508859
Chinese Library Classification
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
We study the problem of processing subgraph queries on a database that consists of a set of graphs. The answer to a subgraph query is the set of graphs in the database that are supergraphs of the query. In this article, we propose an efficient index, FG*-index, to solve this problem. The cost of processing a subgraph query using most existing indexes mainly consists of two parts: the index probing cost and the candidate verification cost. Index probing is to find the query in the index, or to find the graphs from which we can generate a candidate answer set for the query. Candidate verification is to test whether each graph in the candidate set is indeed a supergraph of the query. We design FG*-index to minimize these two costs as follows. FG*-index consists of three components: the FG-index, the feature-index, and the FAQ-index. First, the FG-index employs the concept of Frequent subGraph (FG) to allow the set of queries that are FGs to be answered without candidate verification. We call this set of queries FG-queries. We can enlarge the set of FG-queries so that more queries can be answered without candidate verification; however, a larger set of FG-queries implies a larger FG-index and hence the index probing cost also increases. We propose the feature-index to reduce the index probing cost. The feature-index uses features to filter false results that are matched in the FG-index, so that we can quickly find the truly matching graphs for a query. For processing non-FG-queries, we propose the FAQ-index, which is dynamically constructed from the set of Frequently Asked non-FG-Queries (FAQs). Using the FAQ-index, verification is not required for processing FAQs and only a small number of candidates need to be verified for processing non-FG-queries that are not frequently asked. Finally, a comprehensive set of experiments verifies that query processing using FG*-index is up to orders of magnitude more efficient than state-of-the-art indexes and it is also more scalable.
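The two cost components named in the abstract, index probing and candidate verification, can be illustrated with a generic filter-and-verify sketch. This is not the FG*-index: it assumes a much simpler inverted index keyed on edge label pairs, vertex-labeled undirected graphs without self-loops, and a plain backtracking subgraph test. All names here (`make_graph`, `edge_features`, `build_index`, `is_subgraph`, `answer`) are hypothetical and chosen only for illustration.

```python
def make_graph(labels, edges):
    """A graph is a dict of vertex labels plus an adjacency map (undirected)."""
    adj = {v: set() for v in labels}
    for u, v in edges:
        adj[u].add(v)
        adj[v].add(u)
    return {"labels": dict(labels), "adj": adj}


def edge_features(g):
    """One feature per edge: the sorted pair of its endpoint labels."""
    lab, adj = g["labels"], g["adj"]
    return {tuple(sorted((lab[u], lab[v]))) for u in adj for v in adj[u]}


def build_index(database):
    """Inverted index: feature -> ids of the graphs that contain it."""
    index = {}
    for gid, g in database.items():
        for f in edge_features(g):
            index.setdefault(f, set()).add(gid)
    return index


def is_subgraph(query, graph):
    """Verification: search for an injective, label-preserving vertex mapping
    that sends every query edge to an edge of the graph."""
    q_vertices = list(query["labels"])

    def extend(mapping):
        if len(mapping) == len(q_vertices):
            return True
        u = q_vertices[len(mapping)]
        for v in graph["labels"]:
            if v in mapping.values():
                continue
            if graph["labels"][v] != query["labels"][u]:
                continue
            # Already-mapped neighbours of u must land on neighbours of v.
            if all(mapping[w] in graph["adj"][v]
                   for w in query["adj"][u] if w in mapping):
                mapping[u] = v
                if extend(mapping):
                    return True
                del mapping[u]
        return False

    return extend({})


def answer(query, database, index):
    """Probe the index for candidates, then verify each candidate."""
    candidates = set(database)
    for f in edge_features(query):
        candidates &= index.get(f, set())
    answers = {gid for gid in candidates
               if is_subgraph(query, database[gid])}
    return candidates, answers


database = {
    "g1": make_graph({0: "A", 1: "B", 2: "C"}, [(0, 1), (1, 2), (0, 2)]),
    "g2": make_graph({0: "A", 1: "B", 2: "C"}, [(0, 1), (1, 2)]),
    # Has both features of the query, but on two different B vertices.
    "g3": make_graph({0: "A", 1: "B", 2: "B", 3: "C"}, [(0, 1), (2, 3)]),
    "g4": make_graph({0: "A", 1: "A"}, [(0, 1)]),
}
index = build_index(database)
query = make_graph({0: "A", 1: "B", 2: "C"}, [(0, 1), (1, 2)])  # path A-B-C

candidates, answers = answer(query, database, index)
print(sorted(candidates))  # ['g1', 'g2', 'g3']
print(sorted(answers))     # ['g1', 'g2']
```

In this toy run, g3 passes the filter but fails verification, which is the kind of false candidate that makes verification costly. The abstract's FG-index targets exactly this step by answering queries that are frequent subgraphs without any verification.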
Pages: 48