Efficient Query Processing on Graph Databases

Cited by: 32

Authors:

Cheng, James [1]

Ke, Yiping [2]

Ng, Wilfred [3]

Affiliations:

[1] Nanyang Technol Univ, Sch Comp Engn, Singapore, Singapore

[2] Chinese Univ Hong Kong, Dept Syst Engn & Management, Hong Kong, Hong Kong, Peoples R China

[3] Hong Kong Univ Sci & Technol, Clear Water Bay, Hong Kong, Peoples R China

Source:

ACM TRANSACTIONS ON DATABASE SYSTEMS | 2009 / Vol. 34 / No. 01

Keywords:

Algorithms; Experimentation; Performance; Graph databases; graph indexing; graph query processing; frequent subgraphs;

DOI:

10.1145/1508857.1508859

Chinese Library Classification (CLC) number:

TP [Automation technology, computer technology];

Discipline classification code:

0812;

Abstract:

We study the problem of processing subgraph queries on a database that consists of a set of graphs. The answer to a subgraph query is the set of graphs in the database that are supergraphs of the query. In this article, we propose an efficient index, FG*-index, to solve this problem. The cost of processing a subgraph query using most existing indexes mainly consists of two parts: the index probing cost and the candidate verification cost. Index probing is to find the query in the index, or to find the graphs from which we can generate a candidate answer set for the query. Candidate verification is to test whether each graph in the candidate set is indeed a supergraph of the query. We design FG*-index to minimize these two costs as follows. FG*-index consists of three components: the FG-index, the feature-index, and the FAQ-index. First, the FG-index employs the concept of Frequent subGraph (FG) to allow the set of queries that are FGs to be answered without candidate verification. We call this set of queries FG-queries. We can enlarge the set of FG-queries so that more queries can be answered without candidate verification; however, a larger set of FG-queries implies a larger FG-index and hence the index probing cost also increases. We propose the feature-index to reduce the index probing cost. The feature-index uses features to filter false results that are matched in the FG-index, so that we can quickly find the truly matching graphs for a query. For processing non-FG-queries, we propose the FAQ-index, which is dynamically constructed from the set of Frequently Asked non-FG-Queries (FAQs). Using the FAQ-index, verification is not required for processing FAQs and only a small number of candidates need to be verified for processing non-FG-queries that are not frequently asked. Finally, a comprehensive set of experiments verifies that query processing using FG*-index is up to orders of magnitude more efficient than state-of-the-art indexes and it is also more scalable.
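
The two costs named in the abstract (index probing and candidate verification) can be illustrated with a minimal filter-and-verify sketch. This is not the FG*-index itself: the `Graph` class, the brute-force `is_subgraph` test, the flat list used as an index, and the toy database are all assumptions made for illustration. The sketch only shows the general idea that a query matching an indexed subgraph is answered directly from its stored answer set, while any other query is filtered by the indexed subgraphs it contains and then verified graph by graph.

```python
class Graph:
    """Undirected vertex-labeled graph (hypothetical helper, no self-loops)."""
    def __init__(self, labels, edges):
        self.labels = dict(labels)                   # vertex -> label
        self.edges = {frozenset(e) for e in edges}   # unordered vertex pairs


def is_subgraph(q, g):
    """Brute-force test: is there an injective, label- and edge-preserving
    mapping from the vertices of q into the vertices of g?"""
    qv = list(q.labels)

    def extend(mapping):
        if len(mapping) == len(qv):
            return True
        u = qv[len(mapping)]
        for v in g.labels:
            if v in mapping.values() or g.labels[v] != q.labels[u]:
                continue
            # Every query edge from u to an already-mapped vertex must exist in g.
            if all(frozenset((v, mapping[w])) in g.edges
                   for w in mapping if frozenset((u, w)) in q.edges):
                mapping[u] = v
                if extend(mapping):
                    return True
                del mapping[u]
        return False

    return extend({})


def isomorphic(a, b):
    # With equal vertex and edge counts, an injective embedding is a bijection.
    return (len(a.labels) == len(b.labels)
            and len(a.edges) == len(b.edges)
            and is_subgraph(a, b))


def build_index(db, subgraphs):
    """Store, for each indexed subgraph, the ids of the graphs containing it."""
    return [(f, {gid for gid, g in db.items() if is_subgraph(f, g)})
            for f in subgraphs]


def answer(query, db, index):
    """Return (answer set, number of candidates that had to be verified)."""
    candidates = set(db)
    for f, ids in index:
        if isomorphic(f, query):
            return set(ids), 0           # indexed query: no verification needed
        if is_subgraph(f, query):
            candidates &= ids            # filtering step
    verified = {gid for gid in candidates if is_subgraph(query, db[gid])}
    return verified, len(candidates)


# Toy database: a triangle, a path, and a single edge.
db = {
    "g1": Graph({1: "A", 2: "B", 3: "C"}, [(1, 2), (2, 3), (1, 3)]),
    "g2": Graph({1: "A", 2: "B", 3: "C"}, [(1, 2), (2, 3)]),
    "g3": Graph({1: "A", 2: "C"}, [(1, 2)]),
}
edge_ab = Graph({1: "A", 2: "B"}, [(1, 2)])
edge_bc = Graph({1: "B", 2: "C"}, [(1, 2)])
index = build_index(db, [edge_ab, edge_bc])

triangle = Graph({1: "A", 2: "B", 3: "C"}, [(1, 2), (2, 3), (1, 3)])
print(answer(edge_ab, db, index))    # answered from the index alone
print(answer(triangle, db, index))   # filtered, then verified
```

In this toy setting the query `edge_ab` is answered with zero verifications, whereas `triangle` is narrowed to two candidates before the subgraph test runs on each. A real index would replace the linear scan and the brute-force test with the structures described in the article.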


Pages: 48

