Distance-Join: Pattern Match Query In a Large Graph Database

Cited by: 10
Authors
Zou, Lei [1]
Chen, Lei [2]
Oezsu, M. Tamer [3]
Affiliations
[1] Huazhong Univ Sci & Technol, Wuhan, Hubei, Peoples R China
[2] Hong Kong Univ Sci & Technol, Hong Kong, Hong Kong, Peoples R China
[3] Univ Waterloo, Waterloo, ON, Canada
Source
PROCEEDINGS OF THE VLDB ENDOWMENT | 2009 / Vol. 2 / Issue 01
Funding
Natural Sciences and Engineering Research Council of Canada; National Natural Science Foundation of China;
Keywords
28;
DOI
10.14778/1687627.1687727
Chinese Library Classification number
TP [Automation Technology, Computer Technology];
Discipline classification code
0812;
Abstract
The growing popularity of graph databases has generated interesting data management problems, such as subgraph search, shortest-path query, reachability verification, and pattern match. Among these, a pattern match query is more flexible than a subgraph search and more informative than a shortest-path or reachability query. In this paper, we address pattern match problems over a large data graph G. Specifically, given a pattern graph (i.e., query Q), we want to find all matches (in G) that have connections similar to those in Q. In order to reduce the search space significantly, we first transform the vertices into points in a vector space via graph embedding techniques, converting a pattern match query into a distance-based multi-way join problem over the converted vector space. We also propose several pruning strategies and a join order selection method to perform join processing efficiently. Extensive experiments on both real and synthetic datasets show that our method outperforms existing ones by orders of magnitude.
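The embedding idea in the abstract can be illustrated with a minimal sketch. It is not the paper's implementation: it assumes an undirected, unweighted graph and a simple landmark-based embedding, in which each vertex is mapped to its vector of hop distances to a few chosen landmark vertices. By the triangle inequality, the L-infinity distance between two such vectors never exceeds the true shortest-path distance, so a candidate pair whose vector distance is already above the threshold can be discarded without a graph search. All names (`bfs_distances`, `embed`, `linf`, `distance_join`) and the threshold parameter `delta` are hypothetical, and only a single two-way join is shown rather than a multi-way join.

```python
from collections import deque

INF = float("inf")


def bfs_distances(adj, source):
    """Hop-count shortest-path distances from source (unweighted graph)."""
    dist = {source: 0}
    queue = deque([source])
    while queue:
        u = queue.popleft()
        for v in adj[u]:
            if v not in dist:
                dist[v] = dist[u] + 1
                queue.append(v)
    return dist


def embed(adj, landmarks):
    """Map each vertex to its vector of distances to the landmarks (assumed scheme)."""
    per_landmark = [bfs_distances(adj, lm) for lm in landmarks]
    return {v: tuple(d.get(v, INF) for d in per_landmark) for v in adj}


def linf(p, q):
    """L-infinity distance; equal coordinates (including inf, inf) contribute 0."""
    return max(0 if a == b else abs(a - b) for a, b in zip(p, q))


def distance_join(adj, coords, left, right, delta):
    """Return pairs (u, v), u in left and v in right, with distance <= delta."""
    result = []
    exact = {}  # cache of exact BFS results, one per left vertex that needs it
    for u in left:
        for v in right:
            # Filter step: the vector distance is a lower bound on the
            # shortest-path distance, so this pair cannot qualify.
            if linf(coords[u], coords[v]) > delta:
                continue
            # Verification step: confirm the survivor with an exact search.
            if u not in exact:
                exact[u] = bfs_distances(adj, u)
            if exact[u].get(v, INF) <= delta:
                result.append((u, v))
    return result


# Path graph a - b - c - d - e with a single landmark "a".
adj = {
    "a": ["b"],
    "b": ["a", "c"],
    "c": ["b", "d"],
    "d": ["c", "e"],
    "e": ["d"],
}
coords = embed(adj, ["a"])
pairs = distance_join(adj, coords, ["a", "b"], ["d", "e"], delta=2)
print(pairs)  # [('b', 'd')]
```

In this toy example three of the four candidate pairs are rejected by the vector-space filter alone, and only one exact search is run. More landmarks generally tighten the lower bound at the cost of longer vectors.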
Pages: 886 - 897
Number of pages: 12
Related papers
50 in total
  • [31] A framework for join pattern indexing in intelligent database systems
    Segev, A
    Zhao, JL
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 1995, 7 (06) : 941 - 947
  • [32] Testing Graph Database Engines via Query Partitioning
    Kamm, Matteo
    Rigger, Manuel
    Zhang, Chengyu
    Su, Zhendong
    PROCEEDINGS OF THE 32ND ACM SIGSOFT INTERNATIONAL SYMPOSIUM ON SOFTWARE TESTING AND ANALYSIS, ISSTA 2023, 2023, : 140 - 149
  • [33] ON THE QUERY COMPLEXITY OF ESTIMATING THE DISTANCE TO HEREDITARY GRAPH PROPERTIES
    Hoppen, Carlos
    Kohayakawa, Yoshiharu
    Lang, Richard
    Lefmann, Hanno
    Stagni, Henrique
    SIAM JOURNAL ON DISCRETE MATHEMATICS, 2021, 35 (02) : 1238 - 1251
  • [34] Discussion of Graph Reachability Query with Keyword and Distance Constraint
    Wen Juping
    INTELLIGENT DATA ENGINEERING AND AUTOMATED LEARNING - IDEAL 2016, 2016, 9937 : 293 - 301
  • [35] A New Query Method Based on Distance Education Database
    Gu, Wenxuan
    Yuan, Cao
    PROCEEDINGS OF THE 6TH INTERNATIONAL CONFERENCE ON ELECTRONIC, MECHANICAL, INFORMATION AND MANAGEMENT SOCIETY (EMIM), 2016, 40 : 637 - 641
  • [36] Large-Scale Spatial Join Query Processing in Cloud
    You, Simin
    Zhang, Jianting
    Gruenwald, Le
    2015 13TH IEEE INTERNATIONAL CONFERENCE ON DATA ENGINEERING WORKSHOPS (ICDEW), 2015, : 34 - 41
  • [37] Match Graph Construction for Large Image Databases
    Kim, Kwang In
    Tompkin, James
    Theobald, Martin
    Kautz, Jan
    Theobalt, Christian
    COMPUTER VISION - ECCV 2012, PT I, 2012, 7572 : 272 - 285
  • [38] Research on Optimizing Strategy of Database-oriented GIS Graph database Query
    Wu, Xinxin
    Deng, Song
    PROCEEDINGS OF 2018 5TH IEEE INTERNATIONAL CONFERENCE ON CLOUD COMPUTING AND INTELLIGENCE SYSTEMS (CCIS), 2018, : 305 - 309
  • [39] Optimized subgraph match over large graph
    Lou, Y. (xianyinglou@gmail.com), 1600, Binary Information Press (10):
  • [40] On the Distance Pattern Distinguishing Number of a Graph
    Jose, Sona
    Augustine, Germina K.
    JOURNAL OF APPLIED MATHEMATICS, 2014,