Distance-Join: Pattern Match Query In a Large Graph Database

被引:10
|
作者
Zou, Lei [1 ]
Chen, Lei [2 ]
Oezsu, M. Tamer [3 ]
机构
[1] Huazhong Univ Sci & Technol, Wuhan, Hubei, Peoples R China
[2] Hong Kong Univ Sci & Technol, Hong Kong, Hong Kong, Peoples R China
[3] Univ Waterloo, Waterloo, ON, Canada
来源
PROCEEDINGS OF THE VLDB ENDOWMENT | 2009年 / 2卷 / 01期
基金
加拿大自然科学与工程研究理事会; 中国国家自然科学基金;
关键词
28;
D O I
10.14778/1687627.1687727
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The growing popularity of graph databases has generated interesting data ianageient probleis, such as subgraph search, shortest-path query, reachability verification, and pattern match. Aiong these, a pattern match query is more flexible compared to a subgraph search and more informative compared to a shortest-path or reachability query. In this paper, we address pattern match probleis over a large data graph G. Specifically, given a pattern graph (i.e., query Q), we want to find all inatches (in G) that have the similar connections as those in Q. In order to reduce the search space significantly, we first transform the vertices into points in a vector space via graph embedding techniques, converting a pattern match query into a distance-based multi-way join problem over the converted vector space. We also propose several pruning strategies and a join order selection method to process join processing efficiently. Extensive experiments on both real and synthetic datasets show that our method outperforms existing ones by orders of magnitude.
引用
收藏
页码:886 / 897
页数:12
相关论文
共 50 条
  • [1] Improving Distance-Join Query processing with Voronoi-Diagram based partitioning in SpatialHadoop
    Garcia-Garcia, Francisco
    Corral, Antonio
    Iribarne, Luis
    Vassilakopoulos, Michael
    FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2020, 111 (111): : 723 - 740
  • [2] Pattern match query over fuzzy RDF graph
    Li, Guanfeng
    Yan, Li
    Ma, Zongmin
    KNOWLEDGE-BASED SYSTEMS, 2019, 165 : 460 - 473
  • [3] Answering pattern match queries in large graph databases via graph embedding
    Zou, Lei
    Chen, Lei
    Oezsu, M. Tamer
    Zhao, Dongyan
    VLDB JOURNAL, 2012, 21 (01): : 97 - 120
  • [4] Answering pattern match queries in large graph databases via graph embedding
    Lei Zou
    Lei Chen
    M. Tamer Özsu
    Dongyan Zhao
    The VLDB Journal, 2012, 21 : 97 - 120
  • [5] Top-k shortest distance join over large graph
    Cheng, J. (jf.cheng@siat.ac.cn), 1600, Binary Information Press, P.O. Box 162, Bethel, CT 06801-0162, United States (09):
  • [6] GLogS: Interactive Graph Pattern Matching Query At Large Scale
    Lai, Longbin
    Yang, Yufan
    Wang, Zhibin
    Liu, Yuxuan
    Ma, Haotian
    Shen, Sijie
    Lyu, Bingqing
    Zhou, Xiaoli
    Yu, Wenyuan
    Qian, Zhengping
    Tian, Chen
    Zhong, Sheng
    Chung, Yeh-Ching
    Zhou, Jingren
    PROCEEDINGS OF THE 2023 USENIX ANNUAL TECHNICAL CONFERENCE, 2023, : 53 - 69
  • [7] Exact Distance Query in Large Graphs through Fast Graph Simplification
    Liu, Jun
    Pan, Yicheng
    Hu, Qifu
    COMPUTER JOURNAL, 2021, 64 (01): : 93 - 107
  • [8] Stableness In Large Join Query Optimization
    Bini, Tarcizio Alexandre
    Lange, Adriano
    Sunye, Marcos Sfair
    Silva, Fabiano
    2009 24TH INTERNATIONAL SYMPOSIUM ON COMPUTER AND INFORMATION SCIENCES, 2009, : 637 - 642
  • [9] Thorough Data Pruning for Join Query in Database System
    Gao, Jintao
    Li, Zhanhuai
    Sun, Jian
    IEEE TRANSACTIONS ON SUSTAINABLE COMPUTING, 2024, 9 (03): : 409 - 421
  • [10] An algorithm for multi-way distance join query
    Liang, Yin
    Zhang, Hong
    2006 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS, VOLS 1-6, PROCEEDINGS, 2006, : 412 - +