Scalable Supergraph Search in Large Graph Databases

被引:0
|
作者
Lyu, Bingqing [1 ]
Qin, Lu [2 ]
Lin, Xuemin [1 ,3 ]
Chang, Lijun [3 ]
Yu, Jeffrey Xu [4 ]
机构
[1] East China Normal Univ, Shanghai, Peoples R China
[2] Univ Technol Sydney, Ctr Quantum Computat & Intelligent Syst, Sydney, NSW, Australia
[3] Univ New South Wales, Sydney, NSW, Australia
[4] Chinese Univ Hong Kong, Hong Kong, Hong Kong, Peoples R China
关键词
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Supergraph search is a fundamental problem in graph databases that is widely applied in many application scenarios. Given a graph database and a query-graph, supergraph search retrieves all data-graphs contained in the query-graph from the graph database. Most existing solutions for supergraph search follow the pruning-and-verification framework, which prunes false answers based on features in the pruning phase and performs subgraph isomorphism testings on the remaining graphs in the verification phase. However, they are not scalable to handle large-sized data-graphs and query-graphs due to three drawbacks. First, they rely on a frequent subgraph mining algorithm to select features which is expensive and cannot generate large features. Second, they require a costly verification phase. Third, they process features in a fixed order without considering their relationship to the query-graph. In this paper, we address the three drawbacks and propose new indexing and query processing algorithms. In indexing, we select features directly from the data-graphs without expensive frequent subgraph mining. The features form a feature-tree that contains all-sized features and both the cost sharing and pruning power of the features are considered. In query processing, we propose a verification-free algorithm, where the order to process features is query-dependent by considering both the cost sharing and the pruning power. We explore two optimization strategies to further improve the algorithm efficiency. The first strategy applies a lightweight graph compression technique and the second strategy optimizes the inclusion of answers. Finally, we conduct extensive performance studies on two real large datasets to demonstrate the high scalability of our algorithms.
引用
收藏
页码:157 / 168
页数:12
相关论文
共 50 条
  • [31] Subgraph Isomorphism Search in Massive Graph Databases
    Nabti, Chemseddine
    Seba, Hamida
    IOTBD: PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON INTERNET OF THINGS AND BIG DATA, 2016, : 204 - 213
  • [32] Mining and Indexing Graphs for Supergraph Search
    Yuan, Dayu
    Mitra, Prasenjit
    Giles, C. Lee
    PROCEEDINGS OF THE VLDB ENDOWMENT, 2013, 6 (10): : 829 - 840
  • [33] Scalable feature selection for large sized databases
    Liu, H
    Setiono, R
    4TH WORLD CONGRESS OF EXPERT SYSTEMS, VOL 1 AND 2: APPLICATION OF ADVANCED INFORMATION TECHNOLOGIES, 1998, : 521 - 528
  • [34] GHashing: Semantic Graph Hashing for Approximate Similarity Search in Graph Databases
    Qin, Zongyue
    Bai, Yunsheng
    Sun, Yizhou
    KDD '20: PROCEEDINGS OF THE 26TH ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY & DATA MINING, 2020, : 2062 - 2072
  • [35] Scalable Time-Versioning Support for Property Graph Databases
    Vijitbenjaronk, Warut D.
    Lee, Jinho
    Suzumura, Toyotaro
    Tanase, Gabriel
    2017 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2017, : 1580 - 1589
  • [36] Neural Similarity Search on Supergraph Containment
    Wang, Hanchen
    Yu, Jianke
    Wang, Xiaoyang
    Chen, Chen
    Zhang, Wenjie
    Lin, Xuemin
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2024, 36 (01) : 281 - 295
  • [37] Search for approximate matches in large databases
    Fink, E
    Goldstein, A
    Hayes, P
    Carbonell, JG
    2004 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN & CYBERNETICS, VOLS 1-7, 2004, : 1431 - 1435
  • [38] CONFUSIONAL SEARCH ON LARGE FILES OR DATABASES
    FELICIAN, L
    INFORMATION SYSTEMS, 1988, 13 (03) : 239 - 244
  • [39] On the order supergraph of the power graph of a finite group
    Xuanlong Ma
    Huadong Su
    Ricerche di Matematica, 2022, 71 : 381 - 390
  • [40] Geometric Graph Indexing for Similarity Search in Scientific Databases
    Armiti, Ayser
    Gertz, Michael
    28TH INTERNATIONAL CONFERENCE ON SCIENTIFIC AND STATISTICAL DATABASE MANAGEMENT (SSDBM) 2016), 2016,