Scalable Supergraph Search in Large Graph Databases

被引:0
|
作者
Lyu, Bingqing [1 ]
Qin, Lu [2 ]
Lin, Xuemin [1 ,3 ]
Chang, Lijun [3 ]
Yu, Jeffrey Xu [4 ]
机构
[1] East China Normal Univ, Shanghai, Peoples R China
[2] Univ Technol Sydney, Ctr Quantum Computat & Intelligent Syst, Sydney, NSW, Australia
[3] Univ New South Wales, Sydney, NSW, Australia
[4] Chinese Univ Hong Kong, Hong Kong, Hong Kong, Peoples R China
关键词
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Supergraph search is a fundamental problem in graph databases that is widely applied in many application scenarios. Given a graph database and a query-graph, supergraph search retrieves all data-graphs contained in the query-graph from the graph database. Most existing solutions for supergraph search follow the pruning-and-verification framework, which prunes false answers based on features in the pruning phase and performs subgraph isomorphism testings on the remaining graphs in the verification phase. However, they are not scalable to handle large-sized data-graphs and query-graphs due to three drawbacks. First, they rely on a frequent subgraph mining algorithm to select features which is expensive and cannot generate large features. Second, they require a costly verification phase. Third, they process features in a fixed order without considering their relationship to the query-graph. In this paper, we address the three drawbacks and propose new indexing and query processing algorithms. In indexing, we select features directly from the data-graphs without expensive frequent subgraph mining. The features form a feature-tree that contains all-sized features and both the cost sharing and pruning power of the features are considered. In query processing, we propose a verification-free algorithm, where the order to process features is query-dependent by considering both the cost sharing and the pruning power. We explore two optimization strategies to further improve the algorithm efficiency. The first strategy applies a lightweight graph compression technique and the second strategy optimizes the inclusion of answers. Finally, we conduct extensive performance studies on two real large datasets to demonstrate the high scalability of our algorithms.
引用
收藏
页码:157 / 168
页数:12
相关论文
共 50 条
  • [21] Semantic keyword search in graph databases
    Lou, Ying
    Wu, Qingtao
    Ji, Baiyang
    Zheng, Ruijuan
    Zhang, Mingchuan
    Wei, Wangyang
    Journal of Computational Information Systems, 2013, 9 (15): : 5913 - 5920
  • [22] A Scalable Reference-Point Based Algorithm to Efficiently Search Large Chemical Databases
    Napolitano, Francesco
    Tagliaferri, Roberto
    Baldi, Pierre
    2010 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS IJCNN 2010, 2010,
  • [23] Efficient Probabilistic Supergraph Search
    Zhang, Wenjie
    Lin, Xuemin
    Zhang, Ying
    Zhu, Ke
    Zhu, Gaoping
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2016, 28 (04) : 965 - 978
  • [24] Efficient Probabilistic Supergraph Search
    Zhang, Wenjie
    Lin, Xuemin
    Zhang, Ying
    Zhu, Ke
    Zhu, Gaoping
    2016 32ND IEEE INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE), 2016, : 1542 - 1543
  • [25] Scalable Blocking for Very Large Databases
    Borthwick, Andrew
    Ash, Stephen
    Pang, Bin
    Qureshi, Shehzad
    Jones, Timothy
    ECML PKDD 2020 WORKSHOPS, 2020, 1323 : 303 - 319
  • [26] A Scalable Graph-Coarsening Based Index for Dynamic Graph Databases
    Kansal, Akshay
    Spezzano, Francesca
    CIKM'17: PROCEEDINGS OF THE 2017 ACM CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, 2017, : 207 - 216
  • [27] Aggregated Search in Graph Databases: Preliminary Results
    Elghazel, Haytham
    Hacid, Mohand-Said
    GRAPH-BASED REPRESENTATIONS IN PATTERN RECOGNITION, 2011, 6658 : 92 - 101
  • [28] Efficient Correlation Search from Graph Databases
    Ke, Yiping
    Cheng, James
    Ng, Wilfred
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2008, 20 (12) : 1601 - 1615
  • [29] Knowledge Graph Construction and Search for Biological Databases
    Zaki, Nazar
    Tennakoon, Chandana
    Al Ashwal, Hany
    2017 5TH INTERNATIONAL CONFERENCE ON RESEARCH AND INNOVATION IN INFORMATION SYSTEMS (ICRIIS 2017): SOCIAL TRANSFORMATION THROUGH DATA SCIENCE, 2017,
  • [30] Scalable Community Search over Large-scale Graphs based on Graph Transformer
    Wang, Yuxiang
    Gou, Xiaoxuan
    Xu, Xiaoliang
    Geng, Yuxia
    Ke, Xiangyu
    Wu, Tianxing
    Yu, Zhiyuan
    Chen, Runhuai
    Wu, Xiangying
    PROCEEDINGS OF THE 47TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, SIGIR 2024, 2024, : 1680 - 1690