Mining and Indexing Graphs for Supergraph Search

被引:9
|
作者
Yuan, Dayu [1 ]
Mitra, Prasenjit [1 ,2 ]
Giles, C. Lee [1 ,2 ]
机构
[1] Penn State Univ, Dept Comp Sci & Engn, University Pk, PA 16802 USA
[2] Penn State Univ, Coll Informat Sci & Technol, University Pk, PA 16802 USA
来源
PROCEEDINGS OF THE VLDB ENDOWMENT | 2013年 / 6卷 / 10期
基金
美国国家科学基金会;
关键词
D O I
10.14778/2536206.2536211
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
We study supergraph search (SPS), that is, given a query graph q and a graph database G that contains a collection of graphs, return graphs that have q as a supergraph from G. SPS has broad applications in bioinformatics, cheminformatics and other scientific and commercial fields. Determining whether a graph is a subgraph (or supergraph) of another is an NP-complete problem. Hence, it is intractable to compute SPS for large graph databases. Two separate indexing methods, a "filter + verify"-based method and a "prefix-sharing"-based method, have been studied to efficiently compute SPS. To implement the above two methods, subgraph patterns are mined from the graph database to build an index. Those subgraphs are mined to optimize either the filtering gain or the prefix-sharing gain. However, no single subgraph-mining algorithm considers both gains. This work is the first one to mine subgraphs to optimize both the filtering gain and the prefix-sharing gain while processing SPS queries. First, we show that the subgraph-mining problem is NP-hard. Then, we propose two polynomial-time algorithms to solve the problem with an approximation ratio of 1 - 1/e and 1/4 respectively. In addition, we construct a lattice-like index, LW-index, to organize the selected subgraph patterns for fast index-lookup. Our experiments show that our approach improves the query processing time for SPS queries by a factor of 3 to 10.
引用
收藏
页码:829 / 840
页数:12
相关论文
共 50 条
  • [31] Supergraph Topology Feature Index for Personalized Interesting Subgraph Query in Large Labeled Graphs
    Shan, Xiaohuan
    Li, Haihai
    Jia, Chunjie
    Li, Dong
    Song, Baoyan
    COMPLEXITY, 2021, 2021
  • [32] Algorithms and Complexity on Indexing Founder Graphs
    Massimo Equi
    Tuukka Norri
    Jarno Alanko
    Bastien Cazaux
    Alexandru I. Tomescu
    Veli Mäkinen
    Algorithmica, 2023, 85 : 1586 - 1623
  • [33] Indexing Continuous Paths in Temporal Graphs
    Kuijpers, Bart
    Ribas, Ignacio
    Soliani, Valeria
    Vaisman, Alejandro
    NEW TRENDS IN DATABASE AND INFORMATION SYSTEMS, ADBIS 2022, 2022, 1652 : 232 - 242
  • [34] Indexing Bipartite Memberships in Web Graphs
    Xie, Yusheng
    Chen, Zhengzhang
    Palsetia, Diana
    Agrawal, Ankit
    Choudhary, Alok
    2014 PROCEEDINGS OF THE IEEE/ACM INTERNATIONAL CONFERENCE ON ADVANCES IN SOCIAL NETWORKS ANALYSIS AND MINING (ASONAM 2014), 2014, : 166 - 173
  • [35] Indexing Public-Private Graphs
    Archer, Aaron
    Lattanzi, Silvio
    Likarish, Peter
    Vassilvitskii, Sergei
    PROCEEDINGS OF THE 26TH INTERNATIONAL CONFERENCE ON WORLD WIDE WEB (WWW'17), 2017, : 1461 - 1470
  • [36] Efficient Subgraph Indexing for Biochemical Graphs
    Wangmo, Chimi
    Wiese, Lena
    PROCEEDINGS OF THE 11TH INTERNATIONAL CONFERENCE ON DATA SCIENCE, TECHNOLOGY AND APPLICATIONS (DATA), 2022, : 533 - 540
  • [37] Indexing De Bruijn graphs with minimizers
    Marchet, Camille
    Kerbiriou, Mael
    Limasset, Antoine
    BMC BIOINFORMATICS, 2019, 20
  • [38] INDEXING THEORY, INDEXING METHODS AND SEARCH DEVICES - JONKER,F
    PIZER, IH
    BULLETIN OF THE MEDICAL LIBRARY ASSOCIATION, 1964, 52 (04): : 827 - 828
  • [39] INDEXING THEORY, INDEXING METHODS, AND SEARCH DEVICES - JONKER,F
    BECKER, J
    COLLEGE & RESEARCH LIBRARIES, 1964, 25 (06): : 519 - 519
  • [40] INDEXING THEORY, INDEXING METHODS AND SEARCH DEVICES - JONKER,F
    CULBERTSON, DS
    LIBRARY JOURNAL, 1964, 89 (14) : 2988 - 2989