Mining and Indexing Graphs for Supergraph Search

被引:9
|
作者
Yuan, Dayu [1 ]
Mitra, Prasenjit [1 ,2 ]
Giles, C. Lee [1 ,2 ]
机构
[1] Penn State Univ, Dept Comp Sci & Engn, University Pk, PA 16802 USA
[2] Penn State Univ, Coll Informat Sci & Technol, University Pk, PA 16802 USA
来源
PROCEEDINGS OF THE VLDB ENDOWMENT | 2013年 / 6卷 / 10期
基金
美国国家科学基金会;
关键词
D O I
10.14778/2536206.2536211
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
We study supergraph search (SPS), that is, given a query graph q and a graph database G that contains a collection of graphs, return graphs that have q as a supergraph from G. SPS has broad applications in bioinformatics, cheminformatics and other scientific and commercial fields. Determining whether a graph is a subgraph (or supergraph) of another is an NP-complete problem. Hence, it is intractable to compute SPS for large graph databases. Two separate indexing methods, a "filter + verify"-based method and a "prefix-sharing"-based method, have been studied to efficiently compute SPS. To implement the above two methods, subgraph patterns are mined from the graph database to build an index. Those subgraphs are mined to optimize either the filtering gain or the prefix-sharing gain. However, no single subgraph-mining algorithm considers both gains. This work is the first one to mine subgraphs to optimize both the filtering gain and the prefix-sharing gain while processing SPS queries. First, we show that the subgraph-mining problem is NP-hard. Then, we propose two polynomial-time algorithms to solve the problem with an approximation ratio of 1 - 1/e and 1/4 respectively. In addition, we construct a lattice-like index, LW-index, to organize the selected subgraph patterns for fast index-lookup. Our experiments show that our approach improves the query processing time for SPS queries by a factor of 3 to 10.
引用
收藏
页码:829 / 840
页数:12
相关论文
共 50 条
  • [21] INDEXING THEORY INDEXING METHODS AND SEARCH DEVICES
    WARD, E
    AMERICAN DOCUMENTATION, 1966, 17 (02): : 109 - &
  • [22] Supergraph Search in Graph Databases via Hierarchical Feature-Tree
    Lyu, Bingqing
    Qin, Lu
    Lin, Xuemin
    Chang, Lijun
    Yu, Jeffrey Xu
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2019, 31 (02) : 385 - 400
  • [23] OPTIMAL INDEXING OF VERTICES OF GRAPHS
    FITZGERALD, CH
    MATHEMATICS OF COMPUTATION, 1974, 28 (127) : 825 - 831
  • [24] Indexing and mining audiovisual data
    Morizet-Mahoudeaux, P
    Bachimont, B
    ACTIVE MINING, 2005, 3430 : 34 - 58
  • [25] Indexing and mining free trees
    Chi, Y
    Yang, YR
    Muntz, RR
    THIRD IEEE INTERNATIONAL CONFERENCE ON DATA MINING, PROCEEDINGS, 2003, : 509 - 512
  • [26] Indexing and mining large-scale neuron databases using maximum inner product search
    Li, Zhongyu
    Fang, Ruogu
    Shen, Fumin
    Katouzian, Amin
    Zhang, Shaoting
    PATTERN RECOGNITION, 2017, 63 : 680 - 688
  • [27] Indexing and search of multimodal information
    Hauptmann, AG
    Wactlar, HD
    1997 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I - V: VOL I: PLENARY, EXPERT SUMMARIES, SPECIAL, AUDIO, UNDERWATER ACOUSTICS, VLSI; VOL II: SPEECH PROCESSING; VOL III: SPEECH PROCESSING, DIGITAL SIGNAL PROCESSING; VOL IV: MULTIDIMENSIONAL SIGNAL PROCESSING, NEURAL NETWORKS - VOL V: STATISTICAL SIGNAL AND ARRAY PROCESSING, APPLICATIONS, 1997, : 195 - 198
  • [28] Efficient indexing for semantic search
    Lashkari, Fatemeh
    Ensan, Faezeh
    Bagheri, Ebrahim
    Ghorbani, Ali A.
    EXPERT SYSTEMS WITH APPLICATIONS, 2017, 73 : 92 - 114
  • [29] Algorithms and Complexity on Indexing Founder Graphs
    Equi, Massimo
    Norri, Tuukka
    Alanko, Jarno
    Cazaux, Bastien
    Tomescu, Alexandru, I
    Makinen, Veli
    ALGORITHMICA, 2023, 85 (06) : 1586 - 1623
  • [30] An Indexing Framework for Queries on Probabilistic Graphs
    Maniu, Silviu
    Cheng, Reynold
    Senellart, Pierre
    ACM TRANSACTIONS ON DATABASE SYSTEMS, 2017, 42 (02):