An Indexing Framework for Queries on Probabilistic Graphs

被引:14
|
作者
Maniu, Silviu [1 ]
Cheng, Reynold [2 ]
Senellart, Pierre [3 ,4 ]
机构
[1] Univ Paris Saclay, LRI, PCRI, Univ Paris Sud, Bat 650, F-91405 Orsay, France
[2] Univ Hong Kong, Dept Comp Sci, Chow Yei Ching Bldg,Pokfulam Rd, Hong Kong, Hong Kong, Peoples R China
[3] PSL Res Univ, Ecole Normale Super, DI ENS, 45 Rue Ulm, F-75230 Paris, France
[4] Inria Paris, Paris, France
来源
ACM TRANSACTIONS ON DATABASE SYSTEMS | 2017年 / 42卷 / 02期
关键词
Reachability; shortest path; SPQR; treewidth; triconnected component; tree decomposition; uncertain graph;
D O I
10.1145/3044713
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Information in many applications, such as mobile wireless systems, social networks, and road networks, is captured by graphs. In many cases, such information is uncertain. We study the problem of querying a probabilistic graph, in which vertices are connected to each other probabilistically. In particular, we examine "source-to-target" queries (ST-queries), such as computing the shortest path between two vertices. The major difference with the deterministic setting is that query answers are enriched with probabilistic annotations. Evaluating ST-queries over probabilistic graphs is #P-hard, as it requires examining an exponential number of "possible worlds"-database instances generated from the probabilistic graph. Existing solutions to the ST-query problem, which sample possible worlds, have two downsides: (i) a possible world can be very large and (ii) many samples are needed for reasonable accuracy. To tackle these issues, we study the ProbTree, a data structure that stores a succinct, or indexed, version of the possible worlds of the graph. Existing ST-query solutions are executed on top of this structure, with the number of samples and sizes of the possible worlds reduced. We examine lossless and lossy methods for generating the ProbTree, which reflect the tradeoff between the accuracy and efficiency of query evaluation. We analyze the correctness and complexity of these approaches. Our extensive experiments on real datasets show that the ProbTree is fast to generate and small in size. It also enhances the accuracy and efficiency of existing ST-query algorithms significantly.
引用
收藏
页数:34
相关论文
共 50 条
  • [21] Probabilistic relational indexing
    Chou, YY
    Shapiro, LG
    FOURTEENTH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION, VOLS 1 AND 2, 1998, : 1331 - 1335
  • [22] Indexing a dictionary for subset matching queries
    Landau, Cad M.
    Tsur, Dekel
    Weimann, Oren
    STRING PROCESSING AND INFORMATION RETRIEVAL, PROCEEDINGS, 2007, 4726 : 195 - +
  • [23] Multiresolution indexing of XML for frequent queries
    He, H
    Yang, J
    20TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING, PROCEEDINGS, 2004, : 683 - 694
  • [24] Less Space: Indexing for Queries with Wildcards
    Lewenstein, Moshe
    Munro, J. Ian
    Raman, Venkatesh
    Thankachan, Sharma V.
    ALGORITHMS AND COMPUTATION, 2013, 8283 : 89 - 99
  • [25] Indexing views to route queries in a PDMS
    Lefteris Sidirourgos
    George Kokkinidis
    Theodore Dalamagas
    Vassilis Christophides
    Timos Sellis
    Distributed and Parallel Databases, 2008, 23 : 45 - 68
  • [26] Less space: Indexing for queries with wildcards
    Lewenstein, Moshe
    Munro, J. Ian
    Raman, Venkatesh
    Thankachan, Sharma V.
    THEORETICAL COMPUTER SCIENCE, 2014, 557 : 120 - 127
  • [27] Indexing for Summary Queries: Theory and Practice
    Yi, Ke
    Wang, Lu
    Wei, Zhewei
    ACM TRANSACTIONS ON DATABASE SYSTEMS, 2014, 39 (01):
  • [28] Indexing a Dictionary for Subset Matching Queries
    Landau, Gad M.
    Tsur, Dekel
    Weimann, Oren
    ALGORITHMS AND APPLICATIONS: ESSAYS DEDICATED TO ESKO UKKONEN ON THE OCCASION OF HIS 60TH BIRTHDAY, 2010, 6060 : 158 - +
  • [29] PROBABILISTIC APPROACH TO AUTOMATIC KEYWORD INDEXING .2. ALGORITHM FOR PROBABILISTIC INDEXING
    HARTER, SP
    JOURNAL OF THE AMERICAN SOCIETY FOR INFORMATION SCIENCE, 1975, 26 (05): : 280 - 289
  • [30] Indexing views to route queries in a PDMS
    Sidirourgos, Lefteris
    Kokkinidis, George
    Dalamagas, Theodore
    Christophides, Vassilis
    Sellis, Timos
    DISTRIBUTED AND PARALLEL DATABASES, 2008, 23 (01) : 45 - 68