An Indexing Framework for Queries on Probabilistic Graphs

被引:14
|
作者
Maniu, Silviu [1 ]
Cheng, Reynold [2 ]
Senellart, Pierre [3 ,4 ]
机构
[1] Univ Paris Saclay, LRI, PCRI, Univ Paris Sud, Bat 650, F-91405 Orsay, France
[2] Univ Hong Kong, Dept Comp Sci, Chow Yei Ching Bldg,Pokfulam Rd, Hong Kong, Hong Kong, Peoples R China
[3] PSL Res Univ, Ecole Normale Super, DI ENS, 45 Rue Ulm, F-75230 Paris, France
[4] Inria Paris, Paris, France
来源
ACM TRANSACTIONS ON DATABASE SYSTEMS | 2017年 / 42卷 / 02期
关键词
Reachability; shortest path; SPQR; treewidth; triconnected component; tree decomposition; uncertain graph;
D O I
10.1145/3044713
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Information in many applications, such as mobile wireless systems, social networks, and road networks, is captured by graphs. In many cases, such information is uncertain. We study the problem of querying a probabilistic graph, in which vertices are connected to each other probabilistically. In particular, we examine "source-to-target" queries (ST-queries), such as computing the shortest path between two vertices. The major difference with the deterministic setting is that query answers are enriched with probabilistic annotations. Evaluating ST-queries over probabilistic graphs is #P-hard, as it requires examining an exponential number of "possible worlds"-database instances generated from the probabilistic graph. Existing solutions to the ST-query problem, which sample possible worlds, have two downsides: (i) a possible world can be very large and (ii) many samples are needed for reasonable accuracy. To tackle these issues, we study the ProbTree, a data structure that stores a succinct, or indexed, version of the possible worlds of the graph. Existing ST-query solutions are executed on top of this structure, with the number of samples and sizes of the possible worlds reduced. We examine lossless and lossy methods for generating the ProbTree, which reflect the tradeoff between the accuracy and efficiency of query evaluation. We analyze the correctness and complexity of these approaches. Our extensive experiments on real datasets show that the ProbTree is fast to generate and small in size. It also enhances the accuracy and efficiency of existing ST-query algorithms significantly.
引用
收藏
页数:34
相关论文
共 50 条
  • [41] Probabilistic semantic video indexing
    Naphade, MR
    Kozintsev, I
    Huang, T
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 13, 2001, 13 : 967 - 973
  • [42] Hierarchical indexing approach to support XPath queries
    Tang, Nan
    Yu, Jeffrey Xu
    Oezsu, M. Tamer
    Wong, Kam-Fai
    2008 IEEE 24TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING, VOLS 1-3, 2008, : 1510 - +
  • [43] Indexing Uncertain Data for Supporting Range Queries
    Zhu, Rui
    Wang, Bin
    Wang, Guoren
    WEB-AGE INFORMATION MANAGEMENT, WAIM 2014, 2014, 8485 : 72 - 83
  • [44] Indexing moving objects for directions and velocities queries
    Sultan Alamri
    David Taniar
    Maytham Safar
    Information Systems Frontiers, 2013, 15 : 235 - 248
  • [45] Block splitting indexing for supporting containment queries
    Jin, M
    Kim, JM
    WEB AND COMMUNICATION TECHNOLOGIES AND INTERNET-RELATED SOCIAL ISSUES - HSI 2003, 2003, 2713 : 688 - 693
  • [46] An Indexing Method for Answering Queries on Moving Objects
    Khaled Elbassioni
    Amr Elmasry
    Ibrahim Kamel
    Distributed and Parallel Databases, 2005, 17 : 215 - 249
  • [47] An indexing method for answering queries on moving objects
    Elbassioni, K
    Elmasry, A
    Kamel, I
    DISTRIBUTED AND PARALLEL DATABASES, 2005, 17 (03) : 215 - 249
  • [48] Indexing XML data for path expression queries
    Hu, G
    Tang, C
    SOFTWARE ENGINEERING RESEARCH AND APPLICATIONS, 2004, 3026 : 332 - 348
  • [49] Models for retrieval with probabilistic indexing
    Fuhr, N., 1600, Elsevier Ltd (25):
  • [50] PROBABILISTIC INDEXING FOR OBJECT RECOGNITION
    OLSON, CF
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 1995, 17 (05) : 518 - 522