An Indexing Framework for Queries on Probabilistic Graphs

Cited by: 14
Authors
Maniu, Silviu [1]
Cheng, Reynold [2]
Senellart, Pierre [3, 4]
Affiliations
[1] Univ Paris Saclay, LRI, PCRI, Univ Paris Sud, Bat 650, F-91405 Orsay, France
[2] Univ Hong Kong, Dept Comp Sci, Chow Yei Ching Bldg,Pokfulam Rd, Hong Kong, Hong Kong, Peoples R China
[3] PSL Res Univ, Ecole Normale Super, DI ENS, 45 Rue Ulm, F-75230 Paris, France
[4] Inria Paris, Paris, France
Source
ACM TRANSACTIONS ON DATABASE SYSTEMS | 2017 / Vol. 42 / No. 02
Keywords
Reachability; shortest path; SPQR; treewidth; triconnected component; tree decomposition; uncertain graph
DOI
10.1145/3044713
Chinese Library Classification (CLC) number
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract
Information in many applications, such as mobile wireless systems, social networks, and road networks, is captured by graphs. In many cases, such information is uncertain. We study the problem of querying a probabilistic graph, in which vertices are connected to each other probabilistically. In particular, we examine "source-to-target" queries (ST-queries), such as computing the shortest path between two vertices. The major difference from the deterministic setting is that query answers are enriched with probabilistic annotations. Evaluating ST-queries over probabilistic graphs is #P-hard, as it requires examining an exponential number of "possible worlds" (database instances generated from the probabilistic graph). Existing solutions to the ST-query problem, which sample possible worlds, have two downsides: (i) a possible world can be very large and (ii) many samples are needed for reasonable accuracy. To tackle these issues, we study the ProbTree, a data structure that stores a succinct, or indexed, version of the possible worlds of the graph. Existing ST-query solutions are executed on top of this structure, with the number of samples and sizes of the possible worlds reduced. We examine lossless and lossy methods for generating the ProbTree, which reflect the tradeoff between the accuracy and efficiency of query evaluation. We analyze the correctness and complexity of these approaches. Our extensive experiments on real datasets show that the ProbTree is fast to generate and small in size. It also enhances the accuracy and efficiency of existing ST-query algorithms significantly.
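The sampling baseline that the abstract refers to can be illustrated with a minimal Python sketch. It is not the ProbTree and not the authors' code: it is plain Monte Carlo estimation of one ST-query (reachability), under the assumption that each directed edge exists independently with its given probability. The function names (`build_adjacency`, `reachable_in_sampled_world`, `estimate_reachability`) and the `(u, v, p)` edge format are hypothetical, chosen only for this illustration.

```python
import random
from collections import deque


def build_adjacency(edges):
    """Group (u, v, p) triples by source vertex. The triple format is an assumption."""
    adj = {}
    for u, v, p in edges:
        adj.setdefault(u, []).append((v, p))
    return adj


def reachable_in_sampled_world(adj, source, target, rng):
    """Run a BFS over one possible world, sampling each edge lazily.

    Each vertex is dequeued at most once, so each outgoing edge is
    examined at most once; edges into already-visited vertices cannot
    change reachability and are skipped without sampling.
    """
    seen = {source}
    queue = deque([source])
    while queue:
        u = queue.popleft()
        if u == target:
            return True
        for v, p in adj.get(u, ()):
            # The edge is present in this world with probability p.
            if v not in seen and rng.random() < p:
                seen.add(v)
                queue.append(v)
    return False


def estimate_reachability(edges, source, target, samples=10_000, seed=0):
    """Estimate P(target reachable from source) as the fraction of sampled worlds with a path."""
    rng = random.Random(seed)
    adj = build_adjacency(edges)
    hits = sum(
        reachable_in_sampled_world(adj, source, target, rng)
        for _ in range(samples)
    )
    return hits / samples


if __name__ == "__main__":
    # Two independent edges of probability 0.5 in series: exact answer is 0.25.
    chain = [("a", "b", 0.5), ("b", "c", 0.5)]
    print(estimate_reachability(chain, "a", "c", samples=20_000))
```

The sketch makes both downsides named in the abstract visible: every sample traverses a graph that can be as large as the input, and the estimate only tightens as the number of samples grows.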
Pages: 34
Related papers
50 in total
  • [1] Indexing Graphs for Path Queries with Applications in Genome Research
    Siren, Jouni
    Valimaki, Niko
    Makinen, Veli
    IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2014, 11 (02) : 375 - 388
  • [2] Conjunctive Queries on Probabilistic Graphs: The Limits of Approximability
    Amarilli, Antoine
    van Bremen, Timothy
    Meel, Kuldeep S.
    27TH INTERNATIONAL CONFERENCE ON DATABASE THEORY, ICDT 2024, 2024, 290
  • [3] Conjunctive Queries on Probabilistic Graphs: Combined Complexity
    Amarilli, Antoine
    Monet, Mikael
    Senellart, Pierre
    PODS'17: PROCEEDINGS OF THE 36TH ACM SIGMOD-SIGACT-SIGAI SYMPOSIUM ON PRINCIPLES OF DATABASE SYSTEMS, 2017, : 217 - 232
  • [4] Efficient Probabilistic Truss Indexing on Uncertain Graphs
    Sun, Zitan
    Huang, Xin
    Xu, Jianliang
    Bonchi, Francesco
    PROCEEDINGS OF THE WORLD WIDE WEB CONFERENCE 2021 (WWW 2021), 2021, : 354 - 366
  • [5] Uncertain Data Queries Processing in a Probabilistic Framework
    He, Ming
    Du, Yong-ping
    JOURNAL OF COMPUTERS, 2010, 5 (11) : 1663 - 1669
  • [6] Semantic Video Indexing using a probabilistic framework
    Naphade, MR
    Huang, TS
    15TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION, VOL 3, PROCEEDINGS: IMAGE, SPEECH AND SIGNAL PROCESSING, 2000, : 79 - 84
  • [7] A probabilistic framework for semantic indexing and retrieval in video
    Naphade, MR
    Huang, TS
    2000 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, PROCEEDINGS VOLS I-III, 2000, : 475 - 478
  • [8] THE DICHOTOMY OF EVALUATING HOMOMORPHISM-CLOSED QUERIES ON PROBABILISTIC GRAPHS
    Amarilli, Antoine
    Ceylan, Ismail Ilkan
    LOGICAL METHODS IN COMPUTER SCIENCE, 2022, 18 (01) : 1 - 2
  • [9] A Framework for Probabilistic Reasoning on Knowledge Graphs
    Bellomarini, Luigi
    Benedetto, Davide
    Laurenza, Eleonora
    Sallinger, Emanuel
    BUILDING BRIDGES BETWEEN SOFT AND STATISTICAL METHODOLOGIES FOR DATA SCIENCE, 2023, 1433 : 48 - 56
  • [10] A probabilistic framework for semantic video indexing, filtering, and retrieval
    Naphade, MR
    Huang, TS
    IEEE TRANSACTIONS ON MULTIMEDIA, 2001, 3 (01) : 141 - 151