Wander Join: Online Aggregation via Random Walks

被引:94
|
作者
Li, Feifei [1 ]
Wu, Bin [2 ]
Yi, Ke [2 ]
Zhao, Zhuoyue [3 ]
机构
[1] Univ Utah, Salt Lake City, UT 84112 USA
[2] Hong Kong Univ Sci & Technol, Hong Kong, Hong Kong, Peoples R China
[3] Shanghai Jiao Tong Univ, Shanghai, Peoples R China
基金
美国国家科学基金会;
关键词
D O I
10.1145/2882903.2915235
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Joins are expensive, and online aggregation over joins was proposed to mitigate the cost, which offers users a nice and flexible tradeoff between query efficiency and accuracy in a continuous, online fashion. However, the state-of-the-art approach, in both internal and external memory, is based on ripple join, which is still very expensive and even needs unrealistic assumptions (e.g., tuples in a table are stored in random order). This paper proposes a new approach, the wander join algorithm, to the online aggregation problem by performing random walks over the underlying join graph. We also design an optimizer that chooses the optimal plan for conducting the random walks without having to collect any statistics a priori. Compared with ripple join, wander join is particularly efficient for equality joins involving multiple tables, but also supports theta-joins. Selection predicates and group-by clauses can be handled as well. Extensive experiments using the TPC-H benchmark have demonstrated the superior performance of wander join over ripple join. In particular, we have integrated and tested wander join in the latest version of PostgreSQL, demonstrating its practicality in a full-fledged database system.
引用
收藏
页码:615 / 629
页数:15
相关论文
共 50 条
  • [1] Wander Join and XDB: Online Aggregation via Random Walks
    Li, Feifei
    Wu, Bin
    Yi, Ke
    Zhao, Zhuoyue
    SIGMOD RECORD, 2017, 46 (01) : 33 - 40
  • [2] Wander Join and XDB: Online Aggregation via Random Walks
    Li, Feifei
    Wu, Bin
    Yi, Ke
    Zhao, Zhuoyue
    ACM TRANSACTIONS ON DATABASE SYSTEMS, 2019, 44 (01):
  • [3] Wander Join: Online Aggregation for Joins
    Li, Feifei
    Wu, Bin
    Yi, Ke
    Zhao, Zhuoyue
    SIGMOD'16: PROCEEDINGS OF THE 2016 INTERNATIONAL CONFERENCE ON MANAGEMENT OF DATA, 2016, : 2121 - 2124
  • [4] Aggregation is the key to succeed in random walks
    Hernandez-Suarez, Carlos M.
    MATHEMATICAL BIOSCIENCES, 2016, 279 : 33 - 37
  • [5] Amenability via random walks
    Bartholdi, L
    Virág, B
    DUKE MATHEMATICAL JOURNAL, 2005, 130 (01) : 39 - 56
  • [6] Algorithms via Quantum Random Walks
    Mc Gettrick, Michael
    ERCIM NEWS, 2011, (85): : 14 - 15
  • [7] Distinguishing certain random sceneries on Z via random walks
    Howard, CD
    STATISTICS & PROBABILITY LETTERS, 1997, 34 (02) : 123 - 132
  • [8] Aggregation through adaptive random walks in a minimalist robot swarm
    Feola, Luigi
    Sion, Antoine
    Reina, Andreagiovanni
    Trianni, Vito
    Tuci, Elio
    PROCEEDINGS OF THE 2023 GENETIC AND EVOLUTIONARY COMPUTATION CONFERENCE, GECCO 2023, 2023, : 21 - 29
  • [9] Characterisation of equalisation problems via random walks
    Nakata, Toshio
    MATHEMATICAL GAZETTE, 2022, 106 (565): : 61 - 67
  • [10] Lattice Sieving via Quantum Random Walks
    Chailloux, Andre
    Loyer, Johanna
    ADVANCES IN CRYPTOLOGY - ASIACRYPT 2021, PT IV, 2021, 13093 : 63 - 91