Accurate Summary-based Cardinality Estimation Through the Lens of Cardinality Estimation Graphs

被引:4
|
作者
Chen, Jeremy [1 ]
Huang, Yuqing [1 ]
Wang, Mushi [1 ]
Salihoglu, Semih [1 ]
Salem, Kenneth [1 ]
机构
[1] Univ Waterloo, Waterloo, ON, Canada
关键词
SELECTIVITY; QUERIES; BOUNDS;
D O I
10.14778/3529337.3529339
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
We study two classes of summary-based cardinality estimators that use statistics about input relations and small-size joins: (i) optimistic estimators, which were defined in the context of graph database management systems, that make uniformity and conditional independence assumptions; and (ii) the recent pessimistic estimators that use information theoretic linear programs (LPs). We show that optimistic estimators can be modeled as picking bottom-to-top paths in a cardinality estimation graph (CEG), which contains subqueries as nodes and edges whose weights are average degree statistics. We show that existing optimistic estimators have either undefined or fixed choices for picking CEG paths as their estimates and ignore alternative choices. Instead, we outline a space of optimistic estimators to make an estimate on CEGs, which subsumes existing estimators. We show, using an extensive empirical analysis, that effective paths depend on the structure of the queries. We next show that optimistic estimators and seemingly disparate LP-based pessimistic estimators are in fact connected. Specifically, we show that CEGs can also model some recent pessimistic estimators. This connection allows us to provide insights into the pessimistic estimators, such as showing that they have combinatorial solutions.
引用
收藏
页码:94 / 102
页数:9
相关论文
共 50 条
  • [21] Deep Unsupervised Cardinality Estimation
    Yang, Zongheng
    Liang, Eric
    Kamsetty, Amog
    Wu, Chenggang
    Duan, Yan
    Chen, Xi
    Abbeel, Pieter
    Hellerstein, Joseph M.
    Krishnan, Sanjay
    Stoica, Ion
    PROCEEDINGS OF THE VLDB ENDOWMENT, 2019, 13 (03): : 279 - 292
  • [22] Characteristic Sets: Accurate Cardinality Estimation for RDF Queries with Multiple Joins
    Neumann, Thomas
    Moerkotte, Guido
    IEEE 27TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE 2011), 2011, : 984 - 994
  • [23] A Cardinality Estimation Approach Based on Two Level Histograms
    Lin, Xudong
    Zeng, Xiaoning
    Pu, Xiaowei
    Sun, Yanyan
    JOURNAL OF INFORMATION SCIENCE AND ENGINEERING, 2015, 31 (05) : 1733 - 1756
  • [24] Lightweight Cardinality Estimation in LSM-based Systems
    Absalyamov, Ildar
    Carey, Michael J.
    Tsotras, Vassilis J.
    SIGMOD'18: PROCEEDINGS OF THE 2018 INTERNATIONAL CONFERENCE ON MANAGEMENT OF DATA, 2018, : 841 - 855
  • [25] RFID Cardinality Estimation Approach Based on Multilayer Perceptron
    Xie X.
    Liu X.-L.
    Wang J.-X.
    Guo S.
    Li K.-Q.
    Jisuanji Xuebao/Chinese Journal of Computers, 2023, 46 (03): : 499 - 511
  • [26] A cardinality estimation approach based on two level histograms
    Department of Information Engineering, Environmental Management College of China, Qinhuangdao, Hebei
    066004, China
    不详
    066004, China
    J. Inf. Sci. Eng., 5 (1733-1756):
  • [27] Survey of cardinality estimation techniques based on machine learning
    Yue W.
    Qu W.
    Lin K.
    Wang X.
    Jisuanji Yanjiu yu Fazhan/Computer Research and Development, 2024, 61 (02): : 413 - 427
  • [28] Automating localized learning for cardinality estimation based on XGBoost
    Feng, Jieming
    Li, Zhanhuai
    Chen, Qun
    Liu, Hailong
    KNOWLEDGE AND INFORMATION SYSTEMS, 2024, 66 (07) : 3825 - 3854
  • [29] Cardinality estimation using normalizing flow
    Wang, Jiayi
    Chai, Chengliang
    Liu, Jiabin
    Li, Guoliang
    VLDB JOURNAL, 2024, 33 (02): : 323 - 348
  • [30] Accelerating the HyperLogLog Cardinality Estimation Algorithm
    Bozkus, Cem
    Fraguela, Basilio B.
    SCIENTIFIC PROGRAMMING, 2017, 2017