An Approach Based on Bayesian Networks for Query Selectivity Estimation

被引:5
|
作者
Halford, Max [1 ,2 ]
Saint-Pierre, Philippe [2 ]
Morvan, Franck [1 ]
机构
[1] Paul Sabatier Univ, IRIT Lab, Toulouse, France
[2] Paul Sabatier Univ, IMT Lab, Toulouse, France
关键词
Query optimisation; Cardinality estimation; Bayesian networks; COMPLEXITY; SIZE;
D O I
10.1007/978-3-030-18579-4_1
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The efficiency of a query execution plan depends on the accuracy of the selectivity estimates given to the query optimiser by the cost model. The cost model makes simplifying assumptions in order to produce said estimates in a timely manner. These assumptions lead to selectivity estimation errors that have dramatic effects on the quality of the resulting query execution plans. A convenient assumption that is ubiquitous among current cost models is to assume that attributes are independent with each other. However, it ignores potential correlations which can have a huge negative impact on the accuracy of the cost model. In this paper we attempt to relax the attribute value independence assumption without unreasonably deteriorating the accuracy of the cost model. We propose a novel approach based on a particular type of Bayesian networks called Chow-Liu trees to approximate the distribution of attribute values inside each relation of a database. Our results on the TPC-DS benchmark show that our method is an order of magnitude more precise than other approaches whilst remaining reasonably efficient in terms of time and space.
引用
收藏
页码:3 / 19
页数:17
相关论文
共 50 条
  • [31] Clock Synchronization in Wireless Sensor Networks Based on Bayesian Estimation
    Yang, Ting
    Niu, Yuqing
    Yu, Jiexiao
    IEEE ACCESS, 2020, 8 : 69683 - 69694
  • [33] A recommendation approach based on bayesian networks for clone refactor
    Zhai, Ye
    Liu, Dongsheng
    Wu, Celimuge
    She, Rongrong
    Computers, Materials and Continua, 2020, 64 (03): : 1999 - 2012
  • [34] A Recommendation Approach Based on Bayesian Networks for Clone Refactor
    Zhai, Ye
    Liu, Dongsheng
    Wu, Celimuge
    She, Rongrong
    CMC-COMPUTERS MATERIALS & CONTINUA, 2020, 64 (03): : 1999 - 2012
  • [35] An approach to vital signal fusion based on Bayesian Networks
    Martins, V. R.
    Boudy, J.
    Andreao, R. V.
    Bastos-Filho, T. F.
    IV LATIN AMERICAN CONGRESS ON BIOMEDICAL ENGINEERING 2007, BIOENGINEERING SOLUTIONS FOR LATIN AMERICA HEALTH, VOLS 1 AND 2, 2008, 18 (1,2): : 178 - 182
  • [37] Query Selectivity Estimation Based on Improved V-optimal Histogram by Introducing Information about Distribution of Boundaries of Range Query Conditions
    Augustyn, Dariusz Rafal
    COMPUTER INFORMATION SYSTEMS AND INDUSTRIAL MANAGEMENT, CISIM 2014, 2014, 8838 : 151 - 164
  • [38] Query execution time estimation in graph databases based on graph neural networks
    He, Zhenzhen
    Yu, Jiong
    Gu, Tiquan
    Yang, Dexian
    JOURNAL OF KING SAUD UNIVERSITY-COMPUTER AND INFORMATION SCIENCES, 2024, 36 (04)
  • [39] Query Based Iterative Learning Approach for Lightpath Deployment in Optical Networks
    Usmani, Fehmida
    Khan, Ihtesham
    Tariq, Hafsa
    Shahzad, Muhammad
    Ahmad, Arsalan
    Curri, Vittorio
    2022 ASIA COMMUNICATIONS AND PHOTONICS CONFERENCE, ACP, 2022, : 1253 - 1257
  • [40] A Search Approach Based on Query Similarity in Content-Centric Networks
    Tu, Jiun-Yu
    Hu, Chih-Lin
    Hu, Han
    2019 20TH ASIA-PACIFIC NETWORK OPERATIONS AND MANAGEMENT SYMPOSIUM (APNOMS), 2019,