Self-tuning histograms: Building histograms without looking at data

被引:0
|
作者
Aboulnaga, A [1 ]
Chaudhuri, S [1 ]
机构
[1] Univ Wisconsin, Dept Comp Sci, Madison, WI 53706 USA
关键词
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper, we introduce self-tuning histograms. Although similar in structure to traditional histograms, these histograms infer data distributions not by examining the data or a sample thereof, but by using feedback from the query execution engine about the actual selectivity of range selection operators to progressively refine the histogram. Since the cost of building and maintaining self-tuning histograms is independent of the data size, self-tuning histograms provide a remarkably inexpensive way to construct histograms for large data sets with little up-front costs. Self-tuning histograms are particularly attractive as an alternative to multi-dimensional traditional histograms that capture dependencies between attributes but are prohibitively expensive to build and maintain. In this paper, we describe the techniques for initializing and refining self-tuning histograms. Our experimental results show that self-tuning histograms provide a low-cost alternative to traditional multi-dimensional histograms with little loss of accuracy for data distributions with low to moderate skew.
引用
收藏
页码:181 / 192
页数:12
相关论文
共 50 条
  • [1] Improving Accuracy and Robustness of Self-Tuning Histograms by Subspace Clustering
    Khachatryan, Andranik
    Mueller, Emmanuel
    Boehm, Klemens
    Stier, Christian
    2016 32ND IEEE INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE), 2016, : 1544 - 1545
  • [2] Workload-Aware Self-tuning Histograms for the Semantic Web
    Zamani, Katerina
    Charalambidis, Angelos
    Konstantopoulos, Stasinos
    Zoulis, Nickolas
    Mavroudi, Effrosyni
    TRANSACTIONS ON LARGE-SCALE DATA- AND KNOWLEDGE-CENTERED SYSTEMS XXVIII: SPECIAL ISSUE ON DATABASE- AND EXPERT-SYSTEMS APPLICATIONS, 2016, 9940 : 133 - 156
  • [3] Improving Accuracy and Robustness of Self-Tuning Histograms by Subspace Clustering
    Khachatryan, Andranik
    Mueller, Emmanuel
    Stier, Christian
    Boehm, Klemens
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2015, 27 (09) : 2377 - 2389
  • [4] Sensitivity of Self-tuning Histograms: Query Order Affecting Accuracy and Robustness
    Khachatryan, Andranik
    Mueller, Emmanuel
    Stier, Christian
    Beohm, Klemens
    SCIENTIFIC AND STATISTICAL DATABASE MANAGEMENT, SSDBM 2012, 2012, 7338 : 334 - 342
  • [5] Remember the past: Self-tuning histograms based on query-feedback logs
    Li, Xiaojing
    Zhou, Bo
    Dong, Jinxiang
    WMSCI 2005: 9TH WORLD MULTI-CONFERENCE ON SYSTEMICS, CYBERNETICS AND INFORMATICS, VOL 1, 2005, : 98 - 101
  • [6] Building wavelet histograms on large data in MapReduce
    Jestes, J. (jestes@cs.utah.edu), 1600, International Journal of Computer Science Issues (IJCSI) (09):
  • [7] Building Wavelet Histograms on Large Data in MapReduce
    Jestes, Jeffrey
    Yi, Ke
    Li, Feifei
    PROCEEDINGS OF THE VLDB ENDOWMENT, 2011, 5 (02): : 109 - 120
  • [8] From data to probability densities without histograms
    Berg, Bernd A.
    Harris, Robert C.
    COMPUTER PHYSICS COMMUNICATIONS, 2008, 179 (06) : 443 - 448
  • [9] Similarity of histograms and circular histograms from interval and fuzzy data
    Mezeil, Jozsef
    Luukka, Pasi
    Collan, Mikael
    2017 JOINT 17TH WORLD CONGRESS OF INTERNATIONAL FUZZY SYSTEMS ASSOCIATION AND 9TH INTERNATIONAL CONFERENCE ON SOFT COMPUTING AND INTELLIGENT SYSTEMS (IFSA-SCIS), 2017,
  • [10] Looking for histograms on the power supply of an artificial retina
    Mercier, DS
    Nguyen, PE
    Bernard, TM
    ADVANCED FOCAL PLANE ARRAYS AND ELECTRONIC CAMERAS, 1996, 2950 : 265 - 272