Answering approximate range aggregate queries on OLAP data cubes with probabilistic guarantees

被引:0
|
作者
Cuzzocrea, A [1 ]
Wang, W
Matrangolo, U
机构
[1] Univ New S Wales, Sch Comp Sci & Engn, Sydney, NSW 2052, Australia
[2] Univ Calabria, DEIS Dept, I-87036 Cosenza, Italy
[3] ICAR Inst, CNR, I-87036 Cosenza, Italy
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Approximate range aggregate queries are one of the most frequent and useful kinds of queries for Decision Support Systems (DSS). Traditionally, sampling-based techniques have been proposed to tackle this problem. However, its effectiveness will degrade when the underlying data distribution is skewed. Another approach based on the outlier management can limit the effect of data skew but fails to address other requirements of approximate range aggregate queries, such as error guarantees and query processing efficiency. In this paper, we present a technique that provide approximate answers to range aggregate queries on OLAP data cubes efficiently with theoretical error guarantees, Our basic idea is to build different data structures for outliers and the rest of the data. Experimental results verified the effectiveness of our proposed methods.
引用
收藏
页码:97 / 107
页数:11
相关论文
共 50 条
  • [31] Data structures for range-aggregate extent queries
    Gupta, Prosenjit
    Janardan, Ravi
    Kumar, Yokesh
    Smid, Michiel
    COMPUTATIONAL GEOMETRY-THEORY AND APPLICATIONS, 2014, 47 (02): : 329 - 347
  • [32] Improvising Range Aggregate Queries in Big Data Environment
    Arbad, Ganesh R.
    Kulkarni, P. V.
    PROCEEDINGS OF THE 2018 SECOND INTERNATIONAL CONFERENCE ON INTELLIGENT COMPUTING AND CONTROL SYSTEMS (ICICCS), 2018, : 1896 - 1901
  • [33] Performing Range Aggregate Queries in Stream Data Warehouse
    Gorawski, Marcin
    Malczok, Rafal
    MAN-MACHINE INTERACTIONS, 2009, 59 : 615 - 622
  • [34] Answering ad hoc aggregate queries from data streams using prefix aggregate trees
    Cho, Moonjung
    Pei, Jian
    Wang, Ke
    KNOWLEDGE AND INFORMATION SYSTEMS, 2007, 12 (03) : 301 - 329
  • [35] Answering Approximate String Queries on Large Data Sets Using External Memory
    Behm, Alexander
    Li, Chen
    Carey, Michael J.
    IEEE 27TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE 2011), 2011, : 888 - 899
  • [36] A probabilistic framework for estimating the accuracy of aggregate range queries evaluated over histograms
    Buccafurri, Francesco
    Furfaro, Filippo
    Sacca, Domenico
    INFORMATION SCIENCES, 2012, 188 : 121 - 150
  • [37] Efficiently answering probabilistic threshold top-k queries on uncertain data
    Hua, Ming
    Pei, Jian
    Zhang, Wenjie
    Lin, Xuemin
    2008 IEEE 24TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING, VOLS 1-3, 2008, : 1403 - +
  • [38] Efficient Range-Sum Queries along Dimensional Hierarchies in Data Cubes
    Lauer, Tobias
    Mai, Dominic
    Hagedorn, Philippe
    2009 FIRST INTERNATIONAL CONFERENCE ON ADVANCES IN DATABASES, KNOWLEDGE, AND DATA APPLICATIONS, 2009, : 7 - +
  • [39] The Ra*-tree:: An improved R*-tree with materialized data for supporting range queries on OLAP-data
    Jurgens, M
    Lenz, HJ
    NINTH INTERNATIONAL WORKSHOP ON DATABASE AND EXPERT SYSTEMS APPLICATIONS, PROCEEDINGS, 1998, : 186 - 191
  • [40] Sliding-Window Probabilistic Threshold Aggregate Queries on Uncertain Data Streams
    Chen, Donghui
    Chen, Ling
    INFORMATION SCIENCES, 2020, 520 (520) : 353 - 372