Answering approximate range aggregate queries on OLAP data cubes with probabilistic guarantees

被引:0
|
作者
Cuzzocrea, A [1 ]
Wang, W
Matrangolo, U
机构
[1] Univ New S Wales, Sch Comp Sci & Engn, Sydney, NSW 2052, Australia
[2] Univ Calabria, DEIS Dept, I-87036 Cosenza, Italy
[3] ICAR Inst, CNR, I-87036 Cosenza, Italy
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Approximate range aggregate queries are one of the most frequent and useful kinds of queries for Decision Support Systems (DSS). Traditionally, sampling-based techniques have been proposed to tackle this problem. However, its effectiveness will degrade when the underlying data distribution is skewed. Another approach based on the outlier management can limit the effect of data skew but fails to address other requirements of approximate range aggregate queries, such as error guarantees and query processing efficiency. In this paper, we present a technique that provide approximate answers to range aggregate queries on OLAP data cubes efficiently with theoretical error guarantees, Our basic idea is to build different data structures for outliers and the rest of the data. Experimental results verified the effectiveness of our proposed methods.
引用
收藏
页码:97 / 107
页数:11
相关论文
共 50 条
  • [21] A Neural Database for Answering Aggregate Queries on Incomplete Relational Data
    Zeighami, Sepanta
    Seshadri, Raghav
    Shahabi, Cyrus
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2024, 36 (07) : 2790 - 2802
  • [22] Efficiently Computing and Querying Multidimensional OLAP Data Cubes over Probabilistic Relational Data
    Cuzzocrea, Alfredo
    Gunopulos, Dimitrios
    ADVANCES IN DATABASES AND INFORMATION SYSTEMS, 2010, 6295 : 132 - +
  • [23] A data cube for range queries and approximate queries in dynamic environments
    Shi, Zhi-Bin
    Wang, Bao-Min
    ISTM/2007: 7TH INTERNATIONAL SYMPOSIUM ON TEST AND MEASUREMENT, VOLS 1-7, CONFERENCE PROCEEDINGS, 2007, : 1587 - 1590
  • [24] Answering skyline queries on probabilistic data using the dominance of probabilistic skyline tuples
    Trieu Minh Nhut Le
    Cao, Jinli
    He, Zhen
    INFORMATION SCIENCES, 2016, 340 : 58 - 85
  • [25] OLAP over Probabilistic Data Cubes II: Parallel Materialization and Extended Aggregates
    Xie, Xike
    Zou, Kai
    Hao, Xingjun
    Pedersen, Torben Bach
    Jin, Peiquan
    Yang, Wei
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2020, 32 (10) : 1966 - 1981
  • [26] Uncertain probabilistic range queries on multidimensional data
    Bernad, Jorge
    Bobed, Carlos
    Mena, Eduardo
    INFORMATION SCIENCES, 2020, 537 (334-367) : 334 - 367
  • [27] A Decomposition Framework for Computing and Querying Multidimensional OLAP Data Cubes over Probabilistic Relational Data
    Cuzzocrea, Alfredo
    Gunopulos, Dimitrios
    FUNDAMENTA INFORMATICAE, 2014, 132 (02) : 239 - 266
  • [28] A distributed system for answering range queries on sensor network data
    Cuzzocrea, A
    Furfaro, F
    Greco, S
    Masciari, E
    Mazzeo, GM
    Saccà, D
    THIRD IEEE INTERNATIONAL CONFERENCE ON PERVASIVE COMPUTING AND COMMUNICATIONS, WORKSHOPS, 2005, : 369 - 373
  • [29] Answering Provenance-Aware Queries on RDF Data Cubes Under Memory Budgets
    Galarraga, Luis
    Ahlstrom, Kim
    Hose, Katja
    Pedersen, Torben Bach
    SEMANTIC WEB - ISWC 2018, PT I, 2018, 11136 : 547 - 565
  • [30] Answering ad hoc aggregate queries from data streams using prefix aggregate trees
    Moonjung Cho
    Jian Pei
    Ke Wang
    Knowledge and Information Systems, 2007, 12 : 301 - 329