Efficient aggregation algorithms on very large compressed data warehouses

被引:2
|
作者
Li, JZ [1 ]
Li, YS
Srivastava, J
机构
[1] Harbin Inst Technol, Dept Comp Sci & Engn, Harbin 150001, Peoples R China
[2] Beijing Inst Technol, Beijing 100876, Peoples R China
[3] Univ Minnesota, Minneapolis, MN 55455 USA
基金
中国国家自然科学基金;
关键词
OLAP; aggregation; data warehouse;
D O I
10.1007/BF02948809
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Multidimensional aggregation is a dominant operation on data warehouses for on-line analytical processing (OLAP). Many efficient algorithms to compute multidimensional aggregation on relational database based data warehouses have been developed. However, to our knowledge, there is nothing to date in the literature about aggregation algorithms on multidimensional data warehouses that store datasets in multidimensional arrays rather than in tables. This paper presents a set of multidimensional aggregation algorithms on very large and compressed multidimensional data warehouses. These algorithms operate directly on compressed datasets in multidimensional data warehouses without the need to first decompress them. They are applicable to a variety of data compression methods. The algorithms have different performance behavior as a function of dataset parameters, sizes of outputs and main memory availability. The algorithms are described and analyzed with respect to the I/O and CPU costs. A decision procedure to select the most efficient algorithm, given an aggregation request, is also proposed. The analytical and experimental results show that the algorithms are more efficient than the traditional aggregation algorithms.
引用
收藏
页码:213 / 229
页数:17
相关论文
共 50 条
  • [1] Efficient aggregation algorithms on very large compressed data warehouses
    Jianzhong Li
    Yingshu Li
    Jaideep Srivastava
    Journal of Computer Science and Technology, 2000, 15 : 213 - 229
  • [2] Aggregation algorithms for very large compressed data warehouses
    Li, JZ
    Rotem, D
    Srivastava, J
    PROCEEDINGS OF THE TWENTY-FIFTH INTERNATIONAL CONFERENCE ON VERY LARGE DATA BASES, 1999, : 651 - 662
  • [3] Efficient aggregation algorithms for compressed data warehouses
    Li, JZ
    Srivastava, J
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2002, 14 (03) : 515 - 529
  • [4] Cube algorithms for very large compressed data warehouses
    Gao, H.
    Li, J.
    Ruan Jian Xue Bao/Journal of Software, 2001, 12 (06): : 830 - 839
  • [5] Efficient Aggregation Algorithms on Very LargeCompressed Data Warehouses
    李建中
    李英姝
    Jaideep Srivastava
    Journal of Computer Science and Technology, 2000, (03) : 213 - 229
  • [6] Efficient algorithms for on-line analysis processing on compressed data warehouses
    Li, Jianzhong
    Gao, Hong
    2007 IFIP INTERNATIONAL CONFERENCE ON NETWORK AND PARALLEL COMPUTING WORKSHOPS, PROCEEDINGS, 2007, : 11 - 12
  • [7] Data classification and management in very large data warehouses
    Chelluri, K
    Kumar, V
    THIRD INTERNATIONAL WORKSHOP ON ADVANCED ISSUES OF E-COMMERCE AND WEB-BASED INFORMATION SYSTEMS, PROCEEDINGS, 2001, : 52 - 57
  • [8] Querying Compressed Data in Data Warehouses
    Anindya Datta
    Helen Thomas
    Information Technology and Management, 2002, 3 (4) : 353 - 386
  • [9] Efficient Aggregation Algorithms for Probabilistic Data
    Jayram, T. S.
    Kale, Satyen
    Vee, Erik
    PROCEEDINGS OF THE EIGHTEENTH ANNUAL ACM-SIAM SYMPOSIUM ON DISCRETE ALGORITHMS, 2007, : 346 - 355
  • [10] DWS-AQA: A cost effective approach for very large data warehouses
    Bernardino, J
    Furtado, P
    Madeira, H
    IDEAS 2002: INTERNATIONAL DATABASE ENGINEERING AND APPLICATIONS SYMPOSIUM, PROCEEDINGS, 2002, : 233 - 242