Efficient aggregation algorithms on very large compressed data warehouses

Cited by: 2

Authors:

Li, JZ [1]

Li, YS

Srivastava, J

Affiliations:

[1] Harbin Inst Technol, Dept Comp Sci & Engn, Harbin 150001, Peoples R China

[2] Beijing Inst Technol, Beijing 100876, Peoples R China

[3] Univ Minnesota, Minneapolis, MN 55455 USA

Source:

JOURNAL OF COMPUTER SCIENCE AND TECHNOLOGY | 2000 / Vol. 15 / No. 03

Funding:

National Natural Science Foundation of China;

Keywords:

OLAP; aggregation; data warehouse;

DOI:

10.1007/BF02948809

Chinese Library Classification (CLC) number:

TP3 [Computing technology, computer technology];

Discipline classification code:

0812;

Abstract:

Multidimensional aggregation is a dominant operation on data warehouses for on-line analytical processing (OLAP). Many efficient algorithms to compute multidimensional aggregation on relational database based data warehouses have been developed. However, to our knowledge, there is nothing to date in the literature about aggregation algorithms on multidimensional data warehouses that store datasets in multidimensional arrays rather than in tables. This paper presents a set of multidimensional aggregation algorithms on very large and compressed multidimensional data warehouses. These algorithms operate directly on compressed datasets in multidimensional data warehouses without the need to first decompress them. They are applicable to a variety of data compression methods. The algorithms have different performance behavior as a function of dataset parameters, sizes of outputs and main memory availability. The algorithms are described and analyzed with respect to the I/O and CPU costs. A decision procedure to select the most efficient algorithm, given an aggregation request, is also proposed. The analytical and experimental results show that the algorithms are more efficient than the traditional aggregation algorithms.

Citation

Pages: 213 - 229

Number of pages: 17

50 related records in total

[1] Efficient aggregation algorithms on very large compressed data warehouses
Jianzhong Li
Yingshu Li
Jaideep Srivastava
Journal of Computer Science and Technology, 2000, 15 : 213 - 229
[2] Aggregation algorithms for very large compressed data warehouses
Li, JZ
Rotem, D
Srivastava, J
PROCEEDINGS OF THE TWENTY-FIFTH INTERNATIONAL CONFERENCE ON VERY LARGE DATA BASES, 1999, : 651 - 662
[3] Efficient aggregation algorithms for compressed data warehouses
Li, JZ
Srivastava, J
IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2002, 14 (03) : 515 - 529
[4] Cube algorithms for very large compressed data warehouses
Gao, H.
Li, J.
Ruan Jian Xue Bao/Journal of Software, 2001, 12 (06) : 830 - 839
[5] Efficient Aggregation Algorithms on Very Large Compressed Data Warehouses
李建中
李英姝
Jaideep Srivastava
Journal of Computer Science and Technology, 2000, (03) : 213 - 229
[6] Efficient algorithms for on-line analysis processing on compressed data warehouses
Li, Jianzhong
Gao, Hong
2007 IFIP INTERNATIONAL CONFERENCE ON NETWORK AND PARALLEL COMPUTING WORKSHOPS, PROCEEDINGS, 2007, : 11 - 12
[7] Data classification and management in very large data warehouses
Chelluri, K
Kumar, V
THIRD INTERNATIONAL WORKSHOP ON ADVANCED ISSUES OF E-COMMERCE AND WEB-BASED INFORMATION SYSTEMS, PROCEEDINGS, 2001, : 52 - 57
[8] Querying Compressed Data in Data Warehouses
Anindya Datta
Helen Thomas
Information Technology and Management, 2002, 3 (4) : 353 - 386
[9] Efficient Aggregation Algorithms for Probabilistic Data
Jayram, T. S.
Kale, Satyen
Vee, Erik
PROCEEDINGS OF THE EIGHTEENTH ANNUAL ACM-SIAM SYMPOSIUM ON DISCRETE ALGORITHMS, 2007, : 346 - 355
[10] DWS-AQA: A cost effective approach for very large data warehouses
Bernardino, J
Furtado, P
Madeira, H
IDEAS 2002: INTERNATIONAL DATABASE ENGINEERING AND APPLICATIONS SYMPOSIUM, PROCEEDINGS, 2002, : 233 - 242
