Cube algorithms for very large compressed data warehouses

被引：0

作者：

Gao, H. ^{[1
]}

Li, J. ^{[1
]}

机构：

[1] Dept. of Computer Science and Eng., Harbin Institute of Technology, Harbin 150001, China

来源：

Ruan Jian Xue Bao/Journal of Software | 2001年 / 12卷 / 06期

关键词：

Computer aided analysis - Data compression - Data storage equipment - Data warehouses - Database systems - Input output programs - Online systems;

D O I：

暂无

中图分类号：

学科分类号：

摘要：

Data compression is an effective approach to improve the data warehouses. On line analysis processing (OLAP) is the most important application on the data warehouses, and Cube is one of the most operators in OLAP. Thus, it is a big challenge to develop efficient algorithms for compressed data warehouses. Although many algorithms to compute Cube have been developed recently, there is little to date in the literatures about Cube algorithms for compressed data warehouse. To the author's knowledge, there is only one paper that presented a Cube algorithm for compressed data warehouses with a special compression method called chunk-offset. A set of Cube algorithms for very large and compressed data warehouses are proposed in this paper. These algorithms operate directly on compressed datasets without the need of decompressing them first. They are applicable to a variety of data compression methods. The detail analysis of I/O and CPU cost are also given, and compared with the existed algorithms by experiment. The analytical and experimental results show that the algorithms proposed in this paper are more efficient than other existed ones.

引用

页码：830 / 839

共 50 条

[31] Generalized association rule mining algorithms based on data cube
Hong, Zhang
Bo, Zhang
Ling-Dong, Kong
Zheng-Xing, Cai
SNPD 2007: EIGHTH ACIS INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING, ARTIFICIAL INTELLIGENCE, NETWORKING, AND PARALLEL/DISTRIBUTED COMPUTING, VOL 2, PROCEEDINGS, 2007, : 803 - +
[32] Classical and Re-Learning Based Clustering Algorithms for Huge Data Warehouses
Shah, Syed Zubair Ahmad
Amjad, Mohammad
2016 SECOND INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE & COMMUNICATION TECHNOLOGY (CICT), 2016, : 209 - 213
[33] Algorithms for efficient processing of complex queries in node-partitioned data warehouses
Furtado, P
INTERNATIONAL DATABASE ENGINEERING AND APPLICATIONS SYMPOSIUM, PROCEEDINGS, 2004, : 117 - 122
[34] Algorithms and data structures for compressed-memory machines
Franaszek, PA
Heidelberger, P
Poff, DE
Robinson, JT
IBM JOURNAL OF RESEARCH AND DEVELOPMENT, 2001, 45 (02) : 245 - 258
[35] Towards Enabling Outlier Detection in Large, High Dimensional Data Warehouses
Georgoulas, Konstantinos
Kotidis, Yannis
SCIENTIFIC AND STATISTICAL DATABASE MANAGEMENT, SSDBM 2012, 2012, 7338 : 591 - 594
[36] Parallel data cube construction: Algorithms, theoretical analysis, and experimental evaluation
Jin, Ruoming
Yang, Ge
Agrawal, Gagan
Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 2003, 2913 : 74 - 84
[37] Scalable and efficient broadcasting algorithms for very large internetworks
Chatterjee, S
Bassiouni, MA
COMPUTER COMMUNICATIONS, 1998, 21 (10) : 912 - 923
[38] Parallel data cube construction: Algorithms, theoretical analysis, and experimental evaluation
Jin, RM
Yang, G
Agrawal, G
HIGH PERFORMANCE COMPUTING - HIPC 2003, 2003, 2913 : 74 - 84
[39] On the scaling of feedback algorithms for very large multicast groups
Fuhrmann, TT
Widmer, J
COMPUTER COMMUNICATIONS, 2001, 24 (5-6) : 539 - 547
[40] Scalable and efficient broadcasting algorithms for very large internetworks
Chatterjee, S
Bassiouni, MA
1996 IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS - CONVERGING TECHNOLOGIES FOR TOMORROW'S APPLICATIONS, VOLS. 1-3, 1996, : 1642 - 1647

← 1 2 3 4 5 →