Cube algorithms for very large compressed data warehouses

被引:0
|
作者
Gao, H. [1 ]
Li, J. [1 ]
机构
[1] Dept. of Computer Science and Eng., Harbin Institute of Technology, Harbin 150001, China
来源
Ruan Jian Xue Bao/Journal of Software | 2001年 / 12卷 / 06期
关键词
Computer aided analysis - Data compression - Data storage equipment - Data warehouses - Database systems - Input output programs - Online systems;
D O I
暂无
中图分类号
学科分类号
摘要
Data compression is an effective approach to improve the data warehouses. On line analysis processing (OLAP) is the most important application on the data warehouses, and Cube is one of the most operators in OLAP. Thus, it is a big challenge to develop efficient algorithms for compressed data warehouses. Although many algorithms to compute Cube have been developed recently, there is little to date in the literatures about Cube algorithms for compressed data warehouse. To the author's knowledge, there is only one paper that presented a Cube algorithm for compressed data warehouses with a special compression method called chunk-offset. A set of Cube algorithms for very large and compressed data warehouses are proposed in this paper. These algorithms operate directly on compressed datasets without the need of decompressing them first. They are applicable to a variety of data compression methods. The detail analysis of I/O and CPU cost are also given, and compared with the existed algorithms by experiment. The analytical and experimental results show that the algorithms proposed in this paper are more efficient than other existed ones.
引用
收藏
页码:830 / 839
相关论文
共 50 条
  • [31] Generalized association rule mining algorithms based on data cube
    Hong, Zhang
    Bo, Zhang
    Ling-Dong, Kong
    Zheng-Xing, Cai
    SNPD 2007: EIGHTH ACIS INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING, ARTIFICIAL INTELLIGENCE, NETWORKING, AND PARALLEL/DISTRIBUTED COMPUTING, VOL 2, PROCEEDINGS, 2007, : 803 - +
  • [32] Classical and Re-Learning Based Clustering Algorithms for Huge Data Warehouses
    Shah, Syed Zubair Ahmad
    Amjad, Mohammad
    2016 SECOND INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE & COMMUNICATION TECHNOLOGY (CICT), 2016, : 209 - 213
  • [33] Algorithms for efficient processing of complex queries in node-partitioned data warehouses
    Furtado, P
    INTERNATIONAL DATABASE ENGINEERING AND APPLICATIONS SYMPOSIUM, PROCEEDINGS, 2004, : 117 - 122
  • [34] Algorithms and data structures for compressed-memory machines
    Franaszek, PA
    Heidelberger, P
    Poff, DE
    Robinson, JT
    IBM JOURNAL OF RESEARCH AND DEVELOPMENT, 2001, 45 (02) : 245 - 258
  • [35] Towards Enabling Outlier Detection in Large, High Dimensional Data Warehouses
    Georgoulas, Konstantinos
    Kotidis, Yannis
    SCIENTIFIC AND STATISTICAL DATABASE MANAGEMENT, SSDBM 2012, 2012, 7338 : 591 - 594
  • [36] Parallel data cube construction: Algorithms, theoretical analysis, and experimental evaluation
    Jin, Ruoming
    Yang, Ge
    Agrawal, Gagan
    Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 2003, 2913 : 74 - 84
  • [37] Scalable and efficient broadcasting algorithms for very large internetworks
    Chatterjee, S
    Bassiouni, MA
    COMPUTER COMMUNICATIONS, 1998, 21 (10) : 912 - 923
  • [38] Parallel data cube construction: Algorithms, theoretical analysis, and experimental evaluation
    Jin, RM
    Yang, G
    Agrawal, G
    HIGH PERFORMANCE COMPUTING - HIPC 2003, 2003, 2913 : 74 - 84
  • [39] On the scaling of feedback algorithms for very large multicast groups
    Fuhrmann, TT
    Widmer, J
    COMPUTER COMMUNICATIONS, 2001, 24 (5-6) : 539 - 547
  • [40] Scalable and efficient broadcasting algorithms for very large internetworks
    Chatterjee, S
    Bassiouni, MA
    1996 IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS - CONVERGING TECHNOLOGIES FOR TOMORROW'S APPLICATIONS, VOLS. 1-3, 1996, : 1642 - 1647