Efficient aggregation algorithms on very large compressed data warehouses

被引:2
|
作者
Li, JZ [1 ]
Li, YS
Srivastava, J
机构
[1] Harbin Inst Technol, Dept Comp Sci & Engn, Harbin 150001, Peoples R China
[2] Beijing Inst Technol, Beijing 100876, Peoples R China
[3] Univ Minnesota, Minneapolis, MN 55455 USA
基金
中国国家自然科学基金;
关键词
OLAP; aggregation; data warehouse;
D O I
10.1007/BF02948809
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Multidimensional aggregation is a dominant operation on data warehouses for on-line analytical processing (OLAP). Many efficient algorithms to compute multidimensional aggregation on relational database based data warehouses have been developed. However, to our knowledge, there is nothing to date in the literature about aggregation algorithms on multidimensional data warehouses that store datasets in multidimensional arrays rather than in tables. This paper presents a set of multidimensional aggregation algorithms on very large and compressed multidimensional data warehouses. These algorithms operate directly on compressed datasets in multidimensional data warehouses without the need to first decompress them. They are applicable to a variety of data compression methods. The algorithms have different performance behavior as a function of dataset parameters, sizes of outputs and main memory availability. The algorithms are described and analyzed with respect to the I/O and CPU costs. A decision procedure to select the most efficient algorithm, given an aggregation request, is also proposed. The analytical and experimental results show that the algorithms are more efficient than the traditional aggregation algorithms.
引用
收藏
页码:213 / 229
页数:17
相关论文
共 50 条
  • [41] Efficient Algorithms for Highly Compressed Data: The Word Problem in Generalized Higman Groups Is in P
    Jürn Laun
    Theory of Computing Systems, 2014, 55 : 742 - 770
  • [42] Designing data warehouses for equipment management system with genetic algorithms
    Chen, K. -Y.
    Chen, M. -C.
    Liu, W. -Y.
    INTERNATIONAL JOURNAL OF PRODUCTION RESEARCH, 2008, 46 (21) : 6113 - 6135
  • [43] Efficient Parameters for Compressed Sensing Recovery Algorithms
    Shalaby, Wafaa A.
    Saad, Waleed
    Shokair, Mona
    Dessouky, Moawad
    WIRELESS PERSONAL COMMUNICATIONS, 2017, 94 (03) : 1715 - 1736
  • [44] EFFICIENT DECODING OF COMPRESSED DATA
    BASSIOUNI, MA
    MUKHERJEE, A
    JOURNAL OF THE AMERICAN SOCIETY FOR INFORMATION SCIENCE, 1995, 46 (01): : 1 - 8
  • [45] The lord of the rings:: Efficient maintanence of views at data warehouses
    Agrawal, D
    El Abbadi, A
    Mostéfaoui, A
    Raymal, M
    Roy, M
    DISTRIBUTED COMPUTING, PROCEEDINGS, 2002, 2508 : 33 - 47
  • [46] Performance Analysis of Federated Learning Aggregation Algorithms for Secure and Efficient Data Handling
    Agarwal, Vaibhav
    Attigeri, Girija
    Kolekar, Sucheta, V
    INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2022, 36 (14)
  • [47] Hybrid greedy and genetic algorithms for optimization of relational data warehouses
    Velinov, Goran
    Gligoroski, Danilo
    Kon-Popovska, Margita
    PROCEEDINGS OF THE IASTED INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND APPLICATIONS, 2007, : 470 - +
  • [48] Energy efficient ant colony algorithms for data aggregation in wireless sensor networks
    Lin, Chi
    Wu, Guowei
    Xia, Feng
    Li, Mingchu
    Yao, Lin
    Pei, Zhongyi
    JOURNAL OF COMPUTER AND SYSTEM SCIENCES, 2012, 78 (06) : 1686 - 1702
  • [49] Efficient algorithms for maximum lifetime data gathering and aggregation in wireless sensor networks
    Kalpakis, K
    Dasgupta, K
    Namjoshi, P
    COMPUTER NETWORKS-THE INTERNATIONAL JOURNAL OF COMPUTER AND TELECOMMUNICATIONS NETWORKING, 2003, 42 (06): : 697 - 716
  • [50] Efficient Algorithms for Kernel Aggregation Queries
    Chan, Tsz Nam
    Hou, Leong U.
    Cheng, Reynold
    Yiu, Man Lung
    Mittal, Shivansh
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2022, 34 (06) : 2726 - 2739