Parallel data cube construction: Algorithms, theoretical analysis, and experimental evaluation

Cited by: 0
Authors
Jin, RM [1]
Yang, G [1]
Agrawal, G [1]
Affiliations
[1] Ohio State Univ, Dept Comp & Informat Sci, Columbus, OH 43210 USA
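Source
Keywords
DOI
Not available
Chinese Library Classification number
TP3 [Computing technology, computer technology];
Discipline classification code
0812;
Abstract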
Data cube construction is a commonly used operation in data warehouses. Because of the volume of data stored and analyzed in a data warehouse, and the amount of computation involved in data cube construction, it is natural to consider parallel machines for this operation. This paper presents two new algorithms for parallel data cube construction, along with their theoretical analysis and experimental evaluation. Our work is based upon a new data structure, called the aggregation tree, which results in minimally bounded memory requirements. An aggregation tree is parameterized by the ordering of dimensions. We prove that the same ordering of the dimensions minimizes both the computational and the communication requirements for both algorithms. We also describe a method for partitioning the initial array, which again minimizes the communication volume for both algorithms. Experimental results further validate the theoretical results.
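For readers unfamiliar with the operation the abstract refers to, the following is a minimal sequential sketch of data cube construction: it computes every group-by (all 2^n subsets of the n dimensions) of a small array, deriving each group-by from an already computed parent that retains one more dimension. It is not the paper's aggregation tree and it is not parallel; the names `aggregate` and `build_cube`, the dictionary-based array representation, and the rule for choosing a parent are all assumptions made for illustration.

```python
from itertools import combinations


def aggregate(cells, keep):
    """Sum `cells` (coordinate tuple -> value) onto the positions listed in `keep`."""
    out = {}
    for coords, value in cells.items():
        key = tuple(coords[i] for i in keep)
        out[key] = out.get(key, 0) + value
    return out


def build_cube(cells, n_dims):
    """Return a dict mapping each retained-dimension tuple to its group-by.

    Hypothetical helper: `cells` maps full n_dims-coordinate tuples to values.
    """
    all_dims = tuple(range(n_dims))
    cube = {all_dims: dict(cells)}
    # Walk the lattice from larger to smaller group-bys so that each one is
    # computed from a parent with exactly one more dimension, not from raw data.
    for size in range(n_dims - 1, -1, -1):
        for dims in combinations(all_dims, size):
            # Illustrative parent choice: add back the smallest missing dimension.
            missing = min(d for d in all_dims if d not in dims)
            parent_dims = tuple(sorted(dims + (missing,)))
            parent = cube[parent_dims]
            keep = [parent_dims.index(d) for d in dims]
            cube[dims] = aggregate(parent, keep)
    return cube


# Usage: a 2 x 2 array with dimensions 0 and 1.
cells = {(0, 0): 1, (0, 1): 2, (1, 0): 3, (1, 1): 4}
cube = build_cube(cells, 2)
```

In this sketch the parent is chosen by a fixed, arbitrary rule. The abstract's claim is that the choice of dimension ordering matters for computation, communication, and memory, which this toy version does not attempt to optimize.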
Pages: 74 - 84
Number of pages: 11
Related papers
50 in total
  • [1] Parallel data cube construction: Algorithms, theoretical analysis, and experimental evaluation
    Jin, Ruoming
    Yang, Ge
    Agrawal, Gagan
    Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 2003, 2913 : 74 - 84
  • [2] A New Parallel Data Cube Construction Scheme
    Jin, Dong
    Tsuji, Tatsuo
    INTERNATIONAL JOURNAL OF GRID AND HIGH PERFORMANCE COMPUTING, 2012, 4 (02) : 32 - 45
  • [3] Communication and memory optimal parallel data cube construction
    Jin, RM
    Vaidyanathan, K
    Yang, G
    Agrawal, G
    IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2005, 16 (12) : 1105 - 1119
  • [4] Communication and memory optimal parallel data cube construction
    Jin, RM
    Yang, G
    Vaidyanathan, K
    Agrawal, G
    2003 INTERNATIONAL CONFERENCE ON PARALLEL PROCESSING, PROCEEDINGS, 2003, : 573 - 580
  • [5] Using tiling to scale parallel data cube construction
    Jin, RM
    Vaidyanathan, K
    Yang, G
    Agrawal, G
    2004 INTERNATIONAL CONFERENCE ON PARALLEL PROCESSING, PROCEEDINGS, 2004, : 365 - 372
  • [6] Implementing data cube construction using a cluster middleware: algorithms, implementation experience, and performance evaluation
    Yang, G
    Jin, RM
    Agrawal, G
    FUTURE GENERATION COMPUTER SYSTEMS, 2003, 19 (04) : 533 - 550
  • [7] Implementing data cube construction using a cluster middleware: Algorithms, implementation experience, and performance evaluation
    Yang, G
    Jin, RM
    Agrawal, G
    CCGRID 2002: 2ND IEEE/ACM INTERNATIONAL SYMPOSIUM ON CLUSTER COMPUTING AND THE GRID, PROCEEDINGS, 2002, : 84 - 92
  • [8] Parallel Data Cube Construction Based on an Extendible Multidimensional Array
    Jin, Dong
    Tsuji, Tatsuo
    TRUSTCOM 2011: 2011 INTERNATIONAL JOINT CONFERENCE OF IEEE TRUSTCOM-11/IEEE ICESS-11/FCST-11, 2011, : 1139 - 1145
  • [9] Parallel ROLAP data cube construction on shared-nothing multiprocessors
    Chen, Y
    Dehne, F
    Eavis, T
    Rau-Chaplin, A
    DISTRIBUTED AND PARALLEL DATABASES, 2004, 15 (03) : 219 - 236
  • [10] Parallel ROLAP Data Cube Construction on Shared-Nothing Multiprocessors
    Ying Chen
    Frank Dehne
    Todd Eavis
    Andrew Rau-Chaplin
    Distributed and Parallel Databases, 2004, 15 : 219 - 236