Parallel processing of Multi-Join Expansion_Aggregate data cube query in high performance database systems

被引:1
|
作者
Taniar, D [1 ]
Tan, RBN [1 ]
机构
[1] Monash Univ, Sch Business Syst, Clayton, Vic 3800, Australia
关键词
D O I
10.1109/ISPAN.2002.1004260
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Data cube queries containing aggregate functions often combine multiple tables through join operations. We can extend this to "Multi-Join Expansion_Aggregate" data cube queries by using more than one aggregate functions in "SELECT" statement in conjunction with relational operators. In parallel processing for such queries, it must be decided which attribute to use as a partitioning attribute, in particular, join attribute or cube-by. Based on the partitioning attribute, we introduce three parallel multi-join expansion-aggregate data cube query methods, namely Multi-join Partition Method (MPM), Expansion Partition Method (EPM) and Early Expansion Partition with Replication Method (EPRM). All three methods use the join attribute and cube-by as the partitioning attribute. Performance evaluation of the three parallel processing methods is also carried out and presented here.
引用
收藏
页码:51 / 56
页数:6
相关论文
共 14 条
  • [1] Distributed multi-join query processing in data grids
    Yang, Donghua
    Li, Hanzhong
    INFORMATION SCIENCES, 2007, 177 (17) : 3574 - 3591
  • [2] Join and multi-join processing in data integration systems
    Tan, KL
    Eng, PK
    Ooi, BC
    Zhang, M
    DATA & KNOWLEDGE ENGINEERING, 2002, 40 (02) : 217 - 239
  • [3] Optimizing Communication for Multi-Join Query Processing in Cloud Data Warehouses
    Kurunji, Swathi
    Ge, Tingjian
    Fu, Xinwen
    Liu, Benyuan
    Chen, Cindy X.
    INTERNATIONAL JOURNAL OF GRID AND HIGH PERFORMANCE COMPUTING, 2013, 5 (04) : 113 - 130
  • [4] Parallel "GroupBy-Before-Join" query processing for high performance parallel/distributed database systems
    Taniar, David
    Rahayu, Wenny
    20TH INTERNATIONAL CONFERENCE ON ADVANCED INFORMATION NETWORKING AND APPLICATIONS, VOL 1, PROCEEDINGS, 2006, : 693 - +
  • [5] Performance evaluation of parallel GroupBy-Before-Join query processing in high performance database systems
    Taniar, D
    Rahayu, JW
    Ekonomosa, H
    HIGH-PERFORMANCE COMPUTING AND NETWORKING, 2001, 2110 : 241 - 250
  • [6] ON RESOURCE SCHEDULING OF MULTI-JOIN QUERIES IN PARALLEL DATABASE-SYSTEMS
    TAN, KL
    LU, HJ
    INFORMATION PROCESSING LETTERS, 1993, 48 (04) : 189 - 195
  • [7] Performance analysis of "Groupby-After-Join" query processing in parallel database systems
    Taniar, D
    Tan, RBN
    Leung, CHC
    Liu, KH
    INFORMATION SCIENCES, 2004, 168 (1-4) : 25 - 50
  • [8] Data partitioning for single-round multi-join evaluation in massively parallel systems
    Ameloot, Tom J.
    Geck, Gaetano
    Ketsman, Bas
    Neven, Frank
    Schwentick, Thomas
    SIGMOD RECORD, 2016, 45 (01) : 33 - 40
  • [9] Reasoning on Data Partitioning for Single-Round Multi-Join Evaluation in Massively Parallel Systems
    Ameloot, Tom J.
    Geck, Gaetano
    Ketsman, Bas
    Neven, Frank
    Schwentick, Thomas
    COMMUNICATIONS OF THE ACM, 2017, 60 (03) : 93 - 100
  • [10] Parallel data cube construction for high performance on-line analytical processing
    Goil, S
    Choudhary, A
    FOURTH INTERNATIONAL CONFERENCE ON HIGH-PERFORMANCE COMPUTING, PROCEEDINGS, 1997, : 10 - 15