Parallel processing of Multi-Join Expansion_Aggregate data cube query in high performance database systems

被引：1

作者：

Taniar, D ^{[1
]}

Tan, RBN ^{[1
]}

机构：

[1] Monash Univ, Sch Business Syst, Clayton, Vic 3800, Australia

来源：

I-SPAN'02: INTERNATIONAL SYMPOSIUM ON PARALLEL ARCHITECTURES, ALGORITHMS AND NETWORKS, PROCEEDINGS | 2002年

关键词：

D O I：

10.1109/ISPAN.2002.1004260

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Data cube queries containing aggregate functions often combine multiple tables through join operations. We can extend this to "Multi-Join Expansion_Aggregate" data cube queries by using more than one aggregate functions in "SELECT" statement in conjunction with relational operators. In parallel processing for such queries, it must be decided which attribute to use as a partitioning attribute, in particular, join attribute or cube-by. Based on the partitioning attribute, we introduce three parallel multi-join expansion-aggregate data cube query methods, namely Multi-join Partition Method (MPM), Expansion Partition Method (EPM) and Early Expansion Partition with Replication Method (EPRM). All three methods use the join attribute and cube-by as the partitioning attribute. Performance evaluation of the three parallel processing methods is also carried out and presented here.

引用

页码：51 / 56

页数：6

共 14 条

[1] Distributed multi-join query processing in data grids
Yang, Donghua
Li, Hanzhong
INFORMATION SCIENCES, 2007, 177 (17) : 3574 - 3591
[2] Join and multi-join processing in data integration systems
Tan, KL
Eng, PK
Ooi, BC
Zhang, M
DATA & KNOWLEDGE ENGINEERING, 2002, 40 (02) : 217 - 239
[3] Optimizing Communication for Multi-Join Query Processing in Cloud Data Warehouses
Kurunji, Swathi
Ge, Tingjian
Fu, Xinwen
Liu, Benyuan
Chen, Cindy X.
INTERNATIONAL JOURNAL OF GRID AND HIGH PERFORMANCE COMPUTING, 2013, 5 (04) : 113 - 130
[4] Parallel "GroupBy-Before-Join" query processing for high performance parallel/distributed database systems
Taniar, David
Rahayu, Wenny
20TH INTERNATIONAL CONFERENCE ON ADVANCED INFORMATION NETWORKING AND APPLICATIONS, VOL 1, PROCEEDINGS, 2006, : 693 - +
[5] Performance evaluation of parallel GroupBy-Before-Join query processing in high performance database systems
Taniar, D
Rahayu, JW
Ekonomosa, H
HIGH-PERFORMANCE COMPUTING AND NETWORKING, 2001, 2110 : 241 - 250
[6] ON RESOURCE SCHEDULING OF MULTI-JOIN QUERIES IN PARALLEL DATABASE-SYSTEMS
TAN, KL
LU, HJ
INFORMATION PROCESSING LETTERS, 1993, 48 (04) : 189 - 195
[7] Performance analysis of "Groupby-After-Join" query processing in parallel database systems
Taniar, D
Tan, RBN
Leung, CHC
Liu, KH
INFORMATION SCIENCES, 2004, 168 (1-4) : 25 - 50
[8] Data partitioning for single-round multi-join evaluation in massively parallel systems
Ameloot, Tom J.
Geck, Gaetano
Ketsman, Bas
Neven, Frank
Schwentick, Thomas
SIGMOD RECORD, 2016, 45 (01) : 33 - 40
[9] Reasoning on Data Partitioning for Single-Round Multi-Join Evaluation in Massively Parallel Systems
Ameloot, Tom J.
Geck, Gaetano
Ketsman, Bas
Neven, Frank
Schwentick, Thomas
COMMUNICATIONS OF THE ACM, 2017, 60 (03) : 93 - 100
[10] Parallel data cube construction for high performance on-line analytical processing
Goil, S
Choudhary, A
FOURTH INTERNATIONAL CONFERENCE ON HIGH-PERFORMANCE COMPUTING, PROCEEDINGS, 1997, : 10 - 15

← 1 2 →