Optimizing Distributed Join for Array Database System

Cited by: 0
Authors
Li, Jing [1]
Li, Hui [1]
Chen, Mei [1]
Zhu, Ming [2]
Affiliations
[1] Guizhou Univ, Guizhou Engn Lab ACMIS, Guiyang, Guizhou, Peoples R China
[2] Chinese Acad Sci, Natl Astron Observ, Beijing, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
distributed array database; join algorithm; network overhead; CPU cost
DOI
10.1109/ITME.2016.127
Chinese Library Classification number
TP39 [Computer applications]
Discipline classification code
081203; 0835
Abstract
With the sustained and rapid development of science and technology, the explosion of scientific data available for analysis has put heavy pressure on data management. To relieve this pressure, scientists use array databases instead of RDBMSs to store and manage scientific data. However, our experiments show that although the array database outperforms the RDBMS on simple queries, it does not support complex multi-table join queries well. Because network communication is the slowest component of multi-table join queries in distributed parallel databases, we introduce an optimized join algorithm that not only minimizes network communication by optimizing the transfer schedule, but also reduces CPU utilization, preventing the CPU from becoming the bottleneck for compute-intensive work. Our evaluation on real scientific data and databases shows that the optimized algorithm adapts to diverse datasets and query types, and that it makes the array database outperform the RDBMS on multi-table queries in real workloads.
Pages: 640-644
Page count: 5