Efficient retrieval of multidimensional datasets through parallel I/O

被引:4
|
作者
Prabhakar, S [1 ]
Abdel-Ghaffar, K [1 ]
Agrawal, D [1 ]
El Abbadi, A [1 ]
机构
[1] Purdue Univ, W Lafayette, IN 47907 USA
关键词
D O I
10.1109/HIPC.1998.738011
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Many scientific and engineering applications process large multidimensional datasets. An important access pattern for these applications is the retrieval of data corresponding to ranges of values in multiple dimensions. Performance is limited by disks largely due to high disk latencies. Tiling and distributing the data across multiple disks is an effective technique for improving performance through parallel I/O. The distribution of tiles across the disks is an important factor in achieving gains. Several schemes for declustering multidimensional data to improve the performance of range queries have been proposed in the literature. We extend the class of Cyclic schemes which have been developed earlier for two-dimensional data to multi pie dimensions. We establish important properties of Cyclic schemes, based upon which we reduce the search space for determining good declustering schemes within the class of Cyclic schemes. Through experimental evaluation, we establish that the Cyclic schemes are superior to other declustering schemes, including the state-of-the-art, both in terms of the degree of parallelism and robustness.
引用
收藏
页码:375 / 382
页数:8
相关论文
共 50 条
  • [1] Efficient disk allocation schemes for parallel retrieval of multidimensional grid data
    Chen, CM
    Sinha, R
    Bhatia, R
    THIRTEENTH INTERNATIONAL CONFERENCE ON SCIENTIFIC AND STATISTICAL DATABASE MANAGEMENT, PROCEEDINGS, 2001, : 213 - 222
  • [2] PIDX: Efficient Parallel I/O for Multi-resolution Multi-dimensional Scientific Datasets
    Kumar, Sidharth
    Vishwanath, Venkatram
    Carns, Philip
    Summa, Brian
    Scorzelli, Giorgio
    Pascucci, Valerio
    Ross, Robert
    Chen, Jacqueline
    Kolla, Hemanth
    Grout, Ray
    2011 IEEE INTERNATIONAL CONFERENCE ON CLUSTER COMPUTING (CLUSTER), 2011, : 103 - 111
  • [3] Efficient parallel I/O in seismic imaging
    Oldfield, RA
    Womble, DE
    Ober, CC
    INTERNATIONAL JOURNAL OF HIGH PERFORMANCE COMPUTING APPLICATIONS, 1998, 12 (03): : 333 - 344
  • [4] An Efficient Encoding Scheme for Dynamic Multidimensional Datasets
    Omar, Mehnuma Tabassum
    Hasan, K. M. Azharul
    PATTERN RECOGNITION AND MACHINE INTELLIGENCE, PREMI 2017, 2017, 10597 : 517 - 523
  • [5] Parallel Computation of Dominance Scores for Multidimensional Datasets on GPUs
    Chen, Wei-Mei
    Tsai, Hsin-Hung
    Ling, Joon Fong
    IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2024, 35 (06) : 764 - 776
  • [6] Efficient parallel I/O on SCI connected clusters
    Worringen, J
    CLUSTER 2000: IEEE INTERNATIONAL CONFERENCE ON CLUSTER COMPUTING, PROCEEDINGS, 2000, : 371 - 372
  • [7] Efficient distributed algorithms for parallel I/O scheduling
    Wu, JJ
    Lin, YF
    Liu, PF
    11TH INTERNATIONAL CONFERENCE ON PARALLEL AND DISTRIBUTED SYSTEMS, VOL I, PROCEEDINGS, 2005, : 460 - 466
  • [8] Extended collective I/O for efficient retrieval of large objects
    More, S
    Choudhary, A
    FIFTH INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING, PROCEEDINGS, 1998, : 359 - 366
  • [9] EFFICIENT FUSION OF MULTIDIMENSIONAL DESCRIPTORS FOR IMAGE RETRIEVAL
    Bhowmik, Neelanjan
    Gonzalez, Ricardo, V
    Gouet-Brunet, Valerie
    Pedrini, Helio
    Bloch, Gabriel
    2014 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2014, : 5766 - 5770
  • [10] Scaling Parallel I/O Performance through I/O Delegate and Caching System
    Nisar, Arifa
    Liao, Wei-keng
    Choudhary, Alok
    INTERNATIONAL CONFERENCE FOR HIGH PERFORMANCE COMPUTING, NETWORKING, STORAGE AND ANALYSIS, 2008, : 487 - 498