Efficient retrieval of multidimensional datasets through parallel I/O

被引:4
|
作者
Prabhakar, S [1 ]
Abdel-Ghaffar, K [1 ]
Agrawal, D [1 ]
El Abbadi, A [1 ]
机构
[1] Purdue Univ, W Lafayette, IN 47907 USA
关键词
D O I
10.1109/HIPC.1998.738011
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Many scientific and engineering applications process large multidimensional datasets. An important access pattern for these applications is the retrieval of data corresponding to ranges of values in multiple dimensions. Performance is limited by disks largely due to high disk latencies. Tiling and distributing the data across multiple disks is an effective technique for improving performance through parallel I/O. The distribution of tiles across the disks is an important factor in achieving gains. Several schemes for declustering multidimensional data to improve the performance of range queries have been proposed in the literature. We extend the class of Cyclic schemes which have been developed earlier for two-dimensional data to multi pie dimensions. We establish important properties of Cyclic schemes, based upon which we reduce the search space for determining good declustering schemes within the class of Cyclic schemes. Through experimental evaluation, we establish that the Cyclic schemes are superior to other declustering schemes, including the state-of-the-art, both in terms of the degree of parallelism and robustness.
引用
收藏
页码:375 / 382
页数:8
相关论文
共 50 条
  • [31] Efficient Multidimensional Harmonic Retrieval: A Hierarchical Signal Separation Framework
    Lin, Chun-Hung
    Fang, Wen-Hsien
    IEEE SIGNAL PROCESSING LETTERS, 2013, 20 (05) : 427 - 430
  • [32] Parallel File System Analysis Through Application I/O Tracing
    Wright, S. A.
    Hammond, S. D.
    Pennycook, S. J.
    Bird, R. F.
    Herdman, J. A.
    Miller, I.
    Vadgama, A.
    Bhalerao, A.
    Jarvis, S. A.
    COMPUTER JOURNAL, 2013, 56 (02): : 141 - 155
  • [33] An optimization of Apriori algorithm through the usage of parallel I/O and hints
    Pérez, MS
    Pons, RA
    García, F
    Carretero, J
    Córdoba, ML
    ROUGH SETS AND CURRENT TRENDS IN COMPUTING, PROCEEDINGS, 2002, 2475 : 449 - 452
  • [34] Improvements to Efficient Retrieval of Very Large Temporal Datasets with the TravelLight Method
    de Carvalho, Alexandre Valle
    Oliveira, Marco Amaro
    Rocha, Artur
    PROCEEDINGS OF THE 2014 9TH IBERIAN CONFERENCE ON INFORMATION SYSTEMS AND TECHNOLOGIES (CISTI 2014), 2014,
  • [35] Approximate Spreading Activation for Efficient Knowledge Retrieval from Large Datasets
    Grinberg, Maurice
    Haltakov, Vladimir
    Stefanov, Hristo
    NEURAL NETS WIRN10, 2011, 226 : 326 - 333
  • [36] An Efficient Data Retrieval Parallel Reeb Graph Algorithm
    Hajij, Mustafa
    Rosen, Paul
    ALGORITHMS, 2020, 13 (10)
  • [37] Efficient information retrieval by dispatching mobile agents in parallel
    Qu, Wenyu
    Li, Keqiu
    Zhang, Chao
    MUE: 2008 INTERNATIONAL CONFERENCE ON MULTIMEDIA AND UBIQUITOUS ENGINEERING, PROCEEDINGS, 2008, : 73 - +
  • [38] Efficient Memory Partitioning for Parallel Data Access in Multidimensional Arrays
    Meng, Chenyue
    Yin, Shouyi
    Ouyang, Peng
    Liu, Leibo
    Wei, Shaojun
    2015 52ND ACM/EDAC/IEEE DESIGN AUTOMATION CONFERENCE (DAC), 2015,
  • [39] Parallel I/O: An introduction
    Womble, DE
    Greenberg, DS
    PARALLEL COMPUTING, 1997, 23 (4-5) : 403 - 417
  • [40] IOSPReD: I/O Specialized Packaging of Reduced Datasets and Data-Intensive Applications for Efficient Reproducibility
    Niddodi, Chaitra
    Gehani, Ashish
    Malik, Tanu
    Mohan, Sibin
    Rilee, Michael Lee
    IEEE ACCESS, 2023, 11 (1718-1731) : 1718 - 1731