Improving I/O Throughput of Scientific Applications using Transparent Parallel Compression

被引:10
|
作者
Bicer, Tekin [1 ]
Yin, Jian [2 ]
Agrawal, Gagan [1 ]
机构
[1] Ohio State Univ, Comp Sci & Engn, Columbus, OH 43210 USA
[2] Pacific NW Natl Lab, Richland, WA 99352 USA
关键词
D O I
10.1109/CCGrid.2014.112
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Increasing number of cores in parallel computer systems are allowing scientific simulations to be executed with increasing spatial and temporal granularity. However, this also implies that increasing larger-sized datasets need to be output, stored, managed, and then visualized and/or analyzed using a variety of methods. In examining the possibility of using compression to accelerate all of these steps, we focus on two important questions: "Can compression help save time when data is output from, or input into, a parallel program?", and "How can a scientist's effort in using compression with a parallel program be minimized?". We focus on PnetCDF, and show how transparent compression can be supported, thus allowing an existing simulation program to start outputting and storing data in a compressed fashion, and similarly, allow a data analysis application to read compressed data. We address challenges in supporting compression when parallel writes are being performed. In our experiments, we first analyze the effects of using compression with microbenchmarks, and then, continue our evaluation using a scientific simulation application, and two data analysis applications. While we obtain up to a factor of 2 improvement in performance for microbenchmarks, the execution time of simulation application is improved up to 22%, and the maximum speedup of data analysis applications is 1.83 (with an average speedup of 1.36).
引用
收藏
页码:1 / 10
页数:10
相关论文
共 50 条
  • [31] Optimal parallel I/O using replication
    Tosun, AS
    Ferhatosmanoglu, H
    2002 INTERNATIONAL CONFERENCE ON PARALLEL PROCESSING, PROCEEDINGS OF THE WORKSHOPS, 2002, : 506 - 513
  • [32] A case study for scientific I/O: Improving the FLASH astrophysics code
    Latham, Rob
    Daley, Chris
    Liao, Wei-Keng
    Gao, Kui
    Ross, Rob
    Dubey, Anshu
    Choudhary, Alok
    Computational Science and Discovery, 2012, 5 (01)
  • [33] Improving Scalability of Database Systems by Reshaping User Parallel I/O
    Li, Ning
    Jiang, Hong
    Che, Hao
    Wang, Zhijun
    Nguyen, Minh Q.
    PROCEEDINGS OF THE SEVENTEENTH EUROPEAN CONFERENCE ON COMPUTER SYSTEMS (EUROSYS '22), 2022, : 592 - 609
  • [34] Improving MPI-HMMER's Scalability With Parallel I/O
    Walters, John Paul
    Darole, Rohan
    Chaudhary, Vipin
    2009 IEEE INTERNATIONAL SYMPOSIUM ON PARALLEL & DISTRIBUTED PROCESSING, VOLS 1-5, 2009, : 1022 - 1032
  • [35] Models of parallel applications with large computation and I/O requirements
    Rosti, E
    Serazzi, G
    Smirni, E
    Squillante, MS
    IEEE TRANSACTIONS ON SOFTWARE ENGINEERING, 2002, 28 (03) : 286 - 307
  • [36] Multi-Threaded Parallel I/O for OpenMP Applications
    Kshitij Mehta
    Edgar Gabriel
    International Journal of Parallel Programming, 2015, 43 : 286 - 309
  • [37] Multi-Threaded Parallel I/O for OpenMP Applications
    Mehta, Kshitij
    Gabriel, Edgar
    INTERNATIONAL JOURNAL OF PARALLEL PROGRAMMING, 2015, 43 (02) : 286 - 309
  • [38] Overlay striping and optimal parallel I/O for modern applications
    Triantafillou, P
    Faloutsos, C
    PARALLEL COMPUTING, 1998, 24 (01) : 21 - 43
  • [39] Performance model of I/O-intensive parallel applications
    Chen, Yongran
    Qi, Xingyun
    Dou, Wenhua
    Jisuanji Yanjiu yu Fazhan/Computer Research and Development, 2007, 44 (04): : 707 - 713
  • [40] AN INVESTIGATION OF SCALABLE SIMD I/O TECHNIQUES WITH APPLICATION TO PARALLEL JPEG COMPRESSION
    COOK, GW
    DELP, EJ
    JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 1995, 30 (02) : 111 - 128