ExaHDF5: Delivering Efficient Parallel I/O on Exascale Computing Systems

被引:31
|
作者
Byna, Suren [1 ]
Breitenfeld, M. Scot [2 ]
Dong, Bin [1 ]
Koziol, Quincey [1 ]
Pourmal, Elena [2 ]
Robinson, Dana [2 ]
Soumagne, Jerome [2 ]
Tang, Houjun [1 ]
Vishwanath, Venkatram [3 ]
Warren, Richard [2 ]
机构
[1] Lawrence Berkeley Natl Lab, Berkeley, CA 94597 USA
[2] HDF Grp, Champaign, IL 61820 USA
[3] Argonne Natl Lab, Lemont, IL 60439 USA
关键词
parallel I; O; Hierarchical Data Format version 5 (HDF5); I; O performance; virtual object layer; HDF5; optimizations;
D O I
10.1007/s11390-020-9822-9
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Scientific applications at exascale generate and analyze massive amounts of data. A critical requirement of these applications is the capability to access and manage this data efficiently on exascale systems. Parallel I/O, the key technology enables moving data between compute nodes and storage, faces monumental challenges from new applications, memory, and storage architectures considered in the designs of exascale systems. As the storage hierarchy is expanding to include node-local persistent memory, burst buffers, etc., as well as disk-based storage, data movement among these layers must be efficient. Parallel I/O libraries of the future should be capable of handling file sizes of many terabytes and beyond. In this paper, we describe new capabilities we have developed in Hierarchical Data Format version 5 (HDF5), the most popular parallel I/O library for scientific applications. HDF5 is one of the most used libraries at the leadership computing facilities for performing parallel I/O on existing HPC systems. The state-of-the-art features we describe include: Virtual Object Layer (VOL), Data Elevator, asynchronous I/O, full-featured single-writer and multiple-reader (Full SWMR), and parallel querying. In this paper, we introduce these features, their implementations, and the performance and feature benefits to applications and other libraries.
引用
收藏
页码:145 / 160
页数:16
相关论文
共 50 条
  • [31] Efficient parallel I/O on SCI connected clusters
    Worringen, J
    CLUSTER 2000: IEEE INTERNATIONAL CONFERENCE ON CLUSTER COMPUTING, PROCEEDINGS, 2000, : 371 - 372
  • [32] Efficient distributed algorithms for parallel I/O scheduling
    Wu, JJ
    Lin, YF
    Liu, PF
    11TH INTERNATIONAL CONFERENCE ON PARALLEL AND DISTRIBUTED SYSTEMS, VOL I, PROCEEDINGS, 2005, : 460 - 466
  • [33] Iteration Based Collective I/O Strategy for Parallel I/O Systems
    Wang, Zhixiang
    Shi, Xuanhua
    Jin, Hai
    Wu, Song
    Chen, Yong
    2014 14TH IEEE/ACM INTERNATIONAL SYMPOSIUM ON CLUSTER, CLOUD AND GRID COMPUTING (CCGRID), 2014, : 287 - 294
  • [34] Stampi-I/O: A flexible parallel-I/O library for heterogeneous computing environment
    Tsujita, Y
    Imamura, T
    Takemiya, H
    Yamagishi, N
    RECENT ADVANCES IN PARALLEL VIRTUAL MACHINE AND MESSAGE PASSING INTERFACE, PROCEEDINGS, 2002, 2474 : 288 - 295
  • [35] The Minos Computing Library: Efficient Parallel Programming for Extremely Heterogeneous Systems
    Gioiosa, Roberto
    Mutlu, Burcu O.
    Lee, Seyong
    Vetter, Jeffrey S.
    Picierro, Giulio
    Cesati, Marco
    GPGPU'20: PROCEEDINGS OF THE 13TH ANNUAL WORKSHOP ON GENERAL PURPOSE PROCESSING USING GRAPHICS PROCESSING UNIT (GPU), 2020, : 1 - 10
  • [36] Applying Selectively Parallel I/O Compression to Parallel Storage Systems
    Filgueira, Rosa
    Atkinson, Malcolm
    Tanimura, Yusuke
    Kojima, Isao
    EURO-PAR 2014 PARALLEL PROCESSING, 2014, 8632 : 282 - 293
  • [37] A Checkpoint of Research on Parallel I/O for High-Performance Computing
    Boito, Francieli Zanon
    Inacio, Eduardo C.
    Bez, Jean Luca
    Navaux, Philippe O. A.
    Dantas, Mario A. R.
    Denneulin, Yves
    ACM COMPUTING SURVEYS, 2018, 51 (02)
  • [38] A study of real world I/O performance in parallel scientific computing
    Kimpe, Dries
    Lani, Andrea
    Quintino, Tiago
    Vandewalle, Stefan
    Poedts, Stefaan
    Deconinck, Herman
    APPLIED PARALLEL COMPUTING: STATE OF THE ART IN SCIENTIFIC COMPUTING, 2007, 4699 : 871 - +
  • [39] Computing with SN P systems with I/O mode
    Adorna, Henry N.
    JOURNAL OF MEMBRANE COMPUTING, 2020, 2 (04) : 230 - 245
  • [40] Computing with SN P systems with I/O mode
    Henry N. Adorna
    Journal of Membrane Computing, 2020, 2 : 230 - 245