ExaHDF5: Delivering Efficient Parallel I/O on Exascale Computing Systems

被引:31
|
作者
Byna, Suren [1 ]
Breitenfeld, M. Scot [2 ]
Dong, Bin [1 ]
Koziol, Quincey [1 ]
Pourmal, Elena [2 ]
Robinson, Dana [2 ]
Soumagne, Jerome [2 ]
Tang, Houjun [1 ]
Vishwanath, Venkatram [3 ]
Warren, Richard [2 ]
机构
[1] Lawrence Berkeley Natl Lab, Berkeley, CA 94597 USA
[2] HDF Grp, Champaign, IL 61820 USA
[3] Argonne Natl Lab, Lemont, IL 60439 USA
关键词
parallel I; O; Hierarchical Data Format version 5 (HDF5); I; O performance; virtual object layer; HDF5; optimizations;
D O I
10.1007/s11390-020-9822-9
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Scientific applications at exascale generate and analyze massive amounts of data. A critical requirement of these applications is the capability to access and manage this data efficiently on exascale systems. Parallel I/O, the key technology enables moving data between compute nodes and storage, faces monumental challenges from new applications, memory, and storage architectures considered in the designs of exascale systems. As the storage hierarchy is expanding to include node-local persistent memory, burst buffers, etc., as well as disk-based storage, data movement among these layers must be efficient. Parallel I/O libraries of the future should be capable of handling file sizes of many terabytes and beyond. In this paper, we describe new capabilities we have developed in Hierarchical Data Format version 5 (HDF5), the most popular parallel I/O library for scientific applications. HDF5 is one of the most used libraries at the leadership computing facilities for performing parallel I/O on existing HPC systems. The state-of-the-art features we describe include: Virtual Object Layer (VOL), Data Elevator, asynchronous I/O, full-featured single-writer and multiple-reader (Full SWMR), and parallel querying. In this paper, we introduce these features, their implementations, and the performance and feature benefits to applications and other libraries.
引用
收藏
页码:145 / 160
页数:16
相关论文
共 50 条
  • [41] I/O-Efficient Algorithms for Computing Contours on a Terrain
    Agarwal, Pankaj K.
    Arge, Lars
    Molhave, Thomas
    Sadri, Bardia
    PROCEEDINGS OF THE TWENTY-FOURTH ANNUAL SYMPOSIUM ON COMPUTATIONAL GEOMETRY (SGG'08), 2008, : 129 - 138
  • [42] Efficient parallel I/O in community atmosphere model (CAM)
    Tseng, Yu-Heng
    Ding, Chris
    INTERNATIONAL JOURNAL OF HIGH PERFORMANCE COMPUTING APPLICATIONS, 2008, 22 (02): : 206 - 218
  • [43] PARALLEL I/O SYSTEMS - GUEST EDITORS INTRODUCTION
    CHOUDHARY, A
    JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 1993, 17 (1-2) : 1 - 3
  • [44] Parallel I/O for distributed systems: Issues and implementation
    Sunderam, VS
    Moyer, SA
    FUTURE GENERATION COMPUTER SYSTEMS, 1996, 12 (01) : 25 - 38
  • [45] Evaluating Asynchronous Parallel I/O on HPC Systems
    Ravi, John
    Byna, Suren
    Koziol, Quincey
    Tang, Houjun
    Becchi, Michela
    2023 IEEE INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM, IPDPS, 2023, : 211 - 221
  • [46] MASSIVELY PARALLEL I/O FOR PARTITIONED SOLVER SYSTEMS
    Liu, Ning
    Fu, Jing
    Carothers, Christopher D.
    Sahni, Onkar
    Jansen, Kenneth E.
    Shephard, Mark S.
    PARALLEL PROCESSING LETTERS, 2010, 20 (04) : 377 - 395
  • [47] Efficient parallel I/O scheduling in the presence of data duplication
    Liu, PF
    Wang, DW
    Wu, JJ
    2003 INTERNATIONAL CONFERENCE ON PARALLEL PROCESSING, PROCEEDINGS, 2003, : 231 - 238
  • [48] UFCR: An efficient I/O method for parallel file system
    Huo, Yanmei
    Ju, Jiubin
    Hu, Liang
    SEVENTH INTERNATIONAL CONFERENCE ON PARALLEL AND DISTRIBUTED COMPUTING, APPLICATIONS AND TECHNOLOGIES, PROCEEDINGS, 2006, : 223 - +
  • [49] Recorder 2.0: Efficient Parallel I/O Tracing and Analysis
    Wang, Chen
    Sun, Jinghan
    Snir, Marc
    Mohror, Kathryn
    Gonsiorowski, Elsa
    2020 IEEE 34TH INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM WORKSHOPS (IPDPSW 2020), 2020, : 1052 - 1059
  • [50] Efficient retrieval of multidimensional datasets through parallel I/O
    Prabhakar, S
    Abdel-Ghaffar, K
    Agrawal, D
    El Abbadi, A
    FIFTH INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING, PROCEEDINGS, 1998, : 375 - 382