ExaHDF5: Delivering Efficient Parallel I/O on Exascale Computing Systems

被引:31
|
作者
Byna, Suren [1 ]
Breitenfeld, M. Scot [2 ]
Dong, Bin [1 ]
Koziol, Quincey [1 ]
Pourmal, Elena [2 ]
Robinson, Dana [2 ]
Soumagne, Jerome [2 ]
Tang, Houjun [1 ]
Vishwanath, Venkatram [3 ]
Warren, Richard [2 ]
机构
[1] Lawrence Berkeley Natl Lab, Berkeley, CA 94597 USA
[2] HDF Grp, Champaign, IL 61820 USA
[3] Argonne Natl Lab, Lemont, IL 60439 USA
关键词
parallel I; O; Hierarchical Data Format version 5 (HDF5); I; O performance; virtual object layer; HDF5; optimizations;
D O I
10.1007/s11390-020-9822-9
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Scientific applications at exascale generate and analyze massive amounts of data. A critical requirement of these applications is the capability to access and manage this data efficiently on exascale systems. Parallel I/O, the key technology enables moving data between compute nodes and storage, faces monumental challenges from new applications, memory, and storage architectures considered in the designs of exascale systems. As the storage hierarchy is expanding to include node-local persistent memory, burst buffers, etc., as well as disk-based storage, data movement among these layers must be efficient. Parallel I/O libraries of the future should be capable of handling file sizes of many terabytes and beyond. In this paper, we describe new capabilities we have developed in Hierarchical Data Format version 5 (HDF5), the most popular parallel I/O library for scientific applications. HDF5 is one of the most used libraries at the leadership computing facilities for performing parallel I/O on existing HPC systems. The state-of-the-art features we describe include: Virtual Object Layer (VOL), Data Elevator, asynchronous I/O, full-featured single-writer and multiple-reader (Full SWMR), and parallel querying. In this paper, we introduce these features, their implementations, and the performance and feature benefits to applications and other libraries.
引用
收藏
页码:145 / 160
页数:16
相关论文
共 50 条
  • [1] ExaHDF5: Delivering Efficient Parallel I/O on Exascale Computing Systems
    Suren Byna
    M. Scot Breitenfeld
    Bin Dong
    Quincey Koziol
    Elena Pourmal
    Dana Robinson
    Jerome Soumagne
    Houjun Tang
    Venkatram Vishwanath
    Richard Warren
    Journal of Computer Science and Technology, 2020, 35 : 145 - 160
  • [2] HDF5 in the exascale era: Delivering efficient and scalable parallel I/O for exascale applications
    Breitenfeld, M. Scot
    Tang, Houjun
    Zheng, Huihuo
    Henderson, Jordan
    Byna, Suren
    INTERNATIONAL JOURNAL OF HIGH PERFORMANCE COMPUTING APPLICATIONS, 2025, 39 (01): : 65 - 78
  • [3] Toward Exascale Computing Systems: An Energy Efficient Massive Parallel Computational Model
    Ashraf, Muhammad Usman
    Eassa, Fathy Alburaei
    Albeshri, Aiiad Ahmad
    Algarni, Abdullah
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2018, 9 (02) : 118 - 126
  • [4] Preparing 25Gbps Electrical I/O for Exascale Computing Systems
    Shan, Lei
    Kwark, Young
    Rimolo-Donadio, Renato
    Baks, Christian
    Gaynes, Michael
    Chainer, Timothy
    2014 IEEE 64TH ELECTRONIC COMPONENTS AND TECHNOLOGY CONFERENCE (ECTC), 2014, : 1955 - 1958
  • [5] Software-defined QoS for I/O in exascale computing
    Yusheng Hua
    Xuanhua Shi
    Hai Jin
    Wei Liu
    Yan Jiang
    Yong Chen
    Ligang He
    CCF Transactions on High Performance Computing, 2019, 1 : 49 - 59
  • [6] Software-defined QoS for I/O in exascale computing
    Hua, Yusheng
    Shi, Xuanhua
    Jin, Hai
    Liu, Wei
    Jiang, Yan
    Chen, Yong
    He, Ligang
    CCF TRANSACTIONS ON HIGH PERFORMANCE COMPUTING, 2019, 1 (01) : 49 - 59
  • [7] Parallel I/O scheduling in multiprogrammed cluster computing systems
    Abawajy, JH
    COMPUTATIONAL SCIENCE - ICCS 2003, PT IV, PROCEEDINGS, 2003, 2660 : 223 - 229
  • [8] On the energy footprint of I/O management in Exascale HPC systems
    Dorier, Matthieu
    Yildiz, Orcun
    Ibrahim, Shadi
    Orgerie, Anne-Cecile
    Antoniu, Gabriel
    FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2016, 62 : 17 - 28
  • [9] Performance analysis of parallel I/O scheduling approaches on cluster computing systems
    Abawajy, JH
    CCGRID 2003: 3RD IEEE/ACM INTERNATIONAL SYMPOSIUM ON CLUSTER COMPUTING AND THE GRID, PROCEEDINGS, 2003, : 724 - 729
  • [10] Efficient parallel implementation of reservoir computing systems
    Alomar, M. L.
    Skibinsky-Gitlin, Erik S.
    Frasser, Christiam F.
    Canals, Vincent
    Isern, Eugeni
    Roca, Miquel
    Rossello, Josep L.
    NEURAL COMPUTING & APPLICATIONS, 2020, 32 (07): : 2299 - 2313