Understanding HPC Application I/O Behavior Using System Level Statistics

被引:15
|
作者
Paul, Arnab K. [1 ]
Faaland, Olaf [2 ]
Moody, Adam [2 ]
Gonsiorowski, Elsa [2 ]
Mohror, Kathryn [2 ]
Butt, Ali R. [1 ]
机构
[1] Virginia Tech, Blacksburg, VA 24061 USA
[2] Lawrence Livermore Natl Lab, Livermore, CA 94550 USA
来源
2020 IEEE 27TH INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING, DATA, AND ANALYTICS (HIPC 2020) | 2020年
基金
美国国家科学基金会;
关键词
FILE-ACCESS;
D O I
10.1109/HiPC50609.2020.00034
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
The processor performance of high performance computing (HPC) systems is increasing at a much higher rate than storage performance. This imbalance leads to I/O performance bottlenecks in massively parallel HPC applications. Therefore, there is a need for improvements in storage and file system designs to meet the ever-growing I/O needs of HPC applications. Storage and file system designers require a deep understanding of how HPC application I/O behavior affects current storage system installations in order to improve them. In this work, we contribute to this understanding using application-agnostic file system statistics gathered on compute nodes as well as metadata and object storage file system servers. We analyze file system statistics of more than 4 million jobs over a period of three years on two systems at Lawrence Livermore National Laboratory that include a 15 PiB Lustre file system for storage. The results of our study add to the state-of-the-art in I/O understanding by providing insight into how general HPC workloads affect the performance of large-scale storage systems. Some key observations in our study show that reads and writes are evenly distributed across the storage system; applications which perform I/O, spread that I/O across similar to 78% of the minutes of their runtime on average; less than 22% of HPC users who submit write-intensive jobs perform efficient writes to the file system; and I/O contention seriously impacts I/O performance.
引用
收藏
页码:202 / 211
页数:10
相关论文
共 50 条
  • [41] Measuring I/O Performance of Lustre and the Temporary File System for Tradespace Applications on HPC Systems
    Kosta, Leonard
    Hunter, Harrison
    George, Glover
    Strelzoff, Andrew
    Matthews, Suzanne J.
    PROCEEDINGS OF THE SOUTHEAST CONFERENCE ACM SE'17, 2017, : 187 - 190
  • [42] Accelerating I/O performance of ZFS-based Lustre file system in HPC environment
    Jiwoo Bang
    Chungyong Kim
    Eun-Kyu Byun
    Hanul Sung
    Jaehwan Lee
    Hyeonsang Eom
    The Journal of Supercomputing, 2023, 79 : 7665 - 7691
  • [43] RZBENCH: Performance Evaluation of Current HPC Architectures Using Low-Level and Application Benchmarks
    Hager, Georg
    Stengel, Holger
    Zeiser, Thomas
    Wellein, Gerhard
    HIGH PERFORMANCE COMPUTING IN SCIENCE AND ENGINEERING, GARCH/MUNICH 2007, 2009, : 485 - 501
  • [44] A File Is Not a File: Understanding the I/O Behavior of Apple Desktop Applications
    Harter, Tyler
    Dragga, Chris
    Vaughn, Michael
    Arpaci-Dusseau, Andrea C.
    Arpaci-Dusseau, Remzi H.
    ACM TRANSACTIONS ON COMPUTER SYSTEMS, 2012, 30 (03):
  • [45] A File is Not a File: Understanding the I/O Behavior of Apple Desktop Applications
    Harter, Tyler
    Dragga, Chris
    Vaughn, Michael
    Arpaci-Dusseau, Andrea C.
    Arpaci-Dusseau, Remzi H.
    SOSP 11: PROCEEDINGS OF THE TWENTY-THIRD ACM SYMPOSIUM ON OPERATING SYSTEMS PRINCIPLES, 2011, : 71 - 83
  • [46] Understanding Flash-Based Storage I/O Behavior of Games
    Maruf, Adnan
    Yang, Zhengyu
    Davis, Bridget
    Kim, Daniel
    Wong, Jeffrey
    Durand, Matthew
    Bhimani, Janki
    2021 IEEE 14TH INTERNATIONAL CONFERENCE ON CLOUD COMPUTING (CLOUD 2021), 2021, : 521 - 526
  • [47] Footprinting Parallel I/O - Machine Learning to Classify Application's I/O Behavior
    Betke, Eugen
    Kunkel, Julian
    HIGH PERFORMANCE COMPUTING: ISC HIGH PERFORMANCE 2019 INTERNATIONAL WORKSHOPS, 2020, 11887 : 214 - 226
  • [48] Gauge: An Interactive Data-Driven Visualization Tool for HPC Application I/O Performance Analysis
    del Rosario, Eliakin
    Currier, Mikaela
    Isakov, Mihailo
    Madireddy, Sandeep
    Balaprakash, Prasanna
    Carns, Philip
    Ross, Robert B.
    Harms, Kevin
    Snyder, Shane
    Kinsy, Michel A.
    PROCEEDINGS OF 2020 IEEE/ACM FIFTH INTERNATIONAL PARALLEL DATA SYSTEMS WORKSHOP (PDSW 2020), 2020, : 15 - 21
  • [49] Comparison of Level Shifter Architectures: Application to I/O cell
    Petrica, Radu-Valentin
    Dobre, Mihaela-Daniela
    Coll, Philippe
    Draghici, Florin
    Brezeanu, Gheorghe
    CAS 2018 PROCEEDINGS: 2018 INTERNATIONAL SEMICONDUCTOR CONFERENCE, 2018, : 209 - 212
  • [50] Parallel execution of I/O system and application functionality
    Enblom, L
    PDPTA'03: PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON PARALLEL AND DISTRIBUTED PROCESSING TECHNIQUES AND APPLICATIONS, VOLS 1-4, 2003, : 786 - 792