Profiling the Usage of an Extreme-Scale Archival Storage System

Cited by: 1
Authors
Sim, Hyogi [1]
Vazhkudai, Sudharshan S. [1]
Affiliations
[1] Oak Ridge Natl Lab, Oak Ridge, TN 37830 USA
DOI
10.1109/MASCOTS.2019.00050
Chinese Library Classification (CLC) number
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract
Profiling the archival storage system in scientific computing environments has received much less attention than profiling the parallel file system, but it is equally important because the archive stores final data products safely for long durations. In this paper, we analyze eight years' worth of data transfer logs for accesses to the archival file system, the High Performance Storage System (HPSS), at the Oak Ridge Leadership Computing Facility (OLCF), which has hosted some of the world's largest supercomputers and file systems. Our analysis covers about 135 million data transfer activities to the 80 PB HPSS between 2010 and 2017. We analyze the logs along several dimensions: workload characteristics (e.g., access patterns, frequency of accesses, and temporal behavior), file system characteristics (e.g., directory depth, file system scaling trends, and file types), and scientific user behavior (e.g., domain-specific usage and organization). Based on the analysis, we derive insights into the future evolution of the archive in terms of provisioning, desired features and functionality of the archive software, the role and right-sizing of the archive tiers, quota management, and the importance of smart and efficient metadata and storage management. We believe our study will prove useful both for operating current archival storage and for better provisioning of future systems.
Pages: 410-422
Page count: 13