Profiling the Usage of an Extreme-Scale Archival Storage System

被引:1
|
作者
Sim, Hyogi [1 ]
Vazhkudai, Sudharshan S. [1 ]
机构
[1] Oak Ridge Natl Lab, Oak Ridge, TN 37830 USA
关键词
D O I
10.1109/MASCOTS.2019.00050
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Profiling the archival storage system in scientific computing environments has received much less attention compared to the parallel file system, but is equally important since it stores the final data products safely, for a long duration. In this paper, we analyze eight years worth of data transfer logs for accessing the archival file system (HPSS) in the Oak Ridge Leadership Computing Facility (OLCF), which has been hosting the world's largest supercomputers and file systems. Our analysis encompasses about 135 million data transfer activities to the 80 PB High Performance Storage System (HPSS), between 2010 and 2017. We analyze the logs from several dimensions, including studying the workload characteristics (e.g., access patterns, frequency of accesses and temporal behavior), file system characteristics (e.g., directory depth, file system scaling trends, file types), and scientific user behavior (e.g., domain-specific usage and organization). Based on the analysis, we derive insights into the future evolution of the archive in terms of provisioning, desired features and functionality from the archive software, role and right sizing of the archive tiers, quota management, and the importance of smart and efficient metadata and storage management. We believe our study will prove useful for both operating current archival storage and the better provisioning of future systems.
引用
收藏
页码:410 / 422
页数:13
相关论文
共 50 条
  • [1] Toward an extreme-scale electronic structure system
    Vallejo, Jorge L. Galvez
    Snowdon, Calum
    Stocks, Ryan
    Kazemian, Fazeleh
    Yu, Fiona Chuo Yan
    Seidl, Christopher
    Seeger, Zoe
    Alkan, Melisa
    Poole, David
    Westheimer, Bryce M.
    Basha, Mehaboob
    De La Pierre, Marco
    Rendell, Alistair
    Izgorodina, Ekaterina I.
    Gordon, Mark S.
    Barca, Giuseppe M. J.
    JOURNAL OF CHEMICAL PHYSICS, 2023, 159 (04):
  • [2] Extreme-scale computer architecture
    Josep Torrellas
    NationalScienceReview, 2016, 3 (01) : 19 - 23
  • [3] Visualizing extreme-scale data
    Ma, Kwan-Liu
    VISUALIZATION AND DATA ANALYSIS 2008, 2008, 6809
  • [4] ARCHITECTURES FOR EXTREME-SCALE COMPUTING
    Torrellas, Josep
    COMPUTER, 2009, 42 (11) : 28 - 35
  • [5] Extreme-scale computer architecture
    Torrellas, Josep
    NATIONAL SCIENCE REVIEW, 2016, 3 (01) : 19 - 23
  • [6] Architecting a Flash-Based Storage System for Low-Cost Inference of Extreme-Scale DNNs
    Jin, Yunho
    Kim, Shine
    Ham, Tae Jun
    Lee, Jae W.
    IEEE TRANSACTIONS ON COMPUTERS, 2022, 71 (12) : 3153 - 3164
  • [7] Algorithm development for extreme-scale computing
    Jiachang Sun
    Chao Yang
    Xiao-Chuan Cai
    National Science Review, 2016, 3 (01) : 26 - 27
  • [8] Improving the Performance of the Extreme-scale Simulator
    Engelmann, Christian
    Naughton, Thomas
    2014 IEEE/ACM 18TH INTERNATIONAL SYMPOSIUM ON DISTRIBUTED SIMULATION AND REAL TIME APPLICATIONS (DS-RT 2014), 2014, : 198 - 207
  • [9] Toward Extreme-Scale Processor Chips
    Torrellas, Josep
    PROCEEDINGS OF 2016 IEEE 23RD INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING (HIPC), 2016, : 290 - 290
  • [10] Extreme-Scale Visual Analytics Introduction
    Wong, Pak Chung
    Shen, Han-Wei
    Pascucci, Valerio
    IEEE COMPUTER GRAPHICS AND APPLICATIONS, 2012, 32 (04) : 23 - 25