Towards I/O analysis of HPC systems and a generic architecture to collect access patterns

被引:11
|
作者
Wiedemann, Marc C. [1 ,2 ]
Kunkel, Julian M. [2 ]
Zimmer, Michaela [2 ]
Ludwig, Thomas [2 ]
Resch, Michael [3 ]
Boenisch, Thomas [3 ]
Wang, Xuan [3 ]
Chut, Andriy [3 ]
Aguilera, Alvaro [4 ]
Nagel, Wolfgang E. [4 ]
Kluge, Michael [4 ]
Mickler, Holger [4 ]
机构
[1] Bundesstr 45a, D-20146 Hamburg, Germany
[2] Univ Hamburg, Deutsch Klimarechenzentrum GmbH, Hamburg, Germany
[3] Univ Stuttgart, High Performance Comp Ctr Stuttgart HLRS, Stuttgart, Germany
[4] Tech Univ Dresden, Zentrum Informationsdienste & Hochleistungsrechne, Dresden, Germany
来源
关键词
I/O analysis; I/O path; Causality tree;
D O I
10.1007/s00450-012-0221-5
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
In high-performance computing applications, a high-level I/O call will trigger activities on a multitude of hardware components. These are massively parallel systems supported by huge storage systems and internal software layers. Their complex interplay currently makes it impossible to identify the causes for and the locations of I/O bottlenecks. Existing tools indicate when a bottleneck occurs but provide little guidance in identifying the cause or improving the situation. We have thus initiated Scalable I/O for Extreme Performance to find solutions for this problem. To achieve this goal in SIOX, we will build a system to record access information on all layers and components, to recognize access patterns, and to characterize the I/O system. The system will ultimately be able to recognize the causes of the I/O bottlenecks and propose optimizations for the I/O middleware that can improve I/O performance, such as throughput rate and latency. Furthermore, the SIOX system will be able to support decision making while planning new I/O systems. In this paper, we introduce the SIOX system and describe its current status: We first outline our approach for collecting the required access information. We then provide the architectural concept, the methods for reconstructing the I/O path and an excerpt of the interface for data collection. This paper focuses especially on the architecture, which collects and combines the relevant access information along the I/O path, and which is responsible for the efficient transfer of this information. An abstract modelling approach allows us to better understand the complexity of the analysis of the I/O activities on parallel computing systems, and an abstract interface allows us to adapt the SIOX system to various HPC file systems.
引用
收藏
页码:241 / 251
页数:11
相关论文
共 50 条
  • [31] Learning I/O Access Patterns to Improve Prefetching in SSDs
    Chakraborttii, Chandranil
    Litz, Heiner
    MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES: APPLIED DATA SCIENCE TRACK, ECML PKDD 2020, PT IV, 2021, 12460 : 427 - 443
  • [32] Redundancy Analysis and Elimination on Access Patterns of the Windows Applications Based on I/O Log Data
    Lee, Jun-Ha
    Kwon, Hyuk-Yoon
    IEEE ACCESS, 2020, 8 : 40640 - 40655
  • [33] Towards a generic Identity and Access Assurance model by component analysis - a conceptual review
    Damon, Ferdinand
    Coetzee, Marijke
    2013 ENTERPRISE SYSTEMS CONFERENCE (ES), 2013,
  • [34] New I/O architecture in process control systems
    Oda, S
    Torigoe, K
    SICE 2002: PROCEEDINGS OF THE 41ST SICE ANNUAL CONFERENCE, VOLS 1-5, 2002, : 3251 - 3254
  • [35] Mitigate I/O Access Pattern Divergence With Heterogeneous Architecture In HDFS
    Shi, HuaShen
    PROCEEDINGS OF 2015 4TH INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND NETWORK TECHNOLOGY (ICCSNT 2015), 2015, : 300 - 304
  • [36] Memory-Conscious Collective I/O for Extreme-scale HPC Systems
    Lu, Yin
    Chen, Yong
    Thakur, Rajeev
    Zhuang, Yu
    2012 SC COMPANION: HIGH PERFORMANCE COMPUTING, NETWORKING, STORAGE AND ANALYSIS (SCC), 2012, : 1361 - +
  • [37] Design and implementation of I/O performance prediction scheme on HPC systems through large-scale log analysis
    Sunggon Kim
    Alex Sim
    Kesheng Wu
    Suren Byna
    Yongseok Son
    Journal of Big Data, 10
  • [38] Design and implementation of I/O performance prediction scheme on HPC systems through large-scale log analysis
    Kim, Sunggon
    Sim, Alex
    Wu, Kesheng
    Byna, Suren
    Son, Yongseok
    JOURNAL OF BIG DATA, 2023, 10 (01)
  • [39] Memory-Conscious Collective I/O for Extreme-Scale HPC Systems
    Lu, Yin
    Chen, Yong
    Thakur, Rajeev
    Zhuang, Yu
    2012 SC COMPANION: HIGH PERFORMANCE COMPUTING, NETWORKING, STORAGE AND ANALYSIS (SCC), 2012, : 1360 - 1360
  • [40] On the Root Causes of Cross-Application I/O Interference in HPC Storage Systems
    Yildiz, Orcun
    Dorier, Matthieu
    Ibrahim, Shadi
    Ross, Rob
    Antoniu, Gabriel
    2016 IEEE 30TH INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM (IPDPS 2016), 2016, : 750 - 759