Using Pattern-Models to Guide SSD Deployment for Big Data Applications in HPC Systems

被引:0
|
作者
Chen, Junjie [1 ]
Roth, Philip C. [2 ]
Chen, Yong [1 ,3 ]
机构
[1] Texas Tech Univ, Dept Comp Sci, Lubbock, TX 79409 USA
[2] Oak Ridge Natl Lab, Comp Sci & Math Div, Oak Ridge, TN 37830 USA
[3] Texas Tech Univ, Dept Comp Sci, Lubbock, TX 79409 USA
基金
美国国家科学基金会;
关键词
Big Data; Solid State Drives; Hybrid Storage Systems; High Performance Computing; Exascale Systems;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Flash-memory based Solid State Drives (SSDs) embrace higher performance and lower power consumption compared to traditional storage devices (HDDs). These benefits are needed in HPC systems, especially with the growing demand of supporting Big Data applications. In this paper, we study placement and deployment strategies of SSDs in HPC systems to maximize the performance improvement, given a practical fixed hardware budget constraint. We propose a pattern-model approach to guide SSD deployment for HPC systems through two steps; characterizing workload and mapping deployment strategy. The first step is responsible for characterizing the access patterns of the workload and the second step contributes the actual deployment recommendation for Parallel File System (PFS) configuration combining with an analytical model. We have carried out initial experimental tests and the results confirmed that the proposed approach can guide placement of SSDs in HPC systems for accelerating data accesses. Our research will be helpful in guiding designs and developments for Big Data applications in current and projected HPC systems including exascale systems.
引用
收藏
页数:6
相关论文
共 50 条
  • [21] Distribution, Data, Deployment Software Architecture Convergence in Big Data Systems
    Gorton, Ian
    Klein, John
    IEEE SOFTWARE, 2015, 32 (03) : 78 - 85
  • [23] Eley: On the Effectiveness of Burst Buffers for Big Data Processing in HPC systems
    Yildiz, Orcun
    Zhou, Amelie Chi
    Ibrahim, Shadi
    2017 IEEE INTERNATIONAL CONFERENCE ON CLUSTER COMPUTING (CLUSTER), 2017, : 87 - 91
  • [24] 2013 Workshop on Middleware for HPC and Big Data Systems (MHPC 2013)
    Alexander, Michael
    Zanetti, Gianluigi
    Nanos, Anastassios
    Tao, Jie
    Wang, Lizhe
    EURO-PAR 2013: PARALLEL PROCESSING WORKSHOPS, 2014, 8374 : 384 - 384
  • [25] LiU: Hiding Disk Access Latency for HPC Applications with a New SSD-Enabled Data Layout
    Huang, Dachuan
    Zhang, Xuechen
    Shi, Wei
    Zheng, Mai
    Jiang, Song
    Qin, Feng
    2013 IEEE 21ST INTERNATIONAL SYMPOSIUM ON MODELING, ANALYSIS & SIMULATION OF COMPUTER AND TELECOMMUNICATION SYSTEMS (MASCOTS 2013), 2013, : 111 - +
  • [26] A Survey of Power and Energy Predictive Models in HPC Systems and Applications
    O'Brien, Kenneth
    Pietri, Ilia
    Reddy, Ravi
    Lastovetsky, Alexey
    Sakellariou, Rizos
    ACM COMPUTING SURVEYS, 2017, 50 (03)
  • [27] Big Data in the Clinic Using Data to Guide Practice
    Krupinski, Elizabeth A.
    2016 INTERNATIONAL CONFERENCE ON COLLABORATION TECHNOLOGIES AND SYSTEMS (CTS), 2016, : 518 - 519
  • [28] Automated Spark Clusters Deployment for Big Data with Standalone Applications Integration
    Fernandez, A. M.
    Torres, J. F.
    Troncoso, A.
    Martinez-Alvarez, F.
    ADVANCES IN ARTIFICIAL INTELLIGENCE, CAEPIA 2016, 2016, 9868 : 150 - 159
  • [29] Autonomic deployment decision making for big data analytics applications in the cloud
    Qinghua Lu
    Zheng Li
    Weishan Zhang
    Laurence T. Yang
    Soft Computing, 2017, 21 : 4501 - 4512
  • [30] A Big Data Architecture for Automotive Applications: PSA Group Deployment Experience
    Haroun, Amir
    Mostefaoui, Ahmed
    Dessables, Francois
    2017 17TH IEEE/ACM INTERNATIONAL SYMPOSIUM ON CLUSTER, CLOUD AND GRID COMPUTING (CCGRID), 2017, : 921 - 928