Performance Analysis of Emerging Data Analytics and HPC Workloads

被引:0
|
作者
Daley, Christopher S. [1 ]
Dosanjh, Sudip [1 ]
Prabhat [1 ]
Wright, Nicholas J. [1 ]
机构
[1] Lawrence Berkeley Natl Lab, Berkeley, CA 94720 USA
关键词
Workload characteristics; data analytics; big data; high performance computing; SEXTRACTOR;
D O I
10.1145/3149393.3149400
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Supercomputers are increasingly being used to run a data analytics workload in addition to a traditional simulation science workload. This mixed workload must be rigorously characterized to ensure that appropriately balanced machines are deployed. In this paper we analyze a suite of applications representing the simulation science and data workload at the NERSC supercomputing center. We show how time is spent in application compute, library compute, communication and I/O, and present application performance on both the Intel Xeon and Intel Xeon-Phi partitions of the Cori supercomputer. We find commonality in the libraries used, I/O motifs and methods of parallelism, and obtain similar node-to-node performance for the base application configurations. We demonstrate that features of the Intel Xeon-Phi node architecture and a Burst Buffer can improve application performance, providing evidence that an exascale-era energy-efficient platform can support a mixed workload.
引用
收藏
页码:43 / 48
页数:6
相关论文
共 50 条
  • [21] Understanding Data Analytics Workloads on Intel®Xeon Phi™
    Xie, Biwei
    Liu, Xu
    Mckee, Sally A.
    Zhan, Jianfeng
    Jia, Zhen
    Wang, Lei
    Zhang, Lixin
    PROCEEDINGS OF 2016 IEEE 18TH INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING AND COMMUNICATIONS; IEEE 14TH INTERNATIONAL CONFERENCE ON SMART CITY; IEEE 2ND INTERNATIONAL CONFERENCE ON DATA SCIENCE AND SYSTEMS (HPCC/SMARTCITY/DSS), 2016, : 206 - 215
  • [22] Understanding Big Data Analytics Workloads on Modern Processors
    Jia, Zhen
    Zhan, Jianfeng
    Wang, Lei
    Luo, Chunjie
    Gao, Wanling
    Jin, Yi
    Han, Rui
    Zhang, Lixin
    IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2017, 28 (06) : 1797 - 1810
  • [23] Characterizing Data Analytics Workloads on Intel Xeon Phi
    Xie, Biwei
    Liu, Xu
    Zhan, Jianfeng
    Jia, Zhen
    Zhu, Yuqing
    Wang, Lei
    Zhang, Lixin
    2015 IEEE INTERNATIONAL SYMPOSIUM ON WORKLOAD CHARACTERIZATION (IISWC), 2015, : 114 - 115
  • [24] Intelligent colocation of HPC workloads
    Zacarias, Felippe Vieira
    Petrucci, Vinicius
    Nishtala, Rajiv
    Carpenter, Paul
    Mosse, Daniel
    JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 2021, 151 : 125 - 137
  • [25] Performance modeling of emerging HPC architectures
    Bhatia, Nikhil
    Alam, Sadaf R.
    Vetter, Jeffrey S.
    PROCEEDINGS OF THE HPCMP USERS GROUP CONFERENCE 2006, 2006, : 367 - 373
  • [26] Performance characterization of containerization for HPC workloads on InfiniBand clusters: an empirical study
    Peini Liu
    Jordi Guitart
    Cluster Computing, 2022, 25 : 847 - 868
  • [27] ThunderX2 Performance and Energy-Efficiency for HPC Workloads
    Calore, Enrico
    Gabbana, Alessandro
    Schifano, Sebastiano Fabio
    Tripiccione, Raffaele
    COMPUTATION, 2020, 8 (01)
  • [28] Performance characterization of containerization for HPC workloads on InfiniBand clusters: an empirical study
    Liu, Peini
    Guitart, Jordi
    CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS, 2022, 25 (02): : 847 - 868
  • [29] The Morality of Online War; the Fates of Data Analytics, HPC
    Arquilla, John
    Reed, Daniel A.
    COMMUNICATIONS OF THE ACM, 2015, 58 (10) : 12 - 13
  • [30] Editorial: Big scientific data analytics on HPC and cloud
    Wang, Jianwu
    Yin, Junqi
    Nguyen, Mai H.
    Wang, Jingbo
    Xu, Weijia
    FRONTIERS IN BIG DATA, 2024, 7