Performance Analysis of Emerging Data Analytics and HPC Workloads

Authors
Daley, Christopher S. [1]
Dosanjh, Sudip [1]
Prabhat [1]
Wright, Nicholas J. [1]
Affiliations
[1] Lawrence Berkeley Natl Lab, Berkeley, CA 94720 USA
Keywords
Workload characteristics; data analytics; big data; high performance computing; SEXTRACTOR
DOI
10.1145/3149393.3149400
Chinese Library Classification (CLC) number
TP3 [Computing technology, computer technology]
Discipline classification code
0812
Abstract
Supercomputers are increasingly being used to run a data analytics workload in addition to a traditional simulation science workload. This mixed workload must be rigorously characterized to ensure that appropriately balanced machines are deployed. In this paper we analyze a suite of applications representing the simulation science and data workload at the NERSC supercomputing center. We show how time is spent in application compute, library compute, communication and I/O, and present application performance on both the Intel Xeon and Intel Xeon-Phi partitions of the Cori supercomputer. We find commonality in the libraries used, I/O motifs and methods of parallelism, and obtain similar node-to-node performance for the base application configurations. We demonstrate that features of the Intel Xeon-Phi node architecture and a Burst Buffer can improve application performance, providing evidence that an exascale-era energy-efficient platform can support a mixed workload.
Pages: 43-48
Number of pages: 6