An Empirical Roofline Methodology for Quantitatively Assessing Performance Portability

被引:36
|
作者
Yang, Charlene [1 ]
Gayatri, Rahulkumar [1 ]
Kurth, Thorsten [1 ]
Basu, Protonu [2 ]
Ronaghi, Zahra [1 ]
Adetokunbo, Adedoyin [3 ]
Friesen, Brian [1 ]
Cook, Brandon [1 ]
Doerfler, Douglas [1 ]
Oliker, Leonid [2 ]
Deslippe, Jack [1 ]
Williams, Samuel [2 ]
机构
[1] Lawrence Berkeley Natl Lab, Natl Energy Res Sci Comp Ctr, Berkeley, CA 94720 USA
[2] Lawrence Berkeley Natl Lab, Computat Res Div, Berkeley, CA USA
[3] Los Alamos Natl Lab, Los Alamos, NM USA
来源
PROCEEDINGS OF 2018 IEEE/ACM INTERNATIONAL WORKSHOP ON PERFORMANCE, PORTABILITY AND PRODUCTIVITY IN HPC (P3HPC 2018) | 2018年
关键词
performance portability; performance model; Roofline; KNL; GPU; performance counters; MODEL;
D O I
10.1109/P3HPC.2018.00005
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
System and node architectures continue to diversify to better balance on-node computation, memory capacity, memory bandwidth, interconnect bandwidth, power, and cost for specific computational workloads. For many application developers, achieving performance portability (effectively exploiting the capabilities of multiple architectures) is a desired goal. Unfortunately, dramatically different per-node performance coupled with differences in machine balance can lead to developers being unable to determine whether they have attained performance portability or simply written portable code. The Roofline model provides a means of quantitatively assessing how well a given application makes use of a target platform's computational capabilities. In this paper, we extend the Roofline model so that it 1) empirically captures a more realistic set of performance bounds for CPUs and GPUs, 2) factors in the true cost of different floating-point instructions when counting FLOPs, 3) incorporates the effects of different memory access patterns, and 4) with appropriate pairing of code performance and Roofline ceiling, facilitates the performance portability analysis.
引用
收藏
页码:14 / 23
页数:10
相关论文
共 50 条
  • [41] An Empirical Roofline Model for Extreme-Scale I/O Workload Analysis
    Zhu, Zhaobin
    Bartelheimer, Niklas
    Neuwirth, Sarah
    2023 IEEE INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM WORKSHOPS, IPDPSW, 2023, : 622 - 627
  • [42] Developing a methodology for assessing virtual teams' performance perception
    Gustavo Siqueira Ferreira, Pedro
    Pinheiro de Lima, Edson
    Gouvea da Costa, Sergio E.
    INTERNATIONAL JOURNAL OF PRODUCTIVITY AND PERFORMANCE MANAGEMENT, 2012, 61 (07) : 710 - 729
  • [43] Assessing the performance of a consultant anaesthetist by control chart methodology
    Runcie, C. J.
    ANAESTHESIA, 2009, 64 (03) : 293 - 296
  • [44] A NEW METHODOLOGY FOR ASSESSING THE PERFORMANCE OF ROAD SURFACE MARKINGS
    Asdrubali, Francesco
    Buratti, Cinzia
    Moretti, Elisa
    D'Alessandro, Francesco
    Schiavoni, Samuele
    ROAD AND RAIL INFRASTRUCTURE II, 2012, : 169 - 175
  • [46] Roofline Performance Analysis of DNN Architectures on CPU and GPU Systems
    Prashanth, H. C.
    Rao, Madhav
    2024 25TH INTERNATIONAL SYMPOSIUM ON QUALITY ELECTRONIC DESIGN, ISQED 2024, 2024,
  • [47] A general framework for quantitatively assessing ecological stochasticity
    Ning, Daliang
    Deng, Ye
    Tiedje, James M.
    Zhou, Jizhong
    PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2019, 116 (34) : 16892 - 16898
  • [48] Performance analysis of deep learning workloads using roofline trajectories
    M. Haseeb Javed
    Khaled Z. Ibrahim
    Xiaoyi Lu
    CCF Transactions on High Performance Computing, 2019, 1 : 224 - 239
  • [49] A lightweight approach to performance portability with targetDP
    Gray, Alan
    Stratford, Kevin
    INTERNATIONAL JOURNAL OF HIGH PERFORMANCE COMPUTING APPLICATIONS, 2018, 32 (02): : 288 - 301
  • [50] Toward a Better Performance Portability Metric
    Marowka, Ami
    2021 29TH EUROMICRO INTERNATIONAL CONFERENCE ON PARALLEL, DISTRIBUTED AND NETWORK-BASED PROCESSING (PDP 2021), 2021, : 181 - 184