An Empirical Roofline Methodology for Quantitatively Assessing Performance Portability

被引:36
|
作者
Yang, Charlene [1 ]
Gayatri, Rahulkumar [1 ]
Kurth, Thorsten [1 ]
Basu, Protonu [2 ]
Ronaghi, Zahra [1 ]
Adetokunbo, Adedoyin [3 ]
Friesen, Brian [1 ]
Cook, Brandon [1 ]
Doerfler, Douglas [1 ]
Oliker, Leonid [2 ]
Deslippe, Jack [1 ]
Williams, Samuel [2 ]
机构
[1] Lawrence Berkeley Natl Lab, Natl Energy Res Sci Comp Ctr, Berkeley, CA 94720 USA
[2] Lawrence Berkeley Natl Lab, Computat Res Div, Berkeley, CA USA
[3] Los Alamos Natl Lab, Los Alamos, NM USA
关键词
performance portability; performance model; Roofline; KNL; GPU; performance counters; MODEL;
D O I
10.1109/P3HPC.2018.00005
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
System and node architectures continue to diversify to better balance on-node computation, memory capacity, memory bandwidth, interconnect bandwidth, power, and cost for specific computational workloads. For many application developers, achieving performance portability (effectively exploiting the capabilities of multiple architectures) is a desired goal. Unfortunately, dramatically different per-node performance coupled with differences in machine balance can lead to developers being unable to determine whether they have attained performance portability or simply written portable code. The Roofline model provides a means of quantitatively assessing how well a given application makes use of a target platform's computational capabilities. In this paper, we extend the Roofline model so that it 1) empirically captures a more realistic set of performance bounds for CPUs and GPUs, 2) factors in the true cost of different floating-point instructions when counting FLOPs, 3) incorporates the effects of different memory access patterns, and 4) with appropriate pairing of code performance and Roofline ceiling, facilitates the performance portability analysis.
引用
收藏
页码:14 / 23
页数:10
相关论文
共 50 条
  • [1] Evaluating Performance Portability of OpenMP for SNAP on NVIDIA, Intel, and AMD GPUs Using the Roofline Methodology
    Mehta, Neil A.
    Gayatri, Rahulkumar
    Ghadar, Yasaman
    Knight, Christopher
    Deslippe, Jack
    ACCELERATOR PROGRAMMING USING DIRECTIVES, WACCPD 2020, 2021, 12655 : 3 - 24
  • [2] A methodology for quantitatively assessing vehicular rutting on terrains
    Jones, R
    Horner, D
    Sullivan, P
    Ahlvin, R
    JOURNAL OF TERRAMECHANICS, 2005, 42 (3-4) : 245 - 257
  • [3] Performance assessment of FPGAs as HPC accelerators using the FPGA Empirical Roofline
    Calore, Enrico
    Schifano, Sebastiano Fabio
    2021 31ST INTERNATIONAL CONFERENCE ON FIELD-PROGRAMMABLE LOGIC AND APPLICATIONS (FPL 2021), 2021, : 83 - 90
  • [4] Portability of Predictive Academic Performance Models: An Empirical Sensitivity Analysis
    Arroyo-Barriguete, Jose Luis
    Carabias-Lopez, Susana
    Curto-Gonzalez, Tomas
    Hernandez, Adolfo
    MATHEMATICS, 2021, 9 (08)
  • [5] Methodology for quantitatively assessing the energy security of Malaysia and other southeast Asian countries
    Sharifuddin, Shahnaz
    ENERGY POLICY, 2014, 65 : 574 - 582
  • [6] Assessing ageing quantitatively
    Wolfrum, J
    Scherer, U
    Eibl, S
    Fürst, W
    KUNSTSTOFFE-PLAST EUROPE, 2005, 95 (01): : 94 - 96
  • [7] Assessing the performance portability of modern parallel programming models using TeaLeaf
    Martineau, Matthew
    McIntosh-Smith, Simon
    Gaudin, Wayne
    CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2017, 29 (15):
  • [8] Roofline analysis with Cray performance analysis tools (CrayPat) and roofline-based performance projections for a future architecture
    Kwack, JaeHyuk
    Arnold, Galen
    Mendes, Celso
    Bauer, Gregory H.
    CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2019, 31 (16):
  • [9] Portability efficiency approach for calculating performance portability
    Marowka, Ami
    FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2025, 170
  • [10] Roofline-based Data Migration Methodology for Hybrid Memories
    Lee, Jongmin
    Lee, Kwangho
    Kim, Mucheol
    Park, Geunchul
    Park, Chan Yeol
    JOURNAL OF INTERNET TECHNOLOGY, 2020, 21 (03): : 849 - 859