Evaluating Performance Portability of OpenMP for SNAP on NVIDIA, Intel, and AMD GPUs Using the Roofline Methodology

Cited by: 1
Authors
Mehta, Neil A. [1]
Gayatri, Rahulkumar [1]
Ghadar, Yasaman [2]
Knight, Christopher [2]
Deslippe, Jack [1]
Affiliations
[1] Lawrence Berkeley Natl Lab, NERSC, Berkeley, CA 94720 USA
[2] Argonne Natl Lab, Lemont, IL USA
Keywords
Roofline analysis; Performance portability; SNAP; MODEL
DOI
10.1007/978-3-030-74224-9_1
Chinese Library Classification (CLC) number
TP31 [Computer Software]
Discipline classification code
081202; 0835
Abstract
In this paper, we show that an OpenMP 4.5-based implementation of TestSNAP, a proxy app for the Spectral Neighbor Analysis Potential (SNAP) in LAMMPS, can be ported across NVIDIA, Intel, and AMD GPUs. Roofline analysis is employed to assess the performance of TestSNAP on each architecture. The main contributions of this paper are twofold: 1) to present OpenMP as a viable option for application portability across multiple GPU architectures, and 2) to provide a methodology based on roofline analysis for determining the performance portability of OpenMP implementations on the target architectures. The GPUs used for this work are the Intel Gen9, AMD Radeon Instinct MI60, and NVIDIA Volta V100.
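For readers unfamiliar with the roofline model named in the abstract, the following is a minimal sketch of the standard roofline bound: attainable performance is the lesser of the compute peak and arithmetic intensity times memory bandwidth. It is not the authors' methodology or code. The function names and the device numbers are hypothetical, chosen only for illustration, and do not describe any of the GPUs listed above.

```python
# Minimal roofline sketch. All names and numbers here are illustrative
# assumptions, not values or code from the paper.

def attainable_gflops(arithmetic_intensity, peak_gflops, peak_bandwidth_gbs):
    """Roofline ceiling in GFLOP/s.

    arithmetic_intensity: FLOPs per byte moved to/from memory.
    peak_gflops: compute peak of the device in GFLOP/s.
    peak_bandwidth_gbs: memory bandwidth peak in GB/s.
    """
    return min(peak_gflops, arithmetic_intensity * peak_bandwidth_gbs)


def fraction_of_roofline(measured_gflops, arithmetic_intensity,
                         peak_gflops, peak_bandwidth_gbs):
    """Measured performance as a fraction of the roofline ceiling."""
    ceiling = attainable_gflops(arithmetic_intensity, peak_gflops,
                                peak_bandwidth_gbs)
    return measured_gflops / ceiling


if __name__ == "__main__":
    # Hypothetical device: 1000 GFLOP/s compute peak, 100 GB/s bandwidth.
    PEAK_GFLOPS = 1000.0
    PEAK_BANDWIDTH_GBS = 100.0

    for ai in (0.5, 10.0, 50.0):
        ceiling = attainable_gflops(ai, PEAK_GFLOPS, PEAK_BANDWIDTH_GBS)
        print(f"AI = {ai:5.1f} FLOP/byte -> ceiling = {ceiling:7.1f} GFLOP/s")
```

A kernel whose arithmetic intensity falls below the ridge point (compute peak divided by bandwidth peak, 10 FLOP/byte for the hypothetical device above) is bandwidth-bound; above it, the kernel is compute-bound. Comparing the fraction of the ceiling achieved on each device is one way to put results from different architectures on a common scale.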
Pages: 3-24
Number of pages: 22