Unified Schemes for Directive-Based GPU Offloading

被引:0
|
作者
Miki, Yohei [1 ]
Hanawa, Toshihiro [1 ]
机构
[1] Univ Tokyo, Informat Technol Ctr, Chiba 2770882, Japan
来源
IEEE ACCESS | 2024年 / 12卷
基金
日本学术振兴会;
关键词
Graphics processing units; Codes; Kernel; Costs; Multicore processing; Switches; Supercomputers; Programming; Libraries; User interfaces; Directive; GPU; OpenACC; OpenMP target; preprocessor macro; vendor lock-in;
D O I
10.1109/ACCESS.2024.3509380
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
GPU is the dominant accelerator device due to its high performance and energy efficiency. Directive-based GPU offloading using OpenACC or OpenMP target is a convenient way to port existing codes originally developed for multicore CPUs. Although OpenACC and OpenMP target provide similar features, both methods have pros and cons. OpenACC has better functions and an abundance of documents, but it is virtually for NVIDIA GPUs. OpenMP target supports NVIDIA/AMD/Intel GPUs but has fewer functions than OpenACC. Here, we have developed a header-only library, Solomon (Simple Off-LOading Macros Orchestrating multiple Notations), to unify the interface for GPU offloading with the support of both OpenACC and OpenMP target. Solomon provides three types of notations to reduce users' implementation and learning costs: intuitive notation for beginners and OpenACC/OpenMP-like notations for experienced developers. This manuscript denotes Solomon's implementation and usage and demonstrates the GPU-offloading in N-body simulation and the three-dimensional diffusion equation. The library and sample codes are provided as open-source software and publicly and freely available at https://github.com/ymikirepo/solomon.
引用
收藏
页码:181644 / 181665
页数:22
相关论文
共 50 条
  • [31] A directive-based MPI code generator for Linux PC clusters
    Yang, Chao-Tung
    Lai, Kuan-Chou
    JOURNAL OF SUPERCOMPUTING, 2009, 50 (02): : 177 - 207
  • [32] A directive-based MPI code generator for Linux PC clusters
    Chao-Tung Yang
    Kuan-Chou Lai
    The Journal of Supercomputing, 2009, 50 : 177 - 207
  • [33] ATF: A generic directive-based auto-tuning framework
    Rasch, Ari
    Gorlatch, Sergei
    CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2019, 31 (05):
  • [34] NAS Parallel Benchmarks for GPGPUs Using a Directive-Based Programming Model
    Xu, Rengan
    Tian, Xiaonan
    Chandrasekaran, Sunita
    Yan, Yonghong
    Chapman, Barbara
    LANGUAGES AND COMPILERS FOR PARALLEL COMPUTING (LCPC 2014), 2015, 8967 : 67 - 81
  • [35] Mapping-Free GPU Offloading in OpenMP Using Unified Memory
    Hong, Jia-Sian
    You, Yi-Ping
    PROCEEDINGS OF THE 52ND INTERNATIONAL CONFERENCE ON PARALLEL PROCESSING WORKSHOPS PROCEEDINGS, ICPP-W 2023, 2023, : 104 - 111
  • [36] A Directive-based Data Layout Abstraction for Performance Portability of OpenACC Applications
    Hoshino, Tetsuya
    Maruyama, Naoya
    Matsuoka, Satoshi
    PROCEEDINGS OF 2016 IEEE 18TH INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING AND COMMUNICATIONS; IEEE 14TH INTERNATIONAL CONFERENCE ON SMART CITY; IEEE 2ND INTERNATIONAL CONFERENCE ON DATA SCIENCE AND SYSTEMS (HPCC/SMARTCITY/DSS), 2016, : 1147 - 1154
  • [37] Dealing with Portability and Performance on Heterogeneous Systems with Directive-based Programming Approaches
    Bodin, F.
    2012 SC COMPANION: HIGH PERFORMANCE COMPUTING, NETWORKING, STORAGE AND ANALYSIS (SCC), 2012, : 2001 - 2064
  • [38] Compiling a High-Level Directive-Based Programming Model for GPGPUs
    Tian, Xiaonan
    Xu, Rengan
    Yan, Yonghong
    Yun, Zhifeng
    Chandrasekaran, Sunita
    Chapman, Barbara
    LANGUAGES AND COMPILERS FOR PARALLEL COMPUTING, LCPC 2013, 2014, 8664 : 105 - 120
  • [39] ZEN: A directive-based language for automatic experiment management of distributed and parallel programs
    Prodan, R
    Fahringer, T
    2002 INTERNATIONAL CONFERENCE ON PARALLEL PROCESSING, PROCEEDING, 2002, : 93 - 100
  • [40] Evaluation of Directive-Based Programming Models for Stencil Computation on Current GPGPU Architectures
    Shan, Baodi
    Araya-Polo, Mauricio
    Chapman, Barbara
    ADVANCING OPENMP FOR FUTURE ACCELERATORS, IWOMP 2024, 2024, 15195 : 126 - 140