Unified Schemes for Directive-Based GPU Offloading

被引：0

作者：

Miki, Yohei ^{[1
]}

Hanawa, Toshihiro ^{[1
]}

机构：

[1] Univ Tokyo, Informat Technol Ctr, Chiba 2770882, Japan

来源：

IEEE ACCESS | 2024年 / 12卷

基金：

日本学术振兴会;

关键词：

Graphics processing units; Codes; Kernel; Costs; Multicore processing; Switches; Supercomputers; Programming; Libraries; User interfaces; Directive; GPU; OpenACC; OpenMP target; preprocessor macro; vendor lock-in;

D O I：

10.1109/ACCESS.2024.3509380

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

GPU is the dominant accelerator device due to its high performance and energy efficiency. Directive-based GPU offloading using OpenACC or OpenMP target is a convenient way to port existing codes originally developed for multicore CPUs. Although OpenACC and OpenMP target provide similar features, both methods have pros and cons. OpenACC has better functions and an abundance of documents, but it is virtually for NVIDIA GPUs. OpenMP target supports NVIDIA/AMD/Intel GPUs but has fewer functions than OpenACC. Here, we have developed a header-only library, Solomon (Simple Off-LOading Macros Orchestrating multiple Notations), to unify the interface for GPU offloading with the support of both OpenACC and OpenMP target. Solomon provides three types of notations to reduce users' implementation and learning costs: intuitive notation for beginners and OpenACC/OpenMP-like notations for experienced developers. This manuscript denotes Solomon's implementation and usage and demonstrates the GPU-offloading in N-body simulation and the three-dimensional diffusion equation. The library and sample codes are provided as open-source software and publicly and freely available at https://github.com/ymikirepo/solomon.

引用

页码：181644 / 181665

页数：22

共 50 条

[31] A directive-based MPI code generator for Linux PC clusters
Yang, Chao-Tung
Lai, Kuan-Chou
JOURNAL OF SUPERCOMPUTING, 2009, 50 (02): : 177 - 207
[32] A directive-based MPI code generator for Linux PC clusters
Chao-Tung Yang
Kuan-Chou Lai
The Journal of Supercomputing, 2009, 50 : 177 - 207
[33] ATF: A generic directive-based auto-tuning framework
Rasch, Ari
Gorlatch, Sergei
CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2019, 31 (05):
[34] NAS Parallel Benchmarks for GPGPUs Using a Directive-Based Programming Model
Xu, Rengan
Tian, Xiaonan
Chandrasekaran, Sunita
Yan, Yonghong
Chapman, Barbara
LANGUAGES AND COMPILERS FOR PARALLEL COMPUTING (LCPC 2014), 2015, 8967 : 67 - 81
[35] Mapping-Free GPU Offloading in OpenMP Using Unified Memory
Hong, Jia-Sian
You, Yi-Ping
PROCEEDINGS OF THE 52ND INTERNATIONAL CONFERENCE ON PARALLEL PROCESSING WORKSHOPS PROCEEDINGS, ICPP-W 2023, 2023, : 104 - 111
[36] A Directive-based Data Layout Abstraction for Performance Portability of OpenACC Applications
Hoshino, Tetsuya
Maruyama, Naoya
Matsuoka, Satoshi
PROCEEDINGS OF 2016 IEEE 18TH INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING AND COMMUNICATIONS; IEEE 14TH INTERNATIONAL CONFERENCE ON SMART CITY; IEEE 2ND INTERNATIONAL CONFERENCE ON DATA SCIENCE AND SYSTEMS (HPCC/SMARTCITY/DSS), 2016, : 1147 - 1154
[37] Dealing with Portability and Performance on Heterogeneous Systems with Directive-based Programming Approaches
Bodin, F.
2012 SC COMPANION: HIGH PERFORMANCE COMPUTING, NETWORKING, STORAGE AND ANALYSIS (SCC), 2012, : 2001 - 2064
[38] Compiling a High-Level Directive-Based Programming Model for GPGPUs
Tian, Xiaonan
Xu, Rengan
Yan, Yonghong
Yun, Zhifeng
Chandrasekaran, Sunita
Chapman, Barbara
LANGUAGES AND COMPILERS FOR PARALLEL COMPUTING, LCPC 2013, 2014, 8664 : 105 - 120
[39] ZEN: A directive-based language for automatic experiment management of distributed and parallel programs
Prodan, R
Fahringer, T
2002 INTERNATIONAL CONFERENCE ON PARALLEL PROCESSING, PROCEEDING, 2002, : 93 - 100
[40] Evaluation of Directive-Based Programming Models for Stencil Computation on Current GPGPU Architectures
Shan, Baodi
Araya-Polo, Mauricio
Chapman, Barbara
ADVANCING OPENMP FOR FUTURE ACCELERATORS, IWOMP 2024, 2024, 15195 : 126 - 140

← 1 2 3 4 5 →