Assessing the Impact of Compiler Optimizations on GPUs Reliability

被引：2

作者：

Dos Santos, Fernando Fernandes ^{[1
,4
]}

Carro, Luigi ^{[2
]}

Vella, Flavio ^{[3
]}

Rech, Paolo ^{[3
]}

机构：

[1] Univ Rennes, INRIA, Rennes, France

[2] Univ Fed Rio Grande do Sul, Inst Informat, Ave Bento Gonccalves 9500,Campus Vale,Bloco 4, Porto Alegre, RS, Brazil

[3] Univ Trento, Via Sommarive 9, I-38123 Povo, TN, Italy

[4] Univ Rennes, INRIA Ctr Rennes, Campus Beaulieu,263 Ave Gen Leclerc, F-35042 Rennes, France

来源：

ACM TRANSACTIONS ON ARCHITECTURE AND CODE OPTIMIZATION | 2024年 / 21卷 / 02期

关键词：

Graphics processing units; reliability; neutron-induced errors; error rate; FAULT INJECTION; ERRORS;

D O I：

10.1145/3638249

中图分类号：

TP3 [计算技术、计算机技术];

学科分类号：

0812 ;

摘要：

Graphics Processing Units (GPUs) compilers have evolved in order to support general-purpose programming languages for multiple architectures. NVIDIA CUDA Compiler (NVCC) has many compilation levels before generating the machine code and applies complex optimizations to improve performance. These optimizations modify how the software is mapped in the underlying hardware; thus, as we show in this article, they can also affect GPU reliability. We evaluate the effects on the GPU error rate of the optimization flags applied at the NVCC Parallel Thread Execution (PTX) compiling phase by analyzing two NVIDIA GPU architectures (Kepler and Volta) and two compiler versions (NVCC 10.2 and 11.3). We compare and combine fault propagation analysis based on software fault injection, hardware utilization distribution obtained with application-level profiling, and machine instructions radiation-induced error rate measured with beam experiments. We consider eight different workloads and 144 combinations of compilation flags, and we show that optimizations can impact the GPUs' error rate of up to an order of magnitude. Additionally, through accelerated neutron beam experiments on a NVIDIA Kepler GPU, we show that the error rate of the unoptimized GEMM (-O0 flag) is lower than the optimized GEMM's (-O3 flag) error rate. When the performance is evaluated together with the error rate, we show that the most optimized versions (-O1 and -O3) always produce a higher amount of correct data than the unoptimized code (-O0).

引用

页数：22

共 50 条

[31] Generating Compiler Optimizations from Proofs
Tate, Ross
Stepp, Michael
Lerner, Sorin
ACM SIGPLAN NOTICES, 2010, 45 (01) : 389 - 402
[32] Advanced Compiler Optimizations for Sparse Computations
J Parallel Distrib Comput, (14):
[33] Influence of compiler optimizations on system power
Kandemir, M
Vijaykrishnan, N
Irwin, MJ
Ye, W
37TH DESIGN AUTOMATION CONFERENCE, PROCEEDINGS 2000, 2000, : 304 - 307
[34] ADVANCED COMPILER OPTIMIZATIONS FOR SPARSE COMPUTATIONS
BIK, AJC
WIJSHOFF, HAG
JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 1995, 31 (01) : 14 - 24
[35] COMP: Compiler Optimizations for Manycore Processors
Song, Linhai
Feng, Min
Ravi, Nishkam
Yang, Yi
Chakradhar, Srimat
2014 47TH ANNUAL IEEE/ACM INTERNATIONAL SYMPOSIUM ON MICROARCHITECTURE (MICRO), 2014, : 659 - 671
[36] COMPILER OPTIMIZATIONS FOR IMPROVING DATA LOCALITY
CARR, S
MCKINLEY, KS
TSENG, CW
SIGPLAN NOTICES, 1994, 29 (11): : 252 - 262
[37] Effect of compiler optimizations on memory energy
Kim, HS
Irwin, MJ
Vijaykrishnan, N
Kandemir, M
2000 IEEE WORKSHOP ON SIGNAL PROCESSING SYSTEMS: DESIGN AND IMPLEMENTATION, 2000, : 663 - 672
[38] COMPILER OPTIMIZATIONS FOR ELIMINATING BARRIER SYNCHRONIZATION
TSENG, CW
SIGPLAN NOTICES, 1995, 30 (08): : 144 - 155
[39] Influence of compiler optimizations on system power
Kandemir, M
Vijaykrishnan, N
Irwin, MJ
Ye, W
IEEE TRANSACTIONS ON VERY LARGE SCALE INTEGRATION (VLSI) SYSTEMS, 2001, 9 (06) : 801 - 804
[40] A compiler framework for speculative analysis and optimizations
Lin, J
Chen, T
Hsu, WC
Ju, RDC
Ngai, TF
Yew, PC
Chan, S
ACM SIGPLAN NOTICES, 2003, 38 (05) : 289 - 299

← 1 2 3 4 5 →