Assessing the Impact of Compiler Optimizations on GPUs Reliability

Cited by: 2
Authors
Dos Santos, Fernando Fernandes [1, 4]
Carro, Luigi [2]
Vella, Flavio [3]
Rech, Paolo [3]
Affiliations
[1] Univ Rennes, INRIA, Rennes, France
[2] Univ Fed Rio Grande do Sul, Inst Informat, Ave Bento Gonçalves 9500, Campus Vale, Bloco 4, Porto Alegre, RS, Brazil
[3] Univ Trento, Via Sommarive 9, I-38123 Povo, TN, Italy
[4] Univ Rennes, INRIA Ctr Rennes, Campus Beaulieu, 263 Ave Gen Leclerc, F-35042 Rennes, France
Keywords
Graphics processing units; reliability; neutron-induced errors; error rate; fault injection; errors
DOI
10.1145/3638249
Chinese Library Classification (CLC) number
TP3 [Computing technology, computer technology]
Discipline classification code
0812
Abstract
Graphics Processing Unit (GPU) compilers have evolved to support general-purpose programming languages for multiple architectures. The NVIDIA CUDA Compiler (NVCC) has many compilation levels before generating the machine code and applies complex optimizations to improve performance. These optimizations modify how the software is mapped onto the underlying hardware; thus, as we show in this article, they can also affect GPU reliability. We evaluate the effects on the GPU error rate of the optimization flags applied at the NVCC Parallel Thread Execution (PTX) compiling phase by analyzing two NVIDIA GPU architectures (Kepler and Volta) and two compiler versions (NVCC 10.2 and 11.3). We compare and combine fault propagation analysis based on software fault injection, hardware utilization distribution obtained with application-level profiling, and the radiation-induced error rate of machine instructions measured with beam experiments. We consider eight different workloads and 144 combinations of compilation flags, and we show that optimizations can change the GPU error rate by up to an order of magnitude. Additionally, through accelerated neutron beam experiments on an NVIDIA Kepler GPU, we show that the error rate of the unoptimized GEMM (-O0 flag) is lower than that of the optimized GEMM (-O3 flag). When performance is evaluated together with the error rate, we show that the most optimized versions (-O1 and -O3) always produce a higher amount of correct data than the unoptimized code (-O0).
Pages: 22