Energy-Aware Tile Size Selection for Affine Programs on GPUs

被引:0
|
作者
Jayaweera, Malith [1 ]
Kong, Martin [2 ]
Wang, Yanzhi [1 ]
Kaeli, David [1 ]
机构
[1] Northeastern Univ, Dept Elect & Comp Engn, Boston, MA USA
[2] Ohio State Univ, Dept Comp Sci & Engn, Columbus, OH USA
基金
美国国家科学基金会;
关键词
loop tiling; energy optimization; affine transformations; GPUs; POWER; PERFORMANCE; EFFICIENCY; ALGORITHM; MODEL;
D O I
暂无
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Loop tiling is a high-order transformation used to increase data locality and performance. While previous work has considered its application to several domains and architectures, its potential impact on energy efficiency has been largely ignored. In this work, we present an Energy-Aware Tile Size Selection Scheme (EATSS) for affine programs targeting GPUs. We automatically derive non-linear integer formulations for affine programs and use the Z3 solver to find effective tile sizes that meet architectural resource constraints, while maximizing performance and minimizing energy consumption. Our approach builds on the insight that reducing the liveness of in-cache data, together with exploiting automatic power scaling, can lead to substantial gains in performance and energy efficiency. We evaluate EATSS on NVIDIA Xavier and GA100 GPUs, and report median performance-per-Watt improvement relative to PPCG on several affine kernels. On Polybench kernels, we achieve 1.5x and 1.2x improvement and obtain up to 6.3x improvement on non-Polybench high-dimensional affine kernels.
引用
收藏
页码:13 / 27
页数:15
相关论文
共 50 条
  • [1] Energy-aware hybrid precision selection framework for mobile GPUs
    Hsiao, Chih-Chieh
    Chu, Slo-Li
    Chen, Chen-Yu
    COMPUTERS & GRAPHICS-UK, 2013, 37 (05): : 431 - 444
  • [2] Tile Size Selection of Affine Programs for GPGPUs using Polyhedral Cross-Compilation
    Abdelaal, Khaled
    Kong, Martin
    PROCEEDINGS OF THE 2021 ACM INTERNATIONAL CONFERENCE ON SUPERCOMPUTING, ICS 2021, 2021, : 13 - 26
  • [3] Energy-aware acceleration on GPUs: Findings on a bioinformatics benchmark
    Perez, J.
    Rodriguez, A.
    Chico, J. F.
    Lopez-Rodriguez, D.
    Ujaldon, M.
    SUSTAINABLE COMPUTING-INFORMATICS & SYSTEMS, 2018, 20 : 88 - 101
  • [4] Multiobjective energy-aware node selection
    Le, Qiang
    Kaplan, Lance M.
    McClellan, James H.
    2006 IEEE AEROSPACE CONFERENCE, VOLS 1-9, 2006, : 2342 - +
  • [5] Energy-aware node selection for localization
    Le, Qiang
    Kaplan, Lance M.
    2008 IEEE AEROSPACE CONFERENCE, VOLS 1-9, 2008, : 2124 - +
  • [6] Model-Driven Tile Size Selection for DOACROSS Loops on GPUs
    Di, Peng
    Xue, Jingling
    EURO-PAR 2011 PARALLEL PROCESSING, PT 2, 2011, 6853 : 401 - 412
  • [7] A Methodology for Efficient Tile Size Selection for Affine Loop Kernels
    Kelefouras, Vasilios
    Djemame, Karim
    Keramidas, Georgios
    Voros, Nikolaos
    INTERNATIONAL JOURNAL OF PARALLEL PROGRAMMING, 2022, 50 (3-4) : 405 - 432
  • [8] A Methodology for Efficient Tile Size Selection for Affine Loop Kernels
    Vasilios Kelefouras
    Karim Djemame
    Georgios Keramidas
    Nikolaos Voros
    International Journal of Parallel Programming, 2022, 50 : 405 - 432
  • [9] A Practical Tile Size Selection Model for Affine Loop Nests
    Narasimhan, Kumudha
    Acharya, Aravind
    Baid, Abhinav
    Bondhugula, Uday
    PROCEEDINGS OF THE 2021 ACM INTERNATIONAL CONFERENCE ON SUPERCOMPUTING, ICS 2021, 2021, : 27 - 39
  • [10] Reformulated Conjugate Gradient for the Energy-Aware Solution of Linear Systems on GPUs
    Aliaga, Jose I.
    Perez, Joaquin
    Quintana-Orti, Enrique S.
    Anzt, Hartwig
    2013 42ND ANNUAL INTERNATIONAL CONFERENCE ON PARALLEL PROCESSING (ICPP), 2013, : 320 - 329